Nonparametric Instrumental Regression with Two-Way Fixed Effects

Enrico De Monte

doi:10.1515/jem-2022-0025

Article Open Access

Nonparametric Instrumental Regression with Two-Way Fixed Effects

Enrico De Monte

Published/Copyright: October 5, 2023

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information

From the journal Journal of Econometric Methods Volume 13 Issue 1

Abstract

This paper proposes a novel estimator for nonparametric instrumental regression while controlling for additive two-way fixed effects. In particular, the Landweber–Fridman regularization, to overcome the ill-posed inverse problem in the nonparametric instrumental regression procedure, is combined with the local-within two-ways fixed effect estimator presented by Lee, Y., D. Mukherjee, and A. Ullah. (2019. “Nonparametric Estimation of the Marginal Effect in Fixed-Effect Panel Data Models.” Journal of Multivariate Analysis 171: 53–67). Compared to other estimators in this context, an appealing feature is its flexible applicability with respect to different panel model specifications, i.e. models comprising either individual, temporal, or two-way fixed effects. The estimator’s performance is tested on simulated data, where a Monte Carlo study reveals good finite sample behaviour. Confidence intervals are provided by applying the wild bootstrap.

Keywords: endogeneity; panel data; unobserved heterogeneity; regularization; kernel regression

JEL Classification: C01; C14; C23; C3; C33

1 Introduction

This paper addresses nonparametric estimation of panel data models with endogeneity in the explanatory variable. In particular, it develops an estimator that combines nonparametric instrumental regression with a local-within two-way fixed effects estimator.

In applied econometrics one of the major difficulties to overcome is endogeneity in the explanatory variables. With panel data, the use of fixed effects (FE) estimators, i.e. within or first-differences, allows to control for some potential sources of endogeneity when a specific part of the error component is related to the explanatory variables. Often, however, this does not suffice, since the remaining error term might still be correlated with the regressors. This is the case, for instance, when dealing with the estimation of market equilibrium models, such as product demand functions used in industry competition models (Amir and Lambson 2003; De Monte and Koebel 2023; Esponda and Pouzo 2019; Hopenhayn 1992). Here, the product demand function describes the quantity-price mechanism of an economy and maps price levels into production levels. Theoretically, production and price levels are inversely related, i.e. higher prices should translate into lower equilibrium production levels, which is also known as the law of demand – one of the most fundamental concepts in economics. Empirically, production levels might not only shift by changing prices but also by macroeconomic shocks common to all units and country/industry specific shocks. Since these shocks are difficult to model and likely to be correlated with prices, they are often taken into account by unobserved FE. Moreover, as the explanatory variable (prices) is jointly determined with the dependent variable (production) by the market equilibrium mechanism, the so-called simultaneity bias occurs when not taken into account by instrumental variable (IV) regression techniques (Wooldridge 2016, Chapter 16). Hence, in such a case both IV and FE regression techniques should be applied to cope with all sources of endogeneity.^[1]

For parametric (especially linear) models this is a common estimation and identification strategy. For example, see Ziliak (1997) who compares different panel data estimators as well as Koebel and Laisney (2016), and De Monte and Koebel (2023) for applications to the estimation of the inverse demand in the context of a Cournot competition model. If the functional form of the parametric model is misspecified, however, the estimator combining IV and FE regression will still be inconsistent and biased. To avoid misspecification issues, nonparametric methods are promising. In this field much work has been done concerning both nonparametric FE and IV regression. For instance, concerning nonparametric FE regression, Su and Ullah (2006) develop a nonparametric profile likelihood estimator with FE, Henderson, Carroll, and Li (2008) present an iterative kernel estimator to account for FE, and the method presented by Qian and Wang (2012) is based on marginal integration and first difference. Also, see Linton and Nielsen (1995) for foundations of the concept of marginal integration, as well as Azomahou, Laisney, and Nguyen Van (2006) presenting an application of that approach, dealing with a nonparametric panel model with additive FE to estimate the environmental Kuznets curve. Most of the mentioned approaches use global within or global first-difference to account for FE, i.e. canceling out the unobserved FE by subtracting either the global mean (i.e. and unweighted average over all individuals) or a lagged period of the variables comprised in the model. In this context, Lee, Mukherjee, and Ullah (2019) show that when applying global within or, equivalently, global first-difference estimation, a bias is introduced that is non-degenerating even for very large T. They suggest a nonparametric kernel estimator using the concept of local-demeaning or local-first-difference to avoid this bias. For a comprehensive survey on panel data models considering both nonparametric and semiparametric frameworks see Parmeter and Racine (2019) and Rodriguez-Poo and Soberon (2017).

Regarding to nonparametric instrumental regression important progress has been achieved, too. Here, the main challenge is to overcome the ill-posed inverse problem to solve for the nonparametric function of interest.^[2] For this purpose, Newey and Powell (2003) present a nonparametric 2-stage least-squares estimator. Hall and Horowitz (2005) use a ridge-type regularization method in combination with kernel regression methods, and Darolles et al. (2011) use the Tikhonov regularization to solve the ill-posed inverse problem. Horowitz (2011) further discusses this kind of estimators by comparing them to their parametric counterparts. Further, Florens, Johannes, and Van Bellegem (2012) present an extension of instrumental regression to the case of partially linear models and Florens, Racine, and Centorrino (2018) consider nonparametric IV regression where the interest lies on the estimation of the derivative of the conditional mean function, using the Landweber–Fridman regularization to handle the ill-posed inverse problem. Centorrino, Feve, and Florens (2017) discuss the implementation as well as the MSE-performance w.r.t. the different regularization methods in that context. An application of nonparametric IV regression is brought by Blundell, Chen, and Kristensen (2007) for the case of the estimation of the Engel curve.^[3]

However, as described above, in some econometric frameworks both IV and FE methods are required. The only study, to the best of my knowledge, that fills that gap in the nonparametric literature is presented by Fève and Florens (2014). They use the Tikhonov regularization, which states the IV-part in the estimation procedure, and rely on first-difference to control for unobserved individual FE.

In this paper I propose a nonparametric instrumental estimator while taking into account two-way FE, which has not yet been developed in the nonparametric literature. Considering jointly individual and temporal FE in an IV setting, the novel estimator can be considered as a useful extension to the one presented by Fève and Florens (2014). More specifically, the estimator combines the Landweber–Fridman regularization approach for the IV-part with the local-within kernel estimator presented by Lee, Mukherjee, and Ullah (2019) (LMU, henceforth) to control for unobserved individual and temporal effects. As mentioned, by the use of local-within transformation, the estimation procedure avoids problems related to the non-degenerating bias described by LMU, occurring when using global first-difference/within estimation. Moreover, an appealing feature for practitioners is the simple implementation of the estimator and its higher flexibility for treating different kind of panel models with endogeneity, i.e. models comprising either individual, temporal, or two-way FE. The estimator is applied on simulated data where a Monte Carlo simulation reveals a good finite sample behaviour and confidence intervals are provided by the application of the wild bootstrap.

The paper is organized as follows: Section 2 discusses related estimation methods, Section 3 presents the novel estimation procedure, Section 4 illustrates the performance of the proposed estimator on simulated data, and Section 5 concludes.

2 Related Estimation Methods

In this Section I will briefly discuss two related estimation methods. First, I will give an insight to the approach presented by Fève and Florens (2014) who, present the only nonparametric estimation method for the conditional mean function that treats endogeneity by using both IV and (one-way) FE. Second, I present the local-within FE estimator presented by Lee, Mukherjee, and Ullah (2019), likewise for the estimation of the conditional mean function, however, where only FE are taken into account, i.e. without an IV component. This second estimator is of particular interest as it will be used in the subsequent section for the newly constructed nonparametric IV estimator with two-way FE.

2.1 Nonparametric IV Regression with One-Way FE

Fève and Florens (2014) consider the a panel model given by

(1) Y i t = φ ( Z i t ) + ξ i + U i t , i = 1 , … , n , t = 1 , … , T

where Y i t ∈ R and Z i t ∈ R p are observable random variables, ξ i ∈ R is an unobserved individual effect, and U _it is an error term. Here, Z _i1, …, Z _iT ∀ i is supposed to be endogenous in that it might be correlated with both ξ _i and U _it.^[4] Estimating ξ _i ∀ i would lead into the well-known incidental parameter problem (Lancaster 2000; Neyman and Scott 1948). Hence, to eliminate the FE they first apply first-difference, i.e. Y _it − Y _i,t−1, which however still does not fully solve the endogeneity issue as E(φ(Z _it) − φ(Z _i,t−1)|U _it − U _i,t−1) ≠ 0. Supposing that there exists a valid set of instruments W ∈ R q , satisfying E(U _it − U _i,t−1|W) = 0, they then proceed by applying the Tikhonov regularization method (Tikhonov and Arsenin 1977) to solve the resulting problem below for φ, which is given by

(2) K φ = r ,

where ( K φ ) ( w ) = E ( φ ( Z i t ) − φ ( Z i , t − 1 ) | W = w ) ∈ L W 2 and r ( w ) = E ( Y i t − Y i , t − 1 | W = w ) ∈ L W 2 .^[5] Here, K is a conditional mean operator projecting functions of Z onto the space of W, i.e. K : L Z 2 → L W 2 . Hence, (Kφ)(w) can be expressed by

(3) ( K φ ) ( w ) = ∫ φ ( z ) f Z t , W ( z , w ) − f Z t − 1 , W ( z , w ) f W ( w ) d z ,

which is a Fredholm equation of the first kind (Fredholm 1903).

Consider again equation (2), by iterating the projection onto the space of the endogenous variable Z, one obtains

(4) K * K φ = K * r ,

where K* is an adjoint conditional expectation operator projecting functions of W onto the space of function of Z, i.e. K * : L W 2 → L Z 2 .

As described by the authors, simply inverting K*K for obtaining an estimate of φ results in an ill-posed inverse problem as the solution of (2) for φ requires the inversion of an integral which does not represent a continuous mapping. That is, while small changes in Kφ might induce small changes in r, this does not hold vice versa.^[6] Therefore, regularization methods are required, where, in this case, the Tikhonov regularization seeks to minimize

(5) ‖ K φ − r ‖ 2 + α ‖ φ ‖ 2

w.r.t. φ, where α > 0 is a regularization parameter to be chosen by the user, and with

(6) ‖ φ ‖ 2 = ∫ φ 2 ( z ) π ( z ) d z .

Fève and Florens (2014) show that the solution to (5) is then given by

(7) φ α = ( α I + K * K ) − 1 K * r .

The intuition behind Tikhonov regularization is to add the constant α to the eigenvalues of K*K which in turn allows for inversion. Finally, the function φ is estimated by replacing the conditional mean objects K, K*, and r by appropriate nonparametric estimates.

As outlined, the here presented method relies on “global first-difference” transformation to eliminate the FE. However, the expansion of the approach to properly deal with a two-way FE model seems to be rather complicated. Hence, by the use of the below described nonparametric estimator for the conditional mean function by taking into account FE, I will develop a nonparametric instrumental estimator that is appealing by its simple implementation and higher flexibility to treat both one-way and two-way FE panel models.

2.2 Nonparametric Regression with One- and Two-Way FE

A method to take potential endogeneity from FE into account was recently presented by LMU. They consider a similar model with regard to (1), however, suppose that only ξ _i (but not U _it) can be correlated with Z _i1, …, Z _iT. To consistently estimate the unknown function φ, they use what they call the local-within kernel estimator, which eliminates the FE locally. The motivation for the local-within kernel estimator lies in the fact, which they prove, that global-demeaning or global first-difference to remove the unobserved individual effect – i.e. by transformation the data as it is practiced in many studies as well as in the estimation procedure in Fève and Florens (2014) – introduces a non-degenerating bias, even for a large time horizon. Instead, removing the individual effect locally, i.e. by local-within or local-first-difference, avoids this bias.

2.2.1 Accounting for One-Way FE

Consider the typical objective function of local kernel regression, characterized by a Taylor expansion around z, given by

(8) Q n T ( β 0 , β 1 , ξ 1 , … , ξ n ) = ∑ i = 1 n ∑ t = 1 T { Y i t − β 0 − β 1 ( Z i t − z ) − ξ i } 2 C h ( Z i t − z ) ,

where C is a kernel function and h the bandwidth parameter.^[7] To circumvent the incidental parameter problem, LMU propose to perform the local-kernel regression based on locally demeaned data, which eliminates the FE. This, however, also removes the intercept, β ₀, yielding the new objective function based on transformed data, given by

(9) Q n T ( β 1 ) = ∑ i = 1 n ∑ t = 1 T Y i t * ( z ) − β 1 Z i t * ( z ) 2 C h ( Z i t − z ) ,

where

(10) Y i t * ( z ) = Y i t − ∑ s = 1 T Y i s w i s ( z ) ,

with the weight w i s ( z ) = C h ( Z i s − z ) / ∑ r = 1 T C h ( Z i r − z ) and the same transformation for Z i t * ( z ) .^[8] Two properties of the weight are important to mention: (i) w _it(z) ≥ 0 ∀ t and (ii) ∑ t = 1 T w i t ( z ) = 1 . Using the locally-demeaned variables, standard kernel regression methods can be applied, for instance using local-linear kernel regression of Y i t * ( z ) on Z i t * ( z ) . LMU estimate in a first step the gradient of the conditional mean, i.e.^[9]

(11) β 1 ( z ) = ∂ φ ∂ z ( z ) ,

which is obtained as the solution to minimizing (9), given by

(12) β ̂ 1 ( z ) = ∑ i = 1 n ∑ t = 1 T Z i t * ( z ) Z i t * ( z ) C h ( Z i t − z ) − 1 ∑ i = 1 n ∑ t = 1 T Z i t * ( z ) Y i t * ( z ) C h ( Z i t − z ) .

In a second step, the conditional mean function φ is recovered by

(13) φ ̂ ( z ) = 1 n ∑ i = 1 n γ ̂ i ( z ) − γ ̂ i ( 0 )

with

(14) γ ̂ i ( z ) = ∑ t = 1 T Y i t − β ̂ 1 ( z ) ( Z i t − z ) w i t ( z ) .

2.2.2 Accounting for Two-Way FE

The above presented one-way local-within estimator is easily extendable to take into account two-way FE, which is useful if there are not only individual but also time specific effects. In this case, the time specific effect δ _t is added to equation (1), where δ _t is likewise potentially correlated with Z _it. Then, to account for both ξ _i and δ _t, the following transformation is used:

(15) Y i t * * ( z ) = Y i t − ∑ s = 1 T w i s a ( z ) Y i s − ∑ j = 1 n w j t b ( z ) Y j t + ∑ j = 1 n ∑ s = 1 T w j s c ( z ) Y j s ,

with the weights

(16) w i s a ( z ) = C h ( Z i s − z ) ∑ r = 1 T C h ( Z i r − z ) , w j t b ( z ) = C h ( Z j t − z ) ∑ k = 1 n C h ( Z k t − z ) , w j s c ( z ) = C h ( Z j s − z ) ∑ k = 1 n ∑ r = 1 T C h ( Z k r − z ) .

Regressing nonparametrically Y i t * * ( z ) on Z i t * * ( z ) provides again a consistent estimate of the gradient of φ, whereupon φ can be recovered by^[10]

(17) φ ̂ ( z ) = 1 n ∑ i = 1 n γ ̂ i a ( z ) − γ ̂ i a ( 0 ) + 1 T ∑ t = 1 T γ ̂ t b ( z ) − γ ̂ t b ( 0 ) − γ ̂ c ( z ) − γ ̂ c ( 0 ) ,

with

(18) γ ̂ i a ( z ) = ∑ t = 1 T Y i t − β ̂ 1 ( z ) ( Z i t − z ) w i t a ( z ) , γ ̂ t b ( z ) = ∑ i = 1 n Y i t − β ̂ 1 ( z ) ( Z i t − z ) w i t b ( z ) , γ ̂ c ( x ) = ∑ i = 1 n ∑ t = 1 T Y i t − β ̂ 1 ( x ) ( Z i t − z ) w i t c ( z ) .

The estimator defined in (17) will be of particular importance for the next section.

3 Nonparametric IV Regression with Two-Way FE

This section presents a novel estimator that allows to easily conduct nonparametric IV regression by controlling for two-way FE. For many econometric settings, especially for the estimation of equilibrium models, this is required, and hence of particular interest for practitioners. The method states an extension to the one presented by Fève and Florens (2014), who only discuss one-way FE in the context of nonparametric IV regression. For this purpose, the Landweber–Fridman regularization method for nonparametric instrumental regression is combined with the nonparametric FE estimator presented by Lee, Mukherjee, and Ullah (2019).

3.1 Setup of the Estimator

Consider the panel model given by

(19) Y i t = φ ( Z i t ) + ξ i + δ t + U i t , i = 1 , … , n , t = 1 , … , T

with the observable random variables Y i t ∈ R and Z i t ∈ R (for simplicity with only one covariate). The function of interest φ is supposed to belong to the space of square integrable functions of Z, denoted by L z 2 . The endogenous explanatory variable Z _it is potentially correlated with the unobserved individual and time specific effects, ξ i ∈ R and δ t ∈ R , as well as with the error term U _it. If there exists an instrument W ∈ R satisfying E(U _it|W = w) = E(U _it|w) = 0, we can write

(20) E ( Y i t − φ ( Z i t ) − ξ i − δ t | w ) = 0 ,

where, given this characterization, φ(Z _it) is the solution of a Fredholm integral equation of the first kind.^[11] Given the joint density f(y, z, ξ, δ, w) and using the common abuse of notation f for denoting different densities, (20) can be expressed as

(21) ∫ φ ( z ) f ( w , z ) f ( w ) d z + ∫ ξ f ( w , ξ ) f ( w ) d ξ + ∫ δ f ( w , δ ) f ( w ) d δ − ∫ y f ( w , y ) f ( w ) d y = 0 ,

where the two middle integrals cannot be computed since ξ _i and δ _t are unobserved. Further, let K φ ̃ = E ( φ ( Z i t ) + ξ i + δ t | w ) , where K denotes again the conditional expectation operator, projecting functions of Z onto the space of functions of W, i.e. K : L z 2 → L w 2 (Centorrino, Feve, and Florens 2017) and φ ̃ ≡ φ ( Z i t ) + ξ i + δ t , and let r = E(Y _it|w). Then, equation (20) can be expressed by

(22) K φ ̃ − r = 0 .

Similar to what was shown in the context of the estimator presented by Fève and Florens (2014) discussed above, the challenge now is to solve (22) for φ ̃ – which is an ill-posed inverse problem – while recovering φ. To be more precise, the ill-posed inverse problem here again arises since the inversion of the operator K, to solve for φ, is discontinuous, which leads to inconsistent estimates when not regularized. To deal with the ill-posed inverse problem I make use of the Landweber–Fridman regularization (Fridman 1965; Landweber 1951), where I follow Racine (2019, p. 282). As will be shown, the motivation to use the Landweber–Fridman instead of the Tikhonov regularization is that it allows to combine nonparametric instrumental regression with the advantageous features of the local-within LMU estimator to control for the unobserved FE (i.e. to flexibly deal with one- or two-way FE).

Let K* be the adjoint conditional mean operator of K projecting functions of W onto the space of functions of Z _t, i.e. K * : L w 2 → L z 2 . In other words, the adjoint conditional mean operator K* is the reverse projection of K (for more details see for instance Florens, Johannes, and Van Bellegem (2011)).^[12] Consider equation (22), taking the scalar product with respect to K* and multiplying with a constant c yields

c K * K φ ̃ = c K * r ,

where, in contrast to the Tikhonov regularization used in Fève and Florens (2014), the Landweber–Fridman regularization does not aim to invert K*K but avoids its inversion using instead the iterative scheme obtained by equivalently writing

φ ̃ − c K * K φ ̃ = φ ̃ − c K * r ( I − c K * K ) φ ̃ = φ ̃ − c K * r φ ̃ = ( I − c K * K ) φ ̃ + c K * r φ ̃ = φ ̃ + c K * ( r − K φ ̃ ) .

Using the Landweber–Fridman regularization, setting c < 1, φ ̃ can be obtained in an iterative manner, by

(23) φ ̃ k = φ ̃ k − 1 + c K k − 1 * ( r − K k − 1 φ ̃ k − 1 )

and so

(24) φ k ( z ) = φ k − 1 ( z ) + c E E Y i t − φ k − 1 ( Z i t ) − ξ i − δ t | w | z .

In practice, the regularized estimator of φ is obtained by substituting the conditional mean objects r, K, and K* in equation (23) by consistent estimators. To consistently estimate these objects, the unobserved FE, ξ _i and δ _t, need to be controlled for – whose appearance is explicitly shown in the equivalent expression (24) – which, as shown below, will be effectively done by the use of the LMU two-way FE estimator defined in equation (17).^[13] The regularization procedure is then conducted for k = 1 , … , k ̄ iterations until convergence, with k ̄ the total number of iterations, determined by a stopping rule described in the following sub-section.

3.2 Implementation of the Estimation Algorithm

As mentioned above, to compute φ _k(z) various conditional mean objects need to be consistently estimated, i.e. controlling for FE, where the regularization procedure involves to be repeated (i.e. updated) over several iterations, where the last iteration, k ̄ , should yield φ ̂ k ̄ ( z ) ≈ φ ( z ) . To achieve this objective, the following estimation algorithm is employed:

Step 1. Compute the initial guess φ ̂ 0 ( z ) .

To start with the procedure, an initial guess is estimated by considering E(Y _it|z, ξ _i, δ _t) = φ ₀(z) + ξ _i + δ _t, supposing E(ξ _i|z) ≠ 0 and E(δ _t|z) ≠ 0. That is, if we regressed Y _it on Z _it using conventional kernel regression methods we would obtain biased estimates for φ ₀(z). Instead, to account for FE the LMU estimator is applied, which consists in locally demeaning the dependent and explanatory variable by applying equation (15), obtaining Y i t * * ( z ) and Z i t * * ( z ) , and using local-linear kernel regression of Y i t * * ( z ) on Z i t * * ( z ) . In doing so, according to the LMU approach, we first obtain the (pointwise) gradient, i.e. φ ̂ 0 ′ ( z ) , whereupon the corresponding conditional mean function, φ ̂ 0 ( z ) , is recovered by equation (17).

Step 2. Compute E ̂ Y ̃ i t | w , ξ i , δ t with Y ̃ i t ≡ Y i t − φ ̂ 0 ( Z i t ) .

The previous step yielded and unbiased estimate of φ ̂ 0 ( Z i t ) where FE were accounted for. When subtracting φ ̂ 0 ( Z i t ) from Y _it it can be shown that the empirical model to be considered in this step consists in E Y ̃ i t | w , ξ i , δ t = φ Y ̃ | w 0 ( w ) + ξ i + δ t , where φ Y ̃ | w 0 ( w ) is the conditional mean function we aim to consistently estimate.^[14] That is, if the instrument is correlated with the FE, the LMU estimator has to be applied by first obtaining the locally demeaned variables and conduct local-linear kernel regression of Y ̃ i t * * ( w ) on W**(w), which yields an estimate of the gradient, φ ̃ Y ̃ | w ′ 0 ( w ) , and recover subsequently φ ̂ Y ̃ | w 0 ( w ) ≡ E ̂ ( Y ̃ i t | w , ξ i , δ t ) by (17).^[15]

Step 3. Compute E ̂ E ̂ Y ̃ i t | w , ξ i , δ t | z , ξ i , δ t .

Let Y ̃ ̂ i t ≡ E ̂ Y ̃ i t | w , ξ i , δ t , obtained from the previous step. The regression performed in this step builds on the empirical model given by E Y ̃ ̂ | z , ξ i , δ t = φ Y ̃ ̂ | z 0 ( z ) + ξ i + δ t , where φ Y ̃ ̂ | z 0 ( z ) is the conditional mean function we aim to consistently estimate. Since, again, E(ξ _i|z) ≠ 0 and E(δ _t|z) ≠ 0 is assumed, the LMU estimator is applied by computing the respective locally demeaned variables and use local-linear regression of Y ̃ ̂ i t * * ( z ) on Z i t * * ( z ) yielding first an estimate of the gradient φ Y ̃ ̂ | z ′ 0 ( z ) and recover subsequently φ ̂ Y ̃ ̂ | z 0 ( z ) ≡ E ̂ E ̂ Y ̃ i t | w , ξ i , δ t | z , ξ i , δ t by (17).

Step 4. Compute φ ̂ 1 ( z ) = φ ̂ 0 ( z ) + c E ̂ E ̂ Y ̃ i t | w , ξ i , δ t | z , ξ i , δ t .

For k = 1 we now have an estimate φ ̂ 1 ( z ) , while having effectively controlled for unobserved two-way FE.

Step 5. Repeat steps 2–4, replacing φ ̂ 0 ( z ) by φ ̂ 1 ( z ) to compute φ ̂ 2 ( z ) and so on.

Step 6. This is continued until the stopping criterion given by

(25) ∑ i = 1 n ∑ t = 1 T E ̂ Y i t | w , ξ i , δ t − E ̂ φ ̂ k ( Z i t ) | w , ξ i , δ t E ̂ Y i t | w , ξ i , δ t 2

stabilizes throughout the iterations.^[16]

To establish the stopping criterion E(Y _it|w, ξ _i, δ _t) = φ _Y|w(Y|w) + ξ _i + δ _t is estimated by the LMU approach before the algorithm starts. That is, we aim to consistently estimate φ _Y|w(w) under the assumption that potentially E(ξ _i|w) ≠ 0 and E(δ _t|w) ≠ 0. Hence, the LMU estimator is employed by locally regressing in a first step the locally demeaned variables Y i t * * ( w ) on W**(w) (yielding an estimate of the gradient φ Y | w ′ ( w ) ) and, subsequently, φ ̂ Y | w ( w ) ≡ E ̂ ( Y i t | w , ξ i , δ t ) is recovered. The second object in the stopping criterion, E ̂ ( φ ̂ k ( Z i t ) | w ) , needs to be computed and updated in each iteration. Here, the LMU estimator is likewise employed, allowing to control for potential unobserved FE. More precisely, let φ ̂ k , i t ≡ φ ̂ k ( Z i t ) , and regress the locally demeaned variables φ ̂ k , i t * * ( w ) on W i t * * ( w ) and recover the corresponding conditional mean function subsequently.

3.2.1 Number of Iterations

The number of iterations is here determined by the stopping criterion given in (25). More specifically, the estimation procedure continues until this criterion stabilizes throughout the iterations. I specify a tolerance level of 1 %, meaning that if the stopping criterion’s value between two consecutive iterations changes by 1 % or less, the iteration procedure stops. One could set a lower tolerance level to achieve a higher accuracy, which, however, also increases the computational burden. Note that another (but here not applied) possibility for determining the number of iterations, is to conduct a sufficient number of iterations and analyze subsequently at which iteration the stopping criterion has reached its minimum (Centorrino, Feve, and Florens 2017). Further, to ensure convergence of the iterative scheme the constant c requires to be chosen such that 0 < c < 1 (see equation (24)). The higher the value of c, the faster the iterative scheme converges (Centorrino, Feve, and Florens 2017). However, the exact choice of c is not of relevant importance as it is not a regularization/tuning parameter and does therefore not have a direct impact on the regularization of the ill-posed inverse problem (Johannes, Van Bellegem, and Vanhems 2013). For practical purposes Florens, Racine, and Centorrino (2018) fix c = 0.5, which I follow in the paper.

3.2.2 Bandwidth Selection

As described above, in each iteration of the Landweber–Fridman regularization, various nonparametric conditional mean estimates are required. Hereby, the performance of kernel estimators, such as the here applied LMU estimator, heavily rely on the bandwidth selection. For this application I suggest to compute optimal bandwidths by the use of the conditional mean based least squares cross-validation (LSCV) method. See Appendix A for details on the applied LSCV method. Further, Florens, Racine, and Centorrino (2018) show that the use of updated bandwidths, i.e. computing optimal bandwidths in each of the iterations, leads to higher accuracy of the final estimate. This procedure, however, is computationally very time intensive. By this reason, for the estimates presented in this paper, I compute cross-validated bandwidths for the conditional mean objects only once, obtained in the first iteration, and use them for all successive iterations. As will turn out in the next section, the estimator’s performance is not significantly affected by that.

3.2.3 Bootstrap Inference

Deriving the variance of the proposed estimator analytically is complicated given the complexity of the iterative Landweber–Fridman procedure in combination with the LMU estimator. By this reason, to estimate pointwise confidence intervals, I propose using the wild residual block bootstrap, which is frequently applied in nonparametric estimation. For instance, Malikov, Zhao, and Kumbhakar (2020) use the wild bootstrap in the framework of the Landweber–Fridman regularization for providing statistical inference in the context of production function and productivity estimation. Azomahou, Laisney, and Nguyen Van (2006) use the technique in the framework of a nonparametric panel model with FE to estimate the environmental Kuznets curve.^[17] See Appendix B for details on the applied bootstrap method.

3.2.4 Consistency of the Estimator

The convergence properties of the proposed estimator depend upon those of the Landweber–Fridman procedure and those of the LMU estimator. That is, to consistently estimate the function of interest φ, both the iterative scheme of the Landweber–Fridman regularization and the LMU estimator for the estimation of the different conditional mean objects within the single iterations need to converge to the true function by increasing the number of observations. Consider first the LMU estimator. Lee, Mukherjee, and Ullah (2019) showed that their estimator yields consistent estimates for the conditional mean function while controlling for unobserved fixed effects for h → 0 (the bandwidth parameter of the nonparametric local-within kernel regression) and nTh → ∞ as nT → ∞. They also derive the optimal rate of convergence (i.e. the rate by which h should decrease by an increase in nT to optimally balance the trade-off between bias and variance of the estimator). Further, in a setting without FE, Florens, Racine, and Centorrino (2018) show that the iterative Landweber–Fridman scheme converges to the true function φ for N → ∞ (where N envelopes the total number of observations which in the panel data setting is given by nT), given the different conditional mean objects within the iterations are consistently estimated. Hence, as both the Landweder-Fridman procedure and the LMU estimator are proven to be consistent, the combination of both estimators results likewise in a consistent estimator.

4 Simulation, Bootstrap Inference, and Finite Sample Behaviour

This section aims to study the performance and the finite sample behaviour of the proposed estimator. For this purpose, I consider the two-way FE panel model presented in (19), where the unknown conditional mean function φ is a function of the endogenous variable Z _it, correlated with the individual and temporal specific effects, ξ _i and δ _t, as well as with the error U _it. W is a valid instrument satisfying cor(Z _it, W) ≠ 0 and E(U _it|W) = 0. In the following, the data generating process (DGP) is presented to obtain the observables { y i t , z i t , w i t } i , t = 1 n , T . I then apply the estimator on the simulated data and discuss the estimation results graphically. Further down, confidence intervals are provided and a Monte Carlo simulation studies the estimator’s convergence and finite sample behaviour based on different sample sizes.

4.1 Data Generating Process (DGP)

I extend the DGP presented in Darolles et al. (2011) to the case of endogeneity characterized by correlation between the explanatory variable and both the error term and the unobserved two-way FE. More precisely, I generate a panel dataset setting n = 100 and T = 20, yielding nT = 2000 observations. The unobserved two-way FE are generated by^[18]

(26) ξ i ∼ u ( 0,1.5 ) i = 1 , … , n and

(27) δ t ∼ u ( 0,1.7 ) t = 1 , … , T .

Next, an auxiliary explanatory variable z ̃ i t is generated as a function of the FE, given by

(28) z ̃ i t ∼ N ( ξ i + δ t , 1 ) i = 1 , … , n , t = 1 , … , T .

From these draws, the instrumental variable w _it is generated by

(29) w i t = ρ z w z ̃ i t + v w , i t ,

where ρ _zw = 0.2 denotes the correlation coefficient between the explanatory and the instrumental variable, and v w , i t ∼ N ( 0,0.15 ) denotes and error term. Only now the generation of the endogenous explanatory variable is completed by

(30) z i t = z ̃ i t + v z , i t ,

where v z , i t ∼ N ( 0,0.8 ) . Finally, the dependent variable y _it is obtained by

(31) y i t = φ ( z i t ) + ξ i + δ t + u i t ,

where φ ( z i t ) = z i t 2 refers to as the true DGP and u _it = ρ _uz v _z,it + ϵ _it, with ρ _uz = 0.8 and ϵ i t ∼ N ( 0,0.05 ) .

Table 1 presents the covariance matrix to illustrate the introduced endogeneity. As can be seen, the endogenous explanatory variable z _it is correlated with all other important components, i.e. with the instrument, w _it, with the unobserved individual and temporal effects, ξ _i and δ _t, as well as with the error term, u _it. Note that the instrument is also slightly correlated with the individual/temporal effects but uncorrelated with the error term, which is necessary to serve as a valid instrument.

Table 1:

Covariance between observables and unobservables.

	z _it	w _it	ξ _i	δ _t	u _it
z _it	1.00	0.27	0.19	0.25	0.52
w _it		1.00	0.04	0.05	0.00
ξ _i			1.00	0.00	−0.01
δ _t				1.00	0.01
u _it					1.00

I use the statistical software R to generate and treat the data. Random draws were obtained by specifying set.seed(49).

4.2 Estimation Results

Figure 1 illustrates the regularized solution path to obtain the final estimate. The initial guess estimate, i.e. φ ̂ 0 ( z ) , is indicated by the bottom solid blue line. The iterative estimation procedure then approaches the true conditional mean function φ(z), indicated by the dashed red line. The estimation resulting from the last iteration, indicated by the green line, corresponding to φ ̂ k ̄ ( z ) , is then very close to the true curve, i.e. the DGP. For better visibility, Figure 2 only shows the initial guess (blue line) as well as the last estimate of the iteration (green line).

$Figure 1: The regularized solution path starting from the initial guess φ ̂ 0 ( z ) ${\hat{\varphi }}_{0}\left(z\right)$ until reaching the final IV regression estimate φ ̂ k ̄ ( z ) ${\hat{\varphi }}_{\bar{k}}\left(z\right)$ .$

Figure 1:

The regularized solution path starting from the initial guess φ ̂ 0 ( z ) until reaching the final IV regression estimate φ ̂ k ̄ ( z ) .

$Figure 2: Comparison between the initial guess φ ̂ 0 ( z ) ${\hat{\varphi }}_{0}\left(z\right)$ and the final IV regression estimate. φ ̂ k ̄ ( z ) ${\hat{\varphi }}_{\bar{k}}\left(z\right)$ .$

Figure 2:

Comparison between the initial guess φ ̂ 0 ( z ) and the final IV regression estimate. φ ̂ k ̄ ( z ) .

As mentioned above, the number of iterations is determined by the stopping criterion given in (25), i.e. the iterations continue until there is no significant change in the value of this criterion between two successive iterations. Figure 3 shows the evolution of the value of the stopping criterion throughout the iterations. As illustrated, the criterion’s value decreases from the first iteration on until it flattens out. More precisely, after k ̄ = 44 iterations, the value of the stopping criterion only changes by 1 % or less, which is where the iteration procedure stops.

Figure 3:

Evolution of the function value of the stopping criterion.

Finally, to illustrate possible biases I compare different nonparametric estimators, based on the described DGP. An overview of the compared estimators is presented in Table 2.

Table 2:

Description of the compared nonparametric estimators.

Estimator	Description
LL	Local-linear kernel regression (nonparametric OLS)
LMU	Local-within two-way FE estimator (Lee et al. 2019)
L-F	Landweber–Fridman regularization (IV)
L-F/LMU	Landweber–Fridman with local-within two-way FE (Section 3)

In particular, I compare (i) simple local-linear estimation (LL), where no source of endogeneity is taken into account at all; (ii) the local-within FE estimator presented by Lee et al. (2019) (LMU), that is, only taking into account individual and temporal FE; (iii) the Landweber–Fridman procedure (L-F), i.e. only applying nonparametric IV regression without taking into account FE; and (iv) the here presented estimator, the Landweber–Fridman regularization in combination with the local-within estimator (L-F/LMU), i.e. nonparametric IV and FE regression;

Figure 4 shows the results applying the different estimators. It can be seen that all estimators, except the L-F/LMU estimator, lead to biased estimates, as none of the corresponding estimated conditional mean functions hits the true DGP, indicated by the dashed red line. As expected, the L-F/LMU estimator is closely located around the true curve, which suggests a good finite sample behaviour.

Figure 4:

Comparison between nonparametric estimators.

4.3 Bootstrap Inference and Finite Sample Behaviour

To allow for statistical inference of the presented estimator, I apply the wild residual block bootstrap (described in more detail in Appendix B) to estimate confidence intervals (CI). Here, the α/2 × 100 % and (1 − α/2) × 100 % pointwise percentiles are estimated from the empirical distribution of B = 400 bootstrap estimates φ ̂ 1 ( Z i t ) , … , φ ̂ B ( Z i t ) . For α = 0.05 this yields the pointwise 95 % confidence interval of φ ̂ ( Z i t ) . Figure 5 illustrates that the used bootstrap method provides very narrow confidence intervals suggesting statistically reliable estimates of φ ̂ ( Z i t ) , except at the boundaries where the data is scarce, which is typical for nonparametric kernel estimation. While the employed bootstrap method yields convincing results, providing proof of its validity in this context is not trivial and beyond the scope of this paper.

Figure 5:

Pointwise estimates of the regularized conditional mean function along with the bootstrapped 95 % confidence interval (CI), using 400 replications.

Further, to properly investigate the proposed estimator’s finite sample performance, I conduct a Monte Carlo simulation using different sample sizes. More specifically, building on the Monte Carlo simulation shown by Lee, Mukherjee, and Ullah (2019), I use four sets of samples of n ∈ {50, 100} and T ∈ {10, 20}, each generated from the above described DGP. Then, φ(Z _it) from (19) is estimated based on 400 random samples (for each of the four sets) using the estimators L-F/LMU, L-F, LMU, and LL (see Table 2 for a description of the estimators). Further, for each of the estimators, the integrated mean squared errors (IMSE) and the integrated mean absolute errors (IMAE) are computed by averaging the pointwise MSE and MAE over z, where the RIMSE reports the root of IMSE. Typically, Monte Carlo simulations with nonparametric estimators are computationally time intensive, not only because of pointwise estimation, but also because, ideally, optimal bandwidths should be used for each repetition. Beyond that, the nonparametric instrumental estimators, based on the Landweber–Fridman regularization method, need various iterations themselves to yield the final estimate. The available computational power, however, does not allow me to conduct the Monte Carlo simulation using updated cross-validated optimal bandwidths for each of the repetitions. Instead, for each of the compared estimators and each of the four samples, I compute optimal bandwidths only for the first of the 400 repetitions and which will then be used for the remaining repetitions. While this might introduce some inaccuracy at the boundaries of the data, it does not change the quality of the results. This is because optimal bandwidths are only marginally affected by resampling from the same distributions or more generally from the same DGP. The employed Monte Carlo simulation does therefore allow to conjecture on the estimator’s finite sample behaviour.^[19]

Table 3 presents the results. For all datasets, the proposed L-F/LMU estimator, taking endogeneity in the explanatory variable and FE into account, reaches the lowest values both for the RIMSE and the IMAE, indicating the best estimation performance among the applied nonparametric estimators. Considering the RIMSE, it can be seen that increasing the sample size improves the here proposed L-F/LMU estimator’s performance. Especially when increasing the time dimension from T = 10 to T = 20 leads to a significant improvement. The IMAE shows a similar pattern, where the best performance of the L-F/LMU estimator is shown for the largest dataset with n = 100 and T = 20, suggesting convergence and a good finite sample behaviour of the estimator. Generally, the RIMSE is higher compared to the IMSE because the RIMSE squares the errors before averaging, which puts higher weights on large errors that occur particularly at the boundaries. This effect becomes somewhat amplified when not using up-dated (optimal) bandwidths for each of the 400 random samples. Hence, the IMAE might here be preferably considered to evaluate the estimators’ performance.

Table 3:

Monte Carlo simulation: RIMSE and IMAE comparison.

N	T	RIMSE				IMAE
		L-F/LMU	L-F	LMU	LL	L-F/LMU	L-F	LMU	LL
50	10	1.041	1.664	1.501	1.793	0.350	1.606	0.781	1.675
50	20	0.309	1.660	0.894	1.797	0.123	1.604	0.760	1.678
100	10	0.822	1.670	1.258	1.778	0.250	1.609	0.865	1.660
100	20	0.524	1.663	0.915	1.786	0.116	1.608	0.717	1.668

RIMSE is the Root Integrated Mean Squared Error and IMAE is the Integrated Mean Absolute Error of the estimators. The different estimators are applied to estimate φ(Z _it) from equation (19), repeated 400 times for each dataset. See Table 2 for a description of the compared estimators.

Among the other estimators, the LMU estimator shows the best performance, as it grasps a large part of the unobserved heterogeneity. Instead, the L-F estimator and the local-linear estimator do not show any significant improvement when increasing the sample size. It would be interesting to conduct the same simulation based on different DGP’s to confirm and further study the performance of the estimator.

5 Conclusions

Using instrumental regression while taking into account fixed-effects (FE) is frequently applied in the estimation of market equilibrium models where empirically different sources of endogeneity occur, i.e. from simultaneity by the equilibrium mechanism as well as from correlated individual and temporal specific effects. To offer practitioners the possibility to estimate such models without presuming a specific functional form, this paper presents a nonparametric instrumental estimator for the conditional mean function, while taking into account (additive) two-way FE. The novel estimator can be considered as an extension to the one presented in Fève and Florens (2014) and is appealing for applied research with respect to its flexibility in treating different panel models with endogenous variables, that is, controlling either for individual, temporal, or two-way FE. This is achieved by using the Landweber–Fridman regularization method for the nonparametric IV-part combined with the local-within FE estimator presented by Lee, Mukherjee, and Ullah (2019). The proposed estimator is applied on simulated data, where a Monte Carlo simulation shows good finite sample behaviour. Also, confidence intervals are provided by applying the wild residual block bootstrap.

An important improvement of the presented estimator would be an extension for the estimation of the marginal effect (and higher order derivatives) of the conditional mean function, similar to what is presented in Fève and Florens (2014), Florens, Racine, and Centorrino (2018), and Lee, Mukherjee, and Ullah (2019). It seems likewise interesting to generalize the estimator to a dynamic panel model, where the explanatory variables include a lagged dependent variable, which is an often applied approach in panel data econometrics (see Lee (2014) for the nonparametric estimation of a dynamic model with fixed effects and De Monte and Koebel (2023) for an example of a parametric linear dynamic panel model for the estimation of the product demand function).

Corresponding author: Enrico De Monte, ZEW-Leibniz-Centre for European Economic Research, L7 1, 68161 Mannheim, Germany, E-mail: enrico.demonte@zew.de.

Funding source: Chair Gutenberg

Acknowledgments

I would especially like to thank Bertrand Koebel for his precious support and many helpful discussions throughout the whole writing process of the paper. Further, I wish to thank Yoonseok Lee, Aman Ullah, and Philip Heiler for useful discussions and suggestions. I also thank an unknown referee whose comments substantially improved the paper as well as Phu Nguyen-Van, Felix Kiessner, David Happersberger, and Johannes Gerling for their advice and support. Finally, I would like to acknowledge Chair Gutenberg (research fund of the eastern region of France) that supported the project financially.

Appendix A: Bandwidth Selection via Leave-One-Out Cross-Validation

When using the LMU estimator within the estimation procedure described in Section 3.2, optimal bandwidths are computed applying least-squares leave-one-out cross-validation (LSCV). As described in Section 2.2, the estimate of the conditional mean function φ ̂ ( z ) , is obtained upon a first-step estimate of the first gradient of φ ̂ ( z ) , denoted by β ̂ 1 ( z ) . Henderson et al. (2015) point out that the optimal bandwidth for the estimation of the gradient(s), is not necessarily optimal for the estimation of the conditional mean and vice-versa. Therefore, while in Lee, Mukherjee, and Ullah (2019) a gradient based cross-validation procedure is described, I here propose to use a conditional mean based procedure. More precisely, in the case where individual effect are controlled for, the optimal bandwidth for estimating the conditional mean function is obtained by minimizing the LSCV criterion given by

L S C V ( h ) = arg min h 1 n T ∑ i = 1 n ∑ t = 1 T Y i t − φ ̂ − i ( Z i t ; h ) 2 ,

where φ ̂ − i is the leave-one-out estimate obtained based on the locally demeaned variables

Y − i , t * ( Z i t ) = Y i t − ∑ j ≠ i C h ( Z j t − z i t ) Y j t ∑ j ≠ i C h ( X j t − x i t ) , Z − i , t * ( z i t ) = Z i t − ∑ j ≠ i C h ( Z j t − z i t ) Z j t ∑ j ≠ i C h ( Z j t − z i t ) .

Analogously, in the case where φ ̂ is aimed to be estimated in the framework of two-way FE, Y − i , t * and Z i t * simply need to be replaced by the corresponding expressions for Y − i , t * * and Z − i , t * * defined in equation (15).

Appendix B: The Wild Bootstrap

Similar to Malikov, Zhao, and Kumbhakar (2020) and Azomahou, Laisney, and Nguyen Van (2006) I use the wild residual block bootstrap to approximate sampling distributions of the presented nonparametric instrumental estimator with two-way FE. The method takes the panel structure into account and is also appropriate to account for heteroskedasticity and correlation between observations (Azomahou, Laisney, and Nguyen Van 2006; Henderson and Parmeter 2015). Consider the model

Y i t = φ ( Z i t ) + V i t ,

where V _it = ξ _i + δ _t + U _it. The wild bootstrap algorithm is given by the following:

Compute the residuals by v ̂ i t = y i t − φ ̂ ( z i t ) , where φ ̂ ( z i t ) is estimated according to Section 3 (small letters denote realizations of the corresponding random variables in capital letters).
Recenter the residuals by v ̂ i t c = v ̂ i t − v ̂ ̄ i t , where v ̂ ̄ i t is the mean over all (i, t) residuals.
Generate the bootstrap residuals for each observation (i, t) by v i t b = b i v ̂ i t c , where b _i are the bootstrap weights for i = 1, …, n drawn from the two-points mass distribution, given by
b i = 1 + 5 2 , with probability of 5 − 1 2 5 1 − 5 2 , with probability of 5 + 1 2 5 .
Generate the new dependent variable by y i t b = φ ̂ ( z i t ) + v i t b .
Re-estimate φ(z _it) according to Section 3 based on the bootstrap sample y i t b , z i t , w i t i , t = 1 n , T , where w _it is the instrument.
Repeat the steps 3. to 5. B -times yielding the empirical distribution of B bootstrap estimates, φ 1 ( z i t ) , … , φ B ( z i t ) , from which the α/2 × 100 % and (1 − α/2) × 100 % pointwise percentiles, i.e. the confidence intervals, are estimated. For this application I specify B = 400 and α = 0.05.

Remark: To save computational time, I only estimate optimal bandwidths once (those used within the Landweber–Fridman regularization procedure for estimation of the conditional mean function φ(Z _it)) and use the same bandwidths over the B bootstrap repetitions. As optimal bandwidths do only change marginally by resampling in the wild bootstrap procedure, results are, if at all, only little affected by that (see also footnote 19 on a similar issue).

References

Amir, R., and V. E. Lambson. 2003. “Entry, Exit, and Imperfect Competition in the Long Run.” Journal of Economic Theory 110 (1): 191–203. https://doi.org/10.1016/s0022-0531(03)00002-4.Search in Google Scholar

Azomahou, T., F. Laisney, and P. Nguyen Van. 2006. “Economic Development and CO2 Emissions: A Nonparametric Panel Approach.” Journal of Public Economics 90 (6–7): 1347–63. https://doi.org/10.1016/j.jpubeco.2005.09.005.Search in Google Scholar

Blundell, R., X. Chen, and D. Kristensen. 2007. “Semi-nonparametric Iv Estimation of Shape-Invariant Engel Curves.” Econometrica 75 (6): 1613–69. https://doi.org/10.1111/j.1468-0262.2007.00808.x.Search in Google Scholar

Carrasco, M., J.-P. Florens, and E. Renault. 2007. “Linear Inverse Problems in Structural Econometrics Estimation Based on Spectral Decomposition and Regularization.” Handbook of Econometrics 6: 5633–751.10.1016/S1573-4412(07)06077-1Search in Google Scholar

Centorrino, S., F. Feve, and J.-P. Florens. 2017. “Additive Nonparametric Instrumental Regressions: A Guide to Implementation.” Journal of Econometric Methods 6 (1): 20150010, https://doi.org/10.1515/jem-2015-0010.Search in Google Scholar

Chetverikov, D., D. Kim, and D. Wilhelm. 2018. “Nonparametric Instrumental-Variable Estimation.” STATA Journal 18 (4): 937–50. https://doi.org/10.1177/1536867x1801800411.Search in Google Scholar

Darolles, S., Y. Fan, J. P. Florens, and E. Renault. 2011. “Nonparametric Instrumental Regression.” Econometrica 79 (5): 1541–65.10.3982/ECTA6539Search in Google Scholar

De Monte, E., and B. Koebel. 2023. Cournot Equilibrium and Welfare with Heterogeneous Firms. mimeo, Université de Strasbourg, BETA.Search in Google Scholar

D’Haultfoeuille, X. 2011. “On the Completeness Condition in Nonparametric Instrumental Problems.” Econometric Theory 27 (3): 460–71. https://doi.org/10.1017/s0266466610000368.Search in Google Scholar

Esponda, I., and D. Pouzo. 2019. “The Industry Supply Function and the Long-Run Competitive Equilibrium with Heterogeneous Firms.” Journal of Economic Theory 184: 104946. https://doi.org/10.1016/j.jet.2019.104946.Search in Google Scholar

Fève, F., and J. P. Florens. 2014. “Non Parametric Analysis of Panel Data Models with Endogenous Variables.” Journal of Econometrics 181 (2): 151–64. https://doi.org/10.1016/j.jeconom.2014.03.009.Search in Google Scholar

Florens, J. 2003. “Inverse Problems and Structural Econometrics: The Example of Instrumental Variables.” In Advances in Economics and Econometrics: Theory and Applications – Eight World Congress, 284–311. Cambridge University Press.10.1017/CCOL0521818737.009Search in Google Scholar

Florens, J.-P., J. Johannes, and S. Van Bellegem. 2011. “Identification and Estimation by Penalization in Nonparametric Instrumental Regression.” Econometric Theory 27 (3): 472–96. https://doi.org/10.1017/s026646661000037x.Search in Google Scholar

Florens, J.-P., J. Johannes, and S. Van Bellegem. 2012. “Instrumental Regression in Partially Linear Models.” The Econometrics Journal 15 (2): 304–24. https://doi.org/10.1111/j.1368-423x.2011.00358.x.Search in Google Scholar

Florens, J. P., J. S. Racine, and S. Centorrino. 2018. “Nonparametric Instrumental Variable Derivative Estimation.” Journal of Nonparametric Statistics 30 (2): 368–91. https://doi.org/10.1080/10485252.2018.1428745.Search in Google Scholar

Fredholm, E.I. 1903. “Sur Une Classe D’équations Fonctionnelles.” Acta Mathematica 27: 365–90, https://doi.org/10.1007/bf02421317.Search in Google Scholar

Freyberger, J. 2017. “On Completeness and Consistency in Nonparametric Instrumental Variable Models.” Econometrica 85 (5): 1629–44. https://doi.org/10.3982/ecta13304.Search in Google Scholar

Fridman, V. 1965. “A Method of Successive Approximations for Fredholm Integral Equations of the First Kind.” Uspeki mat Nauk 11: 233–4.Search in Google Scholar

Hall, P., and J. L. Horowitz. 2005. “Nonparametric Methods for Inference in the Presence of Instrumental Variables.” Annals of Statistics 33 (6): 2904–29. https://doi.org/10.1214/009053605000000714.Search in Google Scholar

Hayfield, T., and J. S. Racine. 2008. “Nonparametric Econometrics: The Np Package.” Journal of Statistical Software 27 (5): 1–32. https://doi.org/10.18637/jss.v027.i05.Search in Google Scholar

Henderson, D. J., and C. F. Parmeter. 2015. Applied Nonparametric Econometrics. Cambridge University Press.10.1017/CBO9780511845765Search in Google Scholar

Henderson, D. J., R. J. Carroll, and Q. Li. 2008. “Nonparametric Estimation and Testing of Fixed Effects Panel Data Models.” Journal of Econometrics 144 (1): 257–75. https://doi.org/10.1016/j.jeconom.2008.01.005.Search in Google Scholar PubMed PubMed Central

Henderson, D. J., Q. Li, C. F. Parmeter, and S. Yao. 2015. “Gradient-based Smoothing Parameter Selection for Nonparametric Regression Estimation.” Journal of Econometrics 184 (2): 233–41. https://doi.org/10.1016/j.jeconom.2014.09.007.Search in Google Scholar

Hopenhayn, H. A. 1992. “Entry, Exit, and Firm Dynamics in Long Run Equilibrium.” Econometrica 60: 1127–50, https://doi.org/10.2307/2951541.Search in Google Scholar

Horowitz, J. L. 2001. “The Bootstrap.” Handbook of Econometrics 5: 3159–228.10.1016/S1573-4412(01)05005-XSearch in Google Scholar

Horowitz, J. L. 2011. “Applied Nonparametric Instrumental Variables Estimation.” Econometrica 79 (2): 347–94.10.3982/ECTA8662Search in Google Scholar

Horowitz, J. L. 2014. “Ill-posed Inverse Problems in Economics.” Annual Review of Economics 6 (1): 21–51. https://doi.org/10.1146/annurev-economics-080213-041213.Search in Google Scholar

Johannes, J., S. Van Bellegem, and A. Vanhems. 2011. “Convergence Rates for Ill-Posed Inverse Problems with an Unknown Operator.” Econometric Theory 27 (3): 522–45. https://doi.org/10.1017/s0266466610000393.Search in Google Scholar

Johannes, J., S. Van Bellegem, and A. Vanhems. 2013. “Iterative Regularisation in Nonparametric Instrumental Regression.” Journal of Statistical Planning and Inference 143 (1): 24–39. https://doi.org/10.1016/j.jspi.2012.07.010.Search in Google Scholar

Koebel, B., and F. Laisney. 2016. “Aggregation with Cournot Competition: An Empirical Investigation.” Annals of Economics and Statistics (121/122): 91–119. https://doi.org/10.15609/annaeconstat2009.121-122.91.Search in Google Scholar

Lancaster, T. 2000. “The Incidental Parameter Problem since 1948.” Journal of Econometrics 95 (2): 391–413. https://doi.org/10.1016/s0304-4076(99)00044-5.Search in Google Scholar

Landweber, L. 1951. “An Iteration Formula for Fredholm Integral Equations of the First Kind.” American Journal of Mathematics 73 (3): 615–24. https://doi.org/10.2307/2372313.Search in Google Scholar

Lee, Y. 2014. “Nonparametric Estimation of Dynamic Panel Models with Fixed Effects.” Econometric Theory 30 (6): 1315–47. https://doi.org/10.1017/s0266466614000188.Search in Google Scholar

Lee, Y., D. Mukherjee, and A. Ullah. 2019. “Nonparametric Estimation of the Marginal Effect in Fixed-Effect Panel Data Models.” Journal of Multivariate Analysis 171: 53–67. https://doi.org/10.1016/j.jmva.2018.11.013.Search in Google Scholar

Lewbel, A. 2012. “Using Heteroscedasticity to Identify and Estimate Mismeasured and Endogenous Regressor Models.” Journal of Business & Economic Statistics 30 (1): 67–80. https://doi.org/10.1080/07350015.2012.643126.Search in Google Scholar

Linton, O., and J. P. Nielsen. 1995. “A Kernel Method of Estimating Structured Nonparametric Regression Based on Marginal Integration.” Biometrika 82: 93–100, https://doi.org/10.1093/biomet/82.1.93.Search in Google Scholar

Malikov, E., S. Zhao, and S. C. Kumbhakar. 2020. “Estimation of Firm-Level Productivity in the Presence of Exports: Evidence from China’s Manufacturing.” Journal of Applied Econometrics 35 (4): 457–80. https://doi.org/10.1002/jae.2757.Search in Google Scholar

Mammen, E. 1993. “Bootstrap and Wild Bootstrap for High Dimensional Linear Models.” Annals of Statistics 21 (1): 255–85. https://doi.org/10.1214/aos/1176349025.Search in Google Scholar

Mroz, T. A. 1987. “The Sensitivity of an Empirical Model of Married Women’s Hours of Work to Economic and Statistical Assumptions.” Econometrica 55 (4): 765–99. https://doi.org/10.2307/1911029.Search in Google Scholar

Newey, W. K., and J. L. Powell. 2003. “Instrumental Variable Estimation of Nonparametric Models.” Econometrica 71 (5): 1565–78. https://doi.org/10.1111/1468-0262.00459.Search in Google Scholar

Neyman, J., and E. L. Scott. 1948. “Consistent Estimates Based on Partially Consistent Observations.” Econometrica 16: 1–32, https://doi.org/10.2307/1914288.Search in Google Scholar

Parmeter, C. F., and J. S. Racine. 2019. “Nonparametric Estimation and Inference for Panel Data Models.” Panel Data Econometrics: 97–129. https://doi.org/10.1016/b978-0-12-814367-4.00004-6.Search in Google Scholar

Qian, J., and L. Wang. 2012. “Estimating Semiparametric Panel Data Models by Marginal Integration.” Journal of Econometrics 167: 483–93. https://doi.org/10.1016/j.jeconom.2011.09.030.Search in Google Scholar

Racine, J. S. 2019. An Introduction to the Advanced Theory of Nonparametric Econometrics: A Replicable Approach Using R. Cambridge University Press.10.1017/9781108649841Search in Google Scholar

Racine, J. S., and T. Hayfield. 2023. Package ‘np’. https://cran.r-project.org/web/packages/np/np.pdf: CRAN.Search in Google Scholar

Rodriguez-Poo, J. M., and A. Soberon. 2017. “Nonparametric and Semiparametric Panel Data Models: Recent Developments.” Journal of Economic Surveys 31 (4): 923–60. https://doi.org/10.1111/joes.12177.Search in Google Scholar

Su, L., and A. Ullah. 2006. “Profile Likelihood Estimation of Partially Linear Panel Data Models with Fixed Effects.” Economics Letters 92 (1): 75–81. https://doi.org/10.1016/j.econlet.2006.01.019.Search in Google Scholar

Tikhonov, A. N., and V. Y. Arsenin. 1977. Solutions of Ill-Posed Problems, 1–30. New York: Winston.Search in Google Scholar

Wooldridge, J. M. 2016. Introductory Econometrics: A Modern Approach. Nelson Education.Search in Google Scholar

Ziliak, J. P. 1997. “Efficient Estimation with Panel Data when Instruments Are Predetermined: An Empirical Comparison of Moment-Condition Estimators.” Journal of Business & Economic Statistics 15 (4): 419–31. https://doi.org/10.2307/1392488.Search in Google Scholar

Received: 2022-07-15

Accepted: 2023-09-15

Published Online: 2023-10-05

This work is licensed under the Creative Commons Attribution 4.0 International License.

Articles in the same Issue

https://doi.org/10.1515/jem-2022-0025

Keywords for this article

endogeneity; panel data; unobserved heterogeneity; regularization; kernel regression

Creative Commons

BY 4.0