Estimation and testing of the factor-augmented panel regression models with missing data

Difa Xiao; Lu Wang; Jianhong Wu

doi:10.1515/snde-2022-0042

Enjoy 40% off

academic books on De Gruyter Brill *

Article

Estimation and testing of the factor-augmented panel regression models with missing data

Difa Xiao , Lu Wang and Jianhong Wu

Published/Copyright: March 2, 2023

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information

From the journal Studies in Nonlinear Dynamics & Econometrics Volume 28 Issue 4

Abstract

This paper focuses on the factor-augmented panel regression models with missing data and individual-varying factors. A so-called CCEM estimator for the slope coefficient is proposed and its asymptotic properties are investigated under some regularity conditions. Furthermore, a joint test statistic is constructed for serial correlation and heteroscedasticity in the idiosyncratic errors. Under the null hypothesis, the test statistic can be shown to be asymptotically chi-square distributed. Monte Carlo simulation results show that the proposed estimator and test statistic have desired performance in finite samples.

Keywords: estimation; factor-augmented regression models; individual-varying factors; missing data; test

JEL Classification: C12; C13; C33

Corresponding author: Jianhong Wu, Shanghai Normal University, Shanghai 200234, China, E-mail: wujianhong@shnu.edu.cn

Author contributions: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.
Research funding: This paper is supported in part by the National Nature Science Foundation of China (Grant No. 72173086).
Conflict of interest statement: The authors declare no conflicts of interest regarding this article.

Appendix

The appendix contains the proofs of Theorems 1–3.

A.1 Proof of Theorem 1

As a step of the proof of Theorem 1, an asymptotic expansion of N ( β ̂ CCEM − β ) will be firstly derived. Moreover, since the proof of Theorem 1 is similar to that of Theorem 1 in Westerlund et al. (2019), only some important details are presented here.

A.1.1 Step1: deriving the asymptotic expansion of N ( β ̂ CCEM − β )

This expansion depends on whether m < k + 1 or m = k + 1. Suppose first that m < k + 1. We introduce a full rank matrix H _j = [H _m,j, H _−m,j] and a normalization matrix D N j = diag ( I m , N j I k + 1 − m ) , where H _j and D N j are both (k + 1) × (k + 1). Also, H _m,j and H _−m,j are (k + 1) × m and (k + 1) × (k + 1 − m) matrices, respectively. H m , j = Q ′ C ̄ j ′ C ̄ j Q Q ′ C ̄ j ′ − 1 and H _−m,j = G _j B _−m,j where G _j and B _−m,j are the same as G and B _−m in Westerlund et al. (2019). The useful properties of H _j and D N j are respectively C ̄ j Q H j = I m , 0 m × ( k + 1 − m ) and U ̄ j Q H j D N j = [ U ̄ j Q H m , j , N j U ̄ j Q H − m , j ] = U ̄ m , j 0 , U ̄ − m , j 0 = U ̄ j 0 , where U ̄ m , j 0 = O p N j − 1 / 2 and U ̄ − m , j 0 = O p ( 1 ) . We have

Z ̄ j 0 = Z ̄ j H j D N j = F j , 0 T j × ( k + 1 − m ) + U ̄ m , j 0 , U ̄ − m , j 0 = F j , U ̄ − m , j 0 + O p N j − 1 / 2 , Y i = X j i β + Z ̄ j H m , j γ j i − ( Z ̄ j − F j C ̄ j Q ) H m , j γ j i + ε j i = X j i β + Z ̄ j H m , j γ j i − U ̄ m , j 0 γ j i + ε j i .

Moreover, since M Z ̄ j = M Z ̄ j 0 , we have

N ( β ̂ CCEM − β ) = 1 N ∑ j = 1 n ∑ i = 1 N j X j i ′ M Z ̄ j 0 X j i − 1 1 N ∑ j = 1 n ∑ i = 1 N j X j i ′ M Z ̄ j 0 ε j i − U ̄ m , j 0 γ j i .

Consider the numerator,

1 N ∑ j = 1 n ∑ i = 1 N j X j i ′ M Z ̄ j 0 ε j i − U ̄ m , j 0 γ j i = ∑ j = 1 n N j N 1 N j ∑ i = 1 N j X j i ′ M Z ̄ j 0 ε j i − U ̄ m , j 0 γ j i .

In analogy to (A.46) in Westerlund et al. (2019), we have

1 N j ∑ i = 1 N j X j i ′ M Z ̄ j 0 ε j i − U ̄ m , j 0 γ j i = 1 N j ∑ i = 1 N j V j i ′ ( M F j − P − m , j ) ε j i + O p N j − 1 / 2 ,

where P − m , j = P M F j U ̄ − m , j 0 as m < k + 1, and P − m , j = 0 T j × T j as m = k + 1 (see also, Westerlund et al. 2019). Hence, the numerator of N ( β ̂ CCEM − β ) can be written as

(A.1) 1 N ∑ j = 1 n ∑ i = 1 N j X j i ′ M Z ̄ j 0 ε j i − U ̄ m , j 0 γ j i = ∑ j = 1 n N j N 1 N j ∑ i = 1 N j V j i ′ ( M F j − P − m , j ) ε j i + O p N j − 1 / 2 = 1 N ∑ j = 1 n ∑ i = 1 N j V j i ′ ( M F j − P − m , j ) ε j i + O p ( N − 1 / 2 ) .

Next, consider the denominator of N ( β ̂ CCEM − β ) . By analogy to Lemma A.2 in Westerlund et al. (2019), we can write

(A.2) 1 N ∑ j = 1 n ∑ i = 1 N j X j i ′ M Z ̄ j 0 X j i = ∑ j = 1 n N j N 1 N j ∑ i = 1 N j V j i ′ ( M F j − P − m , j ) V j i + O p N j − 1 / 2 = 1 N ∑ j = 1 n ∑ i = 1 N j V j i ′ ( M F j − P − m , j ) V j i + O p ( N − 1 / 2 ) = Σ N + O p ( N − 1 / 2 ) .

According to (A.1 & A.2), we can obtain the asymptotic expansion of N ( β ̂ CCEM − β ) as follows,

(A.3) N ( β ̂ CCEM − β ) = Σ N − 1 1 N ∑ j = 1 n ∑ i = 1 N j V j i ′ ( M F j − P − m , j ) ε j i + O p ( N − 1 / 2 ) .

A.1.2 Step 2: proof of Theorem 1

Now, we try to derive the asymptotic distribution of N ( β ̂ CCEM − β ) , and first consider the case with m = k + 1. In analogy to Westerlund et al. (2019), we let ξ j i = V j i ′ ( M F j − P − m , j ) ε j i = V j i ′ M F j ε j i , ζ F j being the σ-field generated by F _j, and F j i being the corresponding σ-field generated by ζ F j and ( ξ j 1 , … , ξ j i ) . Then { ( ξ j i , F j i ) : i ≥ 1 } is a martingale difference sequence, and ξ j i is independent across j _i and E ( ξ j i | F j i − 1 ) = E ( ξ j i | ζ F j ) = 0 k × 1 . Hence, the outer sum has a mixed normal distribution (see also, Westerlund et al. 2019), which is given by

(A.4) 1 N ∑ j = 1 n ∑ i = 1 N j ξ j i → d M N ( 0 k × 1 , S ) ,

where

S = lim N → ∞ 1 N ∑ j = 1 n ∑ i = 1 N j E ξ j i ξ j i ′ | F j i − 1 = lim N → ∞ 1 N ∑ j = 1 n ∑ i = 1 N j E V j i ′ M F j Σ ε , j i M F j V j i | ζ F j = ∑ j = 1 n w j lim N j → ∞ 1 N j ∑ i = 1 N j E V j i ′ M F j Σ ε , j i M F j V j i | ζ F j = ∑ j = 1 n w j S j ,

with Σ ε , j i = E ε j i ε j i ′ . By the law of large numbers of Hall and Heyde (1980), we can obtain

Σ N = 1 N ∑ j = 1 n ∑ i = 1 N j V j i ′ M F j V j i → p ∑ j = 1 n w j lim N j → ∞ 1 N j ∑ i = 1 N j E V j i ′ M F j V j i | ζ F j = ∑ j = 1 n w j Σ j = Σ .

Hence, as N → ∞, we can also show that

(A.5) N ( β ̂ CCEM − β ) = Σ N − 1 1 N ∑ j = 1 n ∑ i = 1 N j V j i ′ M F j ε j i + O p ( N − 1 / 2 ) → d M N 0 k × 1 , Σ − 1 S Σ − 1 .

Next, consider the case with m < k + 1. It follows that

1 N ∑ j = 1 n ∑ i = 1 N j V j i ′ ( M F j − P − m , j ) ε j i = 1 N ∑ j = 1 n ∑ i = 1 N j ∑ t = 1 T j ∑ s = 1 T j v j i , t ( M F j , t , s − P − m , j , t , s ) ε j i , s = ∑ j = 1 n ∑ t = 1 T j ∑ s = 1 T j N j N ( M F j , t , s − P − m , j , t , s ) N j z ̄ j , t , s ,

where z ̄ j , t , s = N j − 1 ∑ i = 1 N j ε j i , s v j i , t , M F j , t , s and p_−m,j,t,s are the (t, s)-th element of M F j and p_−m,j respectively. In analogy to (A.56 & A.61) in Westerlund et al. (2019), we have

N j u ̄ j , t → d n u j , t = d N ( 0 ( k + 1 ) × 1 , Σ u , j , t ) , N j z ̄ j , t , s → d n z j , t , s = d N ( 0 k × 1 , Σ z , j , t , s ) .

Hereafter, “ = d ” means equality in distribution, Σ u , j , t = lim N j → ∞ N j − 1 ∑ i = 1 N j ( Σ u , j i , t ) and Σ z , j , t , s = lim N j → ∞ N j − 1 ∑ i = 1 N j σ ε , j i , s 2 Σ v , j i , t . By using u ̄ − m , j , t 0 to signify the tth row of U ̄ − m , j 0 , we have

u ̄ − m , j , t 0 = H − m , j ′ Q ′ N j u ̄ j , t → d H − m , j ′ Q ′ n u j , t .

Then, we obtain

P − m , j , t , s = ∑ l = 1 T j M F j , t , l u ̄ − m , j , l 0 ′ U ̄ − m , j 0 ′ M F j U ̄ − m , j 0 − 1 ∑ r = 1 T j u ̄ − m , j , r 0 M F j , s , r → d ∑ l = 1 T j M F j , t , l n u j , l ′ Q H − m , j H − m , j ′ Q ′ ∑ l = 1 T j ∑ r = 1 T j M F j , l , r n u j , l n u j , r ′ Q H − m , j − 1 × ∑ r = 1 T j M F j , s , r H − m , j ′ Q ′ n u j , r = p − m , j , t , s ,

as N _j → ∞, which implies

1 N ∑ j = 1 n ∑ i = 1 N j V j i ′ ( M F j − P − m , j ) ε j i = ∑ j = 1 n ∑ t = 1 T j ∑ s = 1 T j N j N ( M F j , t , s − p − m , j , t , s ) N j z ̄ j , t , s → d ∑ j = 1 n ∑ t = 1 T j ∑ s = 1 T j w j ( M F j , t , s − p − m , j , t , s ) n z j , t , s .

According to (A.64) in Westerlund et al. (2019), we have E u j i , t z l r , n , s ′ = 0 ( k + 1 ) × k for all i, j, l, r, t, n and s. It means that n z j , n , s and M F j , t , s − p − m , j , t , s are independent of each other. Hence,

∑ j = 1 n ∑ t = 1 T j ∑ s = 1 T j w j ( M F j , t , s − p − m , j , t , s ) n z j , t , s = d M N 0 k × 1 , S ,

where S = ∑ j = 1 n w j S j , and

S j = ∑ t = 1 T j ∑ s = 1 T j ∑ l = 1 T j ∑ r = 1 T j ( M F j , t , s − p − m , j , t , s ) ( M F j , l , r − p − m , j , l , r ) Σ z j , t , l , s , r ,

with Σ z j , t , l , s , r = lim N j → ∞ N j − 1 ∑ i = 1 N j ( σ ε , j i , s , r Σ v , j i , t , l ) , σ ε , j i , s , r = E ( ε j i , s ε j i , r ) and Σ v , j i , t , l = E v j i , t v j i , l ′ . For the denominator, we have

Σ N = ∑ j = 1 n ∑ t = 1 T j ∑ s = 1 T j N j N ( M F j , t , s − P − m , j , t , s ) 1 N j ∑ i = 1 N j v j i , t v j i , s ′ → p ∑ j = 1 n ∑ t = 1 T j ∑ s = 1 T j w j ( M F j , t , s − p − m , j , t , s ) Σ v , j i , t , s = ∑ j = 1 n w j Σ j = Σ .

Hence, as N → ∞, we can show that

N ( β ̂ CCEM − β ) → d M N 0 k × 1 , Σ − 1 S Σ − 1 .

The proof of Theorem 1 is then complete. □

A.2 Proof of Theorems 2 and 3

The proofs of Theorems 2–3 are respectively similar to those of Westerlund et al. (2019) and Wu (2020), and then omitted to save the space.

References

Ahn, S. C., Y. H. Lee, and P. Schmidt. 2001. “GMM Estimation of Linear Panel Data Models with Time-Varying Individual Effects.” Journal of Econometrics 101: 219–55. https://doi.org/10.1016/s0304-4076(00)00083-x.Search in Google Scholar

Ahn, S. C., Y. H. Lee, and P. Schmidt. 2013. “Panel Data Models with Multiple Time-Varying Individual Effects.” Journal of Econometrics 174: 1–14. https://doi.org/10.1016/j.jeconom.2012.12.002.Search in Google Scholar

Bai, J. 2009. “Panel Data Models with Interactive Fixed Effects.” Econometrica 77: 1229–79.10.3982/ECTA6135Search in Google Scholar

Bai, J., Y. Liao, and J. Yang. 2015. “Unbalanced Panel Data Models With Interactive Effects.” In Prepared for Oxford Handbook of Panel Data Econometrics. Wellington Square, Oxford: Oxford University Press.10.1093/oxfordhb/9780199940042.013.0005Search in Google Scholar

Baltagi, B. H. 1985. “Pooling Cross-Sections with Unequal Time-Series Lengths.” Economics Letters 18: 133–6. https://doi.org/10.1016/0165-1765(85)90167-3.Search in Google Scholar

Baltagi, B. H., Y. Chang, and Q. Li. 1999. Testing for Random Individual and Time Effects Using Unbalanced Panel Data Messy Data, Vol. 13, 1–20. Bingley: Emerald Group Publishing Limited.10.1108/S0731-9053(1999)0000013003Search in Google Scholar

Baltagi, B. H., and S. H. Song. 2006. “Unbalanced Panel Data: A Survey.” Statistical Papers 47: 493–523. https://doi.org/10.1007/s00362-006-0304-0.Search in Google Scholar

Baltagi, B. H., S. H. Song, and B. C. Jung. 2002. “A Comparative Study of Alternative Estimators for the Unbalanced Two-Way Error Component Regression Model.” The Econometrics Journal 5: 480–93. https://doi.org/10.1111/1368-423x.t01-1-00094.Search in Google Scholar

Chudik, A., and M. H. Pesaran. 2015. “Common Correlated Effects Estimation of Heterogeneous Dynamic Panel Data Models with Weakly Exogenous Regressors.” Journal of Econometrics 188: 393–420. https://doi.org/10.1016/j.jeconom.2015.03.007.Search in Google Scholar

Chen, J., R. Yue, and J. Wu. 2018. “Hausman-type Tests for Individual and Time Effects in the Panel Regression Model with Incomplete Data.” Journal of the Korean Surgical Society 47: 347–63. https://doi.org/10.1016/j.jkss.2018.04.002.Search in Google Scholar

Hall, P., and C. Heyde. 1980. Martingale Limit Theory and its Application. New York: Academic Press.Search in Google Scholar

Oya, K.. 2004. “Test of Random Effects with Incomplete Panel Data.” Mathematics and Computers in Simulation 64: 409–19. https://doi.org/10.1016/s0378-4754(03)00107-1.Search in Google Scholar

Pesaran, M. H. 2006. “Estimation and inference in large heterogeneous panels with a multifactor error structure.” Econometrica 74 (4): 967–1012.10.1111/j.1468-0262.2006.00692.xSearch in Google Scholar

Pesaran, M. H., and E. Tosetti. 2011. “Large Panels with Common Factors and Spatial Correlation.” Journal of Econometrics 161: 182–202. https://doi.org/10.1016/j.jeconom.2010.12.003.Search in Google Scholar

Robertson, D., and V. Sarafidis. 2015. “IV Estimation of Panels with Factor Residuals.” Journal of Econometrics 185: 526–41. https://doi.org/10.1016/j.jeconom.2014.12.001.Search in Google Scholar

Westerlund, J., Y. Petrova, and M. Norkute. 2019. “CCE in Fixed-T Panels.” Journal of Applied Econometrics 34: 746–61. https://doi.org/10.1002/jae.2707.Search in Google Scholar

Wu, J.. 2020. “A Joint Test for Serial Correlation and Heteroscedasticity in Fixed-T Panel Regression Models with Interactive Effects.” Economics Letters 197: 109594. https://doi.org/10.1016/j.econlet.2020.109594.Search in Google Scholar

Wu, J., J. Qin, and Q. Ding. 2015. “A Moment-Based Test for Individual Effects in the Error Component Model with Incomplete Panels.” Statistics & Probability Letters 104: 153–62. https://doi.org/10.1016/j.spl.2015.05.013.Search in Google Scholar

Yue, L., G. Li, and J. Zhang. 2017. “Statistical Inference for the Unbalanced Two-Way Error Component Regression Model with Errors-In-Variables.” Journal of the Korean Surgical Society 46: 593–607. https://doi.org/10.1016/j.jkss.2017.06.001.Search in Google Scholar

Zhou, Q., and Y. Zhang. 2016. “Common Correlated Effects Estimation of Unbalanced Panel Data Models with Cross-Sectional Dependence.” Journal of Economic Theory and Econometrics 27: 25–45.Search in Google Scholar

Received: 2022-04-27

Accepted: 2023-02-07

Published Online: 2023-03-02

You are currently not able to access this content.

Articles in the same Issue

https://doi.org/10.1515/snde-2022-0042

Keywords for this article

estimation; factor-augmented regression models; individual-varying factors; missing data; test