Home On determination of the number of factors in an approximate factor model
Article
Licensed
Unlicensed Requires Authentication

On determination of the number of factors in an approximate factor model

  • Jinshan Liu , Jiazhu Pan , Qiang Xia EMAIL logo and Li Xiao
Published/Copyright: October 3, 2022

Abstract

This paper proposes a ridge-type method for determining the number of factors in an approximate factor model. The new estimator of factor number is obtained by maximizing both the ratio of two adjacent eigenvalues and the cumulative contribution rate of the factors which represents the explanatory power of the common factors for response variables. Our estimator is proved to be as asymptotically consistent as those in (Ahn, S., and A. Horenstein. 2013. “Eigenvalue Ratio Test for the Number of Factors.” Econometrica 81: 1203–27). But Monte Carlo simulation experiments show our method has better correct selection rates in finite sample cases. A real data example is given for illustration.

JEL Classification: C1; C3

Corresponding author: Qiang Xia, College of Mathematics and Informatics, South China Agricultural University, Guangzhou 510642, China, E-mail:

Funding source: the National Natural Science Foundation of China

Award Identifier / Grant number: 12171161, 91746102

Funding source: Natural Science Foundation of Guangdong Province

Award Identifier / Grant number: 2022A1515011754

Funding source: East China Normal University

Award Identifier / Grant number: Unassigned

Funding source: Ministry of Education in China Project of Humanities and Social Sciences

Award Identifier / Grant number: 17YJA910002

Acknowledgments

We thank the Editor and the Referee(s) for their insightful comments and suggestions that help us significantly to improve our manuscript.

  1. Author contribution: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.

  2. Research funding: This research was supported by the National Natural Science Foundation of China (No. 12171161, 91746102), the Natural Science Foundation of Guangdong Province of China (No. 2022A1515011754), and Ministry of Education in China Project of Humanities and Social Sciences (No. 17YJA910002), the Open Research Fund of Key Laboratory of Advanced Theory and Application in Statistics and Data Science (East China Normal University), Ministry of Education.

  3. Conflict of interest statement: The authors declare no conflicts of interest regarding this article.

Appendix A

The proofs of our theorems will need the following lemmas.

Lemma 1

Under Assumptions A–D, for j = 1, …, r,

μ ̃ N T , j = μ N T , j + O p ( N 1 / 2 ) + O p ( m 1 ) .

Lemma 2

Under Assumptions A, C and D, for j = 1, …, [d c m] − 2r, where d c ∈ (0, 1], we have that

c ̲ + o p ( 1 ) m μ ̃ N T , r + j c ̄ + o p ( 1 ) ,

where c ̲ = c 2 2 y * * ( 1 b y * ) 2 , y * * = lim m ( N / M ) and c ̄ = c 1 2 ( 1 + y ) 2 .

Lemma 3

Under Assumptions A–D, V(r + 1) ≍ O p (1).

For the proofs of Lemma 1 to Lemma 3, see Lemma A.11, Lemma A.9 and Lemma A.12 in Ahn and Horenstein (2013), respectively.

Proof of Theorem 1

It follows from Lemma A.11 and Lemma A.9 of Ahn and Horenstein (2013) that μ ̃ N T , j = μ N T , j + O p ( N 1 / 2 ) + O p ( m 1 ) O p ( 1 ) , for j = 1, …, r and μ ̃ N T , j O p ( 1 / m ) , for j = r + 1, …, [d c m] − r. By Lemma A.12 of Ahn and Horenstein (2013), V(r + 1) ≍ O p (1). Therefore, for j = 1, …, r − 1,

μ ̃ N T , j μ ̃ N T , j + 1 1 v + V ( j ) μ ̃ N T , j μ ̃ N T , j + 1 1 v + V ( r 1 ) = μ ̃ N T , j μ ̃ N T , j + 1 1 v + μ ̃ N T , r + μ ̃ N T , r + 1 + V ( r + 1 ) = O P ( 1 ) O P ( 1 ) = O P ( 1 ) .

For j = r,

μ ̃ N T , r μ ̃ N T , r + 1 1 v + V ( r ) = μ ̃ N T , r μ ̃ N T , r + 1 1 v + μ ̃ N T , r + 1 + V ( r + 1 ) O P ( m ) O P ( 1 ) O P ( m ) .

For j = r + 1, …, kmax,

μ ̃ N T , j μ ̃ N T , j + 1 1 v + V ( j ) < μ ̃ N T , j μ ̃ N T , j + 1 1 v = O P ( 1 ) .

Thus, the consistency of the estimator r ̂ EC defined by (2.7) is verified.

Proof of Theorem 2

For j = 1, …, r − 1,

μ ̃ N T , j μ ̃ N T , j + 1 V ( j ) V ( j 1 ) μ ̃ N T , j μ ̃ N T , j + 1 = O P ( 1 ) .

For j = r,

μ ̃ N T , r μ ̃ N T , r + 1 V ( r ) V ( r 1 ) = μ ̃ N T , r μ ̃ N T , r + 1 V ( r ) μ ̃ N T , r + V ( r ) O P ( m ) O P ( 1 ) O P ( m ) .

For j = r + 1, …, kmax, kmax ≤ [d c m] − r,

μ ̃ N T , j μ ̃ N T , j + 1 V ( j ) V ( j 1 ) μ ̃ N T , j μ ̃ N T , j + 1 = O P ( 1 ) .

Therefore, the consistency of r ̂ CR defined by (2.8) is proved.

References

Ahn, S., and A. Horenstein. 2013. “Eigenvalue Ratio Test for the Number of Factors.” Econometrica 81: 1203–27.10.3982/ECTA8968Search in Google Scholar

Alessi, L., M. Barigozzi, and M. Capasso. 2010. “Improved Penalization of Determining the Number of Factors in Approximate Factor Models.” Statistics & Probability Letters 80: 1806–13. https://doi.org/10.1016/j.spl.2010.08.005.Search in Google Scholar

Amengual, D., and M. Watson. 2007. “Consistent Estimation of the Number of Dynamic Factors in a Large N and T Panel.” Journal of Business & Economic Statistics 25: 91–6. https://doi.org/10.1198/073500106000000585.Search in Google Scholar

Bai, J., and S. Ng. 2002. “Determining the Number of Factors in Approximate Factor Models.” Econometrica 70: 191–221. https://doi.org/10.1111/1468-0262.00273.Search in Google Scholar

Bai, J., and S. Ng. 2007. “Determining the Number of Primitive Shocks in Factor Models.” Journal of Business & Economic Statistics 25: 52–60. https://doi.org/10.1198/073500106000000413.Search in Google Scholar

Bernanke, B., J. Boivin, and P. Eliasz. 2005. “Measuring the Effects of Monetary Policy: A Factor Augmented Vector Autoregressive (Favar) Approach.” Quarterly Journal of Economics 120: 387–422. https://doi.org/10.1162/qjec.2005.120.1.387.Search in Google Scholar

Caner, M., and X. Han. 2014. “Selecting the Correct Number of Factors in Approximate Factor Models: The Large Panel Case with Group Bridge Estimators.” Journal of Business & Economic Statistics 32: 359–74. https://doi.org/10.1080/07350015.2014.880349.Search in Google Scholar

Chamberlain, G., and M. Rothschild. 2014. “Arbitrary, Factor Structure, and Mean-Variance Analysis on Large Asset Markets.” Econometrica 51: 1281–304.10.2307/1912275Search in Google Scholar

Chan, N., Y. Lu, and C. Yau. 2017. “Factor Modelling for High-Dimensional Time Series: Inference and Model Selection.” Journal of Time Series Analysis 38: 285–307. https://doi.org/10.1111/jtsa.12207.Search in Google Scholar

Connor, G., and R. Korajzcyk. 1993. “A Test for the Number of Factors in an Approximate Factor Model.” The Journal of Finance 48: 1263–91. https://doi.org/10.1111/j.1540-6261.1993.tb04754.x.Search in Google Scholar

Fan, J., Y. Liao, and M. Mincheva. 2013. “Large Covariance Estimation by Thresholding Principal Orthogonal Complements.” Journal of the Royal Statistical Society: Series B 75: 603–80. https://doi.org/10.1111/rssb.12016.Search in Google Scholar PubMed PubMed Central

Forni, M., M. Wallin, M. Lippi, and L. Reichlin. 2000. “The Generalized Dynamic-Factor Model: Idendification and Estimation.” The Review of Economics and Statistics 82: 540–54. https://doi.org/10.1162/003465300559037.Search in Google Scholar

Hallin, M., and R. Liska. 2007. “Determining the Number of Factors in General Dynamic Factor Model.” Journal of the American Statistical Association 102: 603–17. https://doi.org/10.1198/016214506000001275.Search in Google Scholar

Lam, C., and Q. Yao. 2012. “Factor Modeling for High-Dimensional Time Series: Inference for the Number of Factors.” Annals of Statistics 40: 694–726. https://doi.org/10.1214/12-aos970.Search in Google Scholar

Lam, C., Q. Yao, and N. Bathia. 2011. “Estimation for Latent Factor Models for High-Dimensional Time Series.” Biometrika 98: 901–18. https://doi.org/10.1093/biomet/asr048.Search in Google Scholar

Li, H., Q. Li, and Y. Shi. 2017. “Determining the Number of Factors when the Number of Factors Can Increase with Sample Size.” Journal of Econometrics 197: 76–86. https://doi.org/10.1016/j.jeconom.2016.06.003.Search in Google Scholar

Li, Q., and J. Pan. 2008. “Determining the Number of Factors in a Multivariate Error Correction-Volatility Factor Model.” Journal of Econometrics 12: 45–61.10.1111/j.1368-423X.2008.00259.xSearch in Google Scholar

Onatski, A. 2009. “Testing Hypotheses about the Number of Factors in Large Factor Models.” Ecomometrica 77: 1447–79.10.3982/ECTA6964Search in Google Scholar

Onatski, A. 2010. “Determining the Number of Factors from Empirical Distribution of Eigenvalues.” The Review of Economics and Statistics 92: 1004–16. https://doi.org/10.1162/rest_a_00043.Search in Google Scholar

Pan, J., and Q. Yao. 2008. “Modelling Multiple Time Series via Common Factors.” Biometrika 95: 356–79. https://doi.org/10.1093/biomet/asn009.Search in Google Scholar

Stock, J., and M. Watson. 2002. “Forecasting Using Principal Components from a Large Number of Predictors.” Journal of the American Statistical Association 97: 1167–79. https://doi.org/10.1198/016214502388618960.Search in Google Scholar

Wu, J. 2016. “Robust Determination for the Number of Common Factors in the Approximate Factor Models.” Economics Letters 144: 102–6. https://doi.org/10.1016/j.econlet.2016.04.026.Search in Google Scholar

Wu, J. 2018. “Eigenvalue Difference Test for the Number of Common Factors in the Approximate Factor Models.” Economics Letters 169: 63–7. https://doi.org/10.1016/j.econlet.2018.05.009.Search in Google Scholar

Xia, Q., R. Liang, and J. Wu. 2017. “Transformed Contribution Ratio Test for the Number of Factors in Static Approximate Factor Models.” Computational Statistics & Data Analysis 112: 235–41. https://doi.org/10.1016/j.csda.2017.03.005.Search in Google Scholar

Xia, Q., R. Liang, J. Wu, and H. Wong. 2018. “Determining the Number of Factors for High-Dimensional Time Teries.” Statistics and Its Interface 11: 307–16. https://doi.org/10.4310/sii.2018.v11.n2.a8.Search in Google Scholar

Xia, Q., H. Wong, S. Shen, and K. He. 2022. “Factor Analysis for High Dimensional Time Series: Consistent Estimation and Efficient Computation.” Statistical Analysis and Data Mining 15: 247–63. https://doi.org/10.1002/sam.11557.Search in Google Scholar

Xia, Q., W. Xu, and L. Zhu. 2015. “Consistently Determining the Number of Factors in Multivariate Volatility Modelling.” Statistica Sinica 25: 1025–44. https://doi.org/10.5705/ss.2013.252.Search in Google Scholar

Received: 2020-05-12
Revised: 2022-05-31
Accepted: 2022-06-03
Published Online: 2022-10-03

© 2022 Walter de Gruyter GmbH, Berlin/Boston

Downloaded on 21.11.2025 from https://www.degruyterbrill.com/document/doi/10.1515/snde-2020-0055/html
Scroll to top button