Home Initial Beliefs Uncertainty
Article
Licensed
Unlicensed Requires Authentication

Initial Beliefs Uncertainty

  • Jaqueson K. Galimberti ORCID logo EMAIL logo
Published/Copyright: October 23, 2023

Abstract

This paper evaluates how initial beliefs uncertainty can affect data weighting and the estimation of models with adaptive learning. One key finding is that misspecification of initial beliefs uncertainty, particularly with the common approach of artificially inflating initials uncertainty to accelerate convergence of estimates, generates time-varying profiles of weights given to past observations in what should otherwise follow a fixed profile of decaying weights. The effect of this misspecification, denoted as diffuse initials, is shown to distort the estimation and interpretation of learning in finite samples. Simulations of a forward-looking Phillips curve model indicate that (i) diffuse initials lead to downward biased estimates of expectations relevance in the determination of actual inflation, and (ii) these biases spill over to estimates of inflation responsiveness to output gaps. An empirical application with U.S. data shows the relevance of these effects for the determination of expectational stability over decadal subsamples of data. The use of diffuse initials is also found to lead to downward biased estimates of learning gains, both estimated from an aggregate representative model and estimated to match individual expectations from survey expectations data.

JEL Classification: E70; D83; D84; E37; C32; C63

Corresponding author: Jaqueson K. Galimberti, Asian Development Bank, 6 ADB Avenue, 1550 Mandaluyong, Metro Manila, Phillipines; Centre for Applied Macroeconomic Analysis, Australian National University, Canberra, Australia; and KOF Swiss Economic Institute, ETH Zurich, Zürich, Switzerland, E-mail: , https://sites.google.com/site/jkgeconoeng/

I thank the Associate Editor Eva Carceles-Poveda and an anonymous reviewer for their helpful and insightful assessments. Paper presented at the 27th International Conference Computing in Economics and Finance, held online in 2021. I thank the conference organizers and participants for the comments that helped shape the current draft. Earlier versions of this paper circulated under the title ‘[Initial beliefs uncertainty and] Information weighting in the estimation of models with adaptive learning.’ Any remaining errors are my own. The views expressed in this paper are those of the author and do not necessarily represent the views of his corresponding institutional affiliations.


Appendix A: Proofs and Derivations

A.1 Correspondence Between Penalized WLS and RLS

To see how the RLS of (2) and (3) can be derived from the penalized WLS formulation of (5) and (6), first notice that iterating (3) recursively from R 0 we have that

R t = i = 1 t ω t , i x i x i + ω t , 0 R 0 ,

which is the inverse of the first term in (5), leading to

(22) ϕ ̂ t = R t 1 i = 1 t ω t , i x i y i + ω t , 0 R 0 ϕ 0 .

For the second term notice that

i = 1 t ω t , i x i y i = i = 1 t 1 ω t , i x i y i + γ t x t y t , = 1 γ t i = 1 t 1 ω t 1 , i x i y i + γ t x t y t ,

and

ω t , 0 R 0 ϕ 0 = 1 γ t ω t 1,0 R 0 ϕ 0 ,

where we use

ω t , i = 1 γ t ω t 1 , i ,

which follows from (6). Hence, (22) is equivalent to

(23) ϕ ̂ t = R t 1 γ t x t y t + 1 γ t i = 1 t 1 ω t 1 , i x i y i + ω t 1,0 R 0 ϕ 0 .

Lagging (22) one period we find that

R t 1 ϕ ̂ t 1 = i = 1 t 1 ω t 1 , i x i y i + ω t 1,0 R 0 ϕ 0 ,

which can be substituted into (23) to yield

(24) ϕ ̂ t = R t 1 γ t x t y t + 1 γ t R t 1 ϕ ̂ t 1 .

From (3) notice that

1 γ t R t 1 = R t γ t x t x t ,

which substituted into (24) and after rearranging leads to

ϕ ̂ t = R t 1 γ t x t y t + R t γ t x t x t ϕ ̂ t 1 , = γ t R t 1 x t y t + ϕ ̂ t 1 γ t R t 1 x t x t ϕ ̂ t 1 , = ϕ ̂ t 1 + γ t R t 1 x t y t x t ϕ ̂ t 1 ,

establishing the correspondence between the penalized WLS solution of (5) and the RLS of (2).

A.2 Absolute and Relative Weights

Letting W t n stand for the sum of weights starting from weight n up to weight t, from the definition of the absolute weights, (6), this sum of weights can be expanded according to

(25) W t 0 = i = 0 t ω t , i , = j = 1 t 1 γ j + i = 1 t 1 γ i j = i + 1 t 1 γ j + γ t .

Expanding the first term of (25) we have that

(26) ω t , 0 = 1 γ 1 1 γ 2 1 γ t 1 1 γ t , = 1 γ 2 1 γ t 1 1 γ t γ 1 j = 2 t 1 γ j , = 1 γ t i = 1 t 1 γ i j = i + 1 t 1 γ j .

Returning to (25) we then have

W t 0 = 1 γ t i = 1 t 1 γ i j = i + 1 t 1 γ j + i = 1 t 1 γ i j = i + 1 t 1 γ j + γ t , = 1 .

A.3 Equivalent Time-Varying Gains Under Diffuse Initials

The sequence of gains, γ ̃ t , that generates equivalent weightings as a constant-gain under diffuse initials needs to solve

(27) ϖ t , l = ϖ t , l dcg , = κ 1 γ ̄ t 1 + κ 1 1 γ ̄ t for l = t , γ ̄ 1 γ ̄ l 1 + κ 1 1 γ ̄ t for 0 l < t ,

for all t and l. From Equation (7), starting with l = 0 we simply have that

(28) γ ̃ t = γ ̄ / 1 + κ 1 1 γ ̄ t .

It only remains to validate if Equation (28) also solves Equation (27) for l > 0. Substituting Equation (28) into Equation (7) for 0 < l < t,

ϖ t , l = γ ̄ 1 + κ 1 1 γ ̄ t l j = 0 l 1 1 γ ̄ 1 + κ 1 1 γ ̄ t j , = γ ̄ 1 γ ̄ l 1 + κ 1 1 γ ̄ t l j = 0 l 1 1 + κ 1 1 γ ̄ t j 1 1 + κ 1 1 γ ̄ t j , = γ ̄ 1 γ ̄ l 1 + κ 1 1 γ ̄ t l 1 + κ 1 1 γ ̄ t l 1 + κ 1 1 γ ̄ t , = γ ̄ 1 γ ̄ l 1 + κ 1 1 γ ̄ t ,

which solves Equation (27) for 0 < l < t. Similarly, for the initial, l = t,

ϖ t , l = j = 1 t 1 γ ̄ 1 + κ 1 1 γ ̄ j , = 1 γ ̄ t j = 1 t 1 + κ 1 1 γ ̄ j 1 1 + κ 1 1 γ ̄ j , = 1 γ ̄ t κ 1 + κ 1 1 γ ̄ t , = κ 1 γ ̄ t 1 + κ 1 1 γ ̄ t .

Finally, notice that under diffuse initials, κ = 0, the weight given to the learning initials is null, i.e. ϖ t , t dcg = 0 . This is equivalent to using a γ 1 = 1, which is again satisfied by Equation (28).

Appendix B: Supplementary Results

Figure 12: 
Estimates of simulated Phillips curve model with constant-gain learning – using diffuse WLS pre-sample initials. Notes: Same as Figures 4 and 5 except that using diffuse WLS pre-sample estimates for R
0.
Figure 12:

Estimates of simulated Phillips curve model with constant-gain learning – using diffuse WLS pre-sample initials. Notes: Same as Figures 4 and 5 except that using diffuse WLS pre-sample estimates for R 0.

Figure 13: 
Estimates of simulated Phillips curve model with constant-gain learning – using OLS pre-sample initials. Notes: Same as Figures 4 and 5 except that using OLS pre-sample estimates for R
0.
Figure 13:

Estimates of simulated Phillips curve model with constant-gain learning – using OLS pre-sample initials. Notes: Same as Figures 4 and 5 except that using OLS pre-sample estimates for R 0.

Figure 14: 
Estimates of simulated Phillips curve model with constant-gain learning – using OLS plus diffuse WLS pre-sample initials. Notes: Same as Figures 4 and 5 except that using OLS plus diffuse WLS pre-sample estimates for R
0.
Figure 14:

Estimates of simulated Phillips curve model with constant-gain learning – using OLS plus diffuse WLS pre-sample initials. Notes: Same as Figures 4 and 5 except that using OLS plus diffuse WLS pre-sample estimates for R 0.

Figure 15: 
Estimates of simulated Phillips curve model with constant-gain learning – using OLS plus WLS pre-sample initials. Notes: Same as Figures 4 and 5 except that using OLS plus WLS pre-sample estimates for R
0.
Figure 15:

Estimates of simulated Phillips curve model with constant-gain learning – using OLS plus WLS pre-sample initials. Notes: Same as Figures 4 and 5 except that using OLS plus WLS pre-sample estimates for R 0.

Figure 16: 
Estimates of simulated Phillips curve model with constant-gain learning – with jointly estimated learning gains. Notes: Same as Figures 4 and 5 except that jointly estimating the learning gains as in Figure 8.
Figure 16:

Estimates of simulated Phillips curve model with constant-gain learning – with jointly estimated learning gains. Notes: Same as Figures 4 and 5 except that jointly estimating the learning gains as in Figure 8.

Figure 17: 
Estimates of simulated Phillips curve model with constant-gain learning – alternative initials uncertainty re-scaling. Notes: Same as Figures 4 and 5 with additional re-scaling of initials uncertainty, R
0.
Figure 17:

Estimates of simulated Phillips curve model with constant-gain learning – alternative initials uncertainty re-scaling. Notes: Same as Figures 4 and 5 with additional re-scaling of initials uncertainty, R 0.

Figure 18: 
Estimates of individual learning gains from survey forecasts – using OLS pre-sample initials. Notes: See notes to Figure 11.
Figure 18:

Estimates of individual learning gains from survey forecasts – using OLS pre-sample initials. Notes: See notes to Figure 11.

Table 2:

Joint estimates of simulated Philips curve model with constant-versus decreasing-gain.

Sample Constant-gain Decreasing-gain
Size Correct Diffuse Δ Correct Diffuse Δ
(a) Median β estimates
T = 50 0.617 0.183 −0.434 0.360 0.152 −0.208
T = 100 0.689 0.333 −0.356 0.528 0.283 −0.245
T = 250 0.787 0.572 −0.215 0.717 0.497 −0.220
T = 1000 0.866 0.804 −0.062 0.867 0.764 −0.103
(b) Median λ estimates
T = 50 0.339 0.496 0.157 0.445 0.510 0.065
T = 100 0.298 0.434 0.136 0.373 0.460 0.087
T = 250 0.254 0.341 0.087 0.293 0.375 0.082
T = 1000 0.218 0.246 0.028 0.224 0.268 0.044
  1. Notes: The estimates under constant-gain are from the same exercise as reported in 5. The estimates under decreasing-gain only differ in that a sequence of decreasing gains, given by γ t = 1/t + 1, is used instead of a constant-gain.

References

Berardi, M., and J. K. Galimberti. 2013. “A Note on Exact Correspondences between Adaptive Learning Algorithms and the Kalman Filter.” Economics Letters 118 (1): 139–42. https://doi.org/10.1016/j.econlet.2012.10.002.Search in Google Scholar

Berardi, M., and J. K. Galimberti. 2017a. “Empirical Calibration of Adaptive Learning.” Journal of Economic Behavior & Organization 144: 219–37. https://doi.org/10.1016/j.jebo.2017.10.004.Search in Google Scholar

Berardi, M., and J. K. Galimberti. 2017b. “On the Initialization of Adaptive Learning in Macroeconomic Models.” Journal of Economic Dynamics and Control 78: 26–53. https://doi.org/10.1016/j.jedc.2017.03.002.Search in Google Scholar

Berardi, M., and J. K. Galimberti. 2019. “Smoothing-Based Initialization for Learning-To-Forecast Algorithms.” Macroeconomic Dynamics 23 (3): 1008–23. https://doi.org/10.1017/s1365100517000128.Search in Google Scholar

Bray, M. 1982. “Learning, Estimation, and the Stability of Rational Expectations.” Journal of Economic Theory 26 (2): 318–39. https://doi.org/10.1016/0022-0531(82)90007-2.Search in Google Scholar

Carceles-Poveda, E., and C. Giannitsarou. 2007. “Adaptive Learning in Practice.” Journal of Economic Dynamics and Control 31 (8): 2659–97. https://doi.org/10.1016/j.jedc.2006.09.004.Search in Google Scholar

Chevillon, G., M. Massmann, and S. Mavroeidis. 2010. “Inference in Models with Adaptive Learning.” Journal of Monetary Economics 57 (3): 341–51. https://doi.org/10.1016/j.jmoneco.2010.02.003.Search in Google Scholar

Cho, I.-K., N. Williams, and T. J. Sargent. 2002. “Escaping Nash Inflation.” The Review of Economic Studies 69 (1): 1–40. https://doi.org/10.1111/1467-937x.00196.Search in Google Scholar

Christopeit, N., and M. Massmann. 2018. “Estimating Structural Parameters in Regression Models with Adaptive Learning.” Econometric Theory 34: 68–111. https://doi.org/10.1017/s0266466616000529.Search in Google Scholar

Coibion, O., Y. Gorodnichenko, and R. Kamdar. 2018. “The Formation of Expectations, Inflation, and the Phillips Curve.” Journal of Economic Literature 56 (4): 1447–91. https://doi.org/10.1257/jel.20171300.Search in Google Scholar

Cole, S. J., and F. Milani. 2021. “Heterogeneity in Individual Expectations, Sentiment, and Constant-Gain Learning.” Journal of Economic Behavior & Organization 188: 627–50. https://doi.org/10.1016/j.jebo.2021.05.034.Search in Google Scholar

Evans, G. W., and S. Honkapohja. 1993. “Adaptive Forecasts, Hysteresis, and Endogenous Fluctuations.” Economic Review: 3–13.10.1016/0014-2921(93)90049-GSearch in Google Scholar

Evans, G. W., and S. Honkapohja. 2001. “Learning and Expectations in Macroeconomics.” In Frontiers of Economic Research. Princeton: Princeton University Press.Search in Google Scholar

Evans, G. W., S. Honkapohja, and N. Williams. 2010. “Generalized Stochastic Gradient Learning.” International Economic Review 51 (1): 237–62. https://doi.org/10.1111/j.1468-2354.2009.00578.x.Search in Google Scholar

Galimberti, J. K. 2019. “An Approximation of the Distribution of Learning Estimates in Macroeconomic Models.” Journal of Economic Dynamics and Control 102: 29–43. https://doi.org/10.1016/j.jedc.2019.03.003.Search in Google Scholar

Gordon, R. J. 2011. “The History of the Phillips Curve: Consensus and Bifurcation.” Economica 78 (309): 10–50. https://doi.org/10.1111/j.1468-0335.2009.00815.x.Search in Google Scholar

Ljung, L., and T. Soderstrom. 1983. Theory and Practice of Recursive Identification. Cambridge, MA: The MIT Press.Search in Google Scholar

Lubik, T. A., and C. Matthes. 2016. “Indeterminacy and Learning: An Analysis of Monetary Policy in the Great Inflation.” Journal of Monetary Economics 82: 85–106. https://doi.org/10.1016/j.jmoneco.2016.07.006.Search in Google Scholar

Malmendier, U., and S. Nagel. 2016. “Learning from Inflation Experiences.” Quarterly Journal of Economics 131 (1): 53–87. https://doi.org/10.1093/qje/qjv037.Search in Google Scholar

Marcet, A., and J. P. Nicolini. 2003. “Recurrent Hyperinflations and Learning.” The American Economic Review 93 (5): 1476–98. https://doi.org/10.1257/000282803322655400.Search in Google Scholar

Marcet, A., and T. J. Sargent. 1989. “Convergence of Least Squares Learning Mechanisms in Self-Referential Linear Stochastic Models.” Journal of Economic Theory 48 (2): 337–68. https://doi.org/10.1016/0022-0531(89)90032-x.Search in Google Scholar

Markiewicz, A., and A. Pick. 2014. “Adaptive Learning and Survey Data.” Journal of Economic Behavior & Organization 107: 685–707. https://doi.org/10.1016/j.jebo.2014.04.005.Search in Google Scholar

Mavroeidis, S., M. Plagborg-Moeller, and J. H. Stock. 2014. “Empirical Evidence on Inflation Expectations in the New Keynesian Phillips Curve.” Journal of Economic Literature 52 (1): 124–88. https://doi.org/10.1257/jel.52.1.124.Search in Google Scholar

Milani, F. 2007. “Expectations, Learning and Macroeconomic Persistence.” Journal of Monetary Economics 54 (7): 2065–82. https://doi.org/10.1016/j.jmoneco.2006.11.007.Search in Google Scholar

Milani, F. 2008. “Learning, Monetary Policy Rules, and Macroeconomic Stability.” Journal of Economic Dynamics and Control 32 (10): 3148–65. https://doi.org/10.1016/j.jedc.2007.12.004.Search in Google Scholar

Milani, F. 2011. “Expectation Shocks and Learning as Drivers of the Business Cycle.” The Economic Journal 121 (552): 379–401. https://doi.org/10.1111/j.1468-0297.2011.02422.x.Search in Google Scholar

Orphanides, A., and J. C. Williams. 2005. “The Decline of Activist Stabilization Policy: Natural Rate Misperceptions, Learning, and Expectations.” Journal of Economic Dynamics and Control 29 (11): 1927–50. https://doi.org/10.1016/j.jedc.2005.06.004.Search in Google Scholar

Pfajfar, D., and E. Santoro. 2010. “Heterogeneity, Learning and Information Stickiness in Inflation Expectations.” Journal of Economic Behavior & Organization 75 (3): 426–44. https://doi.org/10.1016/j.jebo.2010.05.012.Search in Google Scholar

Primiceri, G. E. 2006. “Why Inflation Rose and Fell: Policy-Makers’ Beliefs and U. S. Postwar Stabilization Policy.” Quarterly Journal of Economics 121 (3): 867–901. https://doi.org/10.1162/qjec.121.3.867.Search in Google Scholar

Sargent, T., N. Williams, and T. Zha. 2006. “Shocks and Government Beliefs: The Rise and Fall of American Inflation.” The American Economic Review 96 (4): 1193–224. https://doi.org/10.1257/aer.96.4.1193.Search in Google Scholar

Sargent, T. J. 1999. The Conquest of American Inflation. Princeton: Princeton University Press.10.1515/9780691186689Search in Google Scholar

Sargent, T. J., and N. Williams. 2005. “Impacts of Priors on Convergence and Escapes from Nash Inflation.” Review of Economic Dynamics 8 (2): 360–91. https://doi.org/10.1016/j.red.2004.10.010.Search in Google Scholar

Slobodyan, S., and R. Wouters. 2012a. “Learning in a Medium-Scale Dsge Model with Expectations Based on Small Forecasting Models.” American Economic Journal: Macroeconomics 4 (2): 65–101. https://doi.org/10.1257/mac.4.2.65.Search in Google Scholar

Slobodyan, S., and R. Wouters. 2012b. “Learning in an Estimated Medium-Scale Dsge Model.” Journal of Economic Dynamics and Control 36 (1): 26–46. https://doi.org/10.1016/j.jedc.2011.01.016.Search in Google Scholar

Williams, N. 2019. “Escape Dynamics in Learning Models.” The Review of Economic Studies 86 (2): 882–912. https://doi.org/10.1093/restud/rdy033.Search in Google Scholar

Woodford, M. 2003. Interest and Prices. Princetion: Princeton University Press.10.1515/9781400830169Search in Google Scholar

Received: 2023-05-02
Accepted: 2023-10-06
Published Online: 2023-10-23

© 2023 Walter de Gruyter GmbH, Berlin/Boston

Downloaded on 6.9.2025 from https://www.degruyterbrill.com/document/doi/10.1515/bejm-2023-0069/html
Scroll to top button