Abstract
We propose a new state space model to estimate the Integrated Variance (IV) in the presence of microstructure noise. Applying the pre-averaging sampling scheme to the irregularly spaced high-frequency data, we derive equidistant efficient price approximations to calculate the noise-contaminated realised variance (NCRV), which is used as an IV estimator. The theoretical properties of the new volatility estimator are illustrated and compared with those of the realised volatility. We highlight the robustness of the new estimator to market microstructure noise (MMN). The pre-averaging sampling effectively eliminates the influence of the MMN component on the NCRV series. The empirical illustration features the EUR/USD exchange rate and provides evidence of a superior performance in volatility forecasting at very high sampling frequencies.
-
Author contributions: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.
-
Research funding: None declared.
-
Conflict of interest statement: The authors declare no conflicts of interest regarding this article.
Appendix A. Proofs
A.1 Proof of Lemma 3.1
Given an equidistant partition in Eq. (3.1), for any two adjacent true prices within a sub-interval (τ
i
, τ
i+1] on day T, in other words,
In order to prove Lemma 3.1, we require values of
The definition of the RV shows that
where
Given the assumption that the price process follows a one factor SR-SARV, we can make use of the results in Barndorff-Nielsen and Shephard (2002) and calculate the variance as
where λ 1 and ω 1 are parameters in the SR-SARV model. Given an assumption that the IV is constant throughout a trading day, the price process within a trading day follows a Brownian motion with λ 1 = 0 and ω 1 = 0. Hence,
The result above shows the conditional variance of the discretised error
where m is the number of sub-intervals on the partition in Eq. (3.1). Based on Eq. (A.1) and under the assumption of a constant IV throughout a trading T, we have
To find the value of
On day T, using the property of the Brownian motion that
and
In Eq. (3.9), we defined the discretised error d T when the pre-averaging sampling scheme is employed. Since σ T and σ T−1 are known and are constant on corresponding days, it holds that
To find the variance of d
T
, the proof will focus on the elements of the ith sub-interval when expanding the variance of the
Using the pre-averaging sampling scheme, we notice that the elements of the ith sub-interval only appear in term
where A and B represent the elements that are not contained in the ith sub-interval. In the above equation
Given the equations
we can separately calculate the variance of ② + ③:
We can expand A 2 and B 2 to get similar results.
After we expand ①, we obtain
Then,
The same result can be obtained for ④. Finally, the discretised error can be expressed as
We note that
A.2 Proof of Lemma 3.2
Let
in other words, θ i ’s are also i.i.d. random variables. Given definition of e i = θ i − θ i−1 in Eq. (3.5) and based on the properties of θ i from Eqs. (A.2) and (A.3), we can complete the proof of Lemma 3.2 as follows:
A.3 Proof of Lemma 3.3
Since MMN is independent of the price process, we can write:
Lemma 3.2 shows that
Again, using the independence between the MMN and the price process, we have
We proceed by calculating the value of each part in Eqs. (A.4) and (A.5) separately. For ①, since
Using the methodology outlined in Alexeev, Chen, and Ignatieva (2021),
and
where
Combining the result with Lemma 3.2, we obtain
For part ②, using the independence of the random variables θ i ’s in the Proof of Lemma 3.2, we obtain
In Eq. (A.6),
Therefore,
and
For part ③,
where
and
using i.i.d. property of θ i ’s. Therefore,
and
For part ④ we have
Thus,
Finally,
Then, when considering the covariance between u T and u T−1, we obtain
and
Appendix B. Simulation study
In this simulation study we compare and contrast the finite sample performance of both methods; the one developed in this paper and the method of Nagakura and Watanabe (2015). Following Alexeev, Chen, and Ignatieva (2021), we choose Heston model for the data generation process, refer to Heston (1993). We assume that X t , the true log price of an asset, follows Heston stochastic volatility model defined as follows:
Here, ν
t
denotes stochastic variance. We set the same parameters as in Jacod et al. (2009), that is, μ = 0.05/252, κ = 5/252, θ = 0.04/252, ξ = 0.05/252 and ρ = −0.5, where ρ = corr(W
1,t
, W
2,t
) is the correlation between Brownian motions. The microstructure noise
To estimate the parameters of the two models in our paper, given generated data, we set the initial values for each model as follows:
Estimation results are reported in Tables B.7 and B.8 below.
Estimated parameters for the SV model.
| M | 4320 | 2880 | 1440 | 288 |
|---|---|---|---|---|
| Panel A: previous tick sampling scheme | ||||
| Δ k | 20 s | 30 s | 1 min | 5 mins |
|
|
0.3032 | 0.3355 | 0.3289 | 0.9328 |
|
|
0.0908 | 0.0950 | 0.1046 | 0.1392 |
|
|
0.0031 | 0.0026 | 0.0012 | 0.0006 |
|
|
0.0822 | 0.0955 | 0.1191 | 0.3201 |
|
|
0.0069 | 0.0082 | 0.0112 | 0.0757 |
| L | −4145.4951 | −3046.3032 | −1219.7400 | −2826.5891 |
| Panel B: pre-averaging sampling scheme | ||||
| Δ k | 20 s | 30 s | 1 min | 5 mins |
|
|
0.9999 | 0.9997 | 0.9998 | 1.0000 |
|
|
0.0684 | 0.0323 | 0.0136 | 0.0330 |
|
|
6.0309 | 3.1934 | 0.2093 | 0.0545 |
|
|
0.0684 | 0.0323 | 0.0136 | 0.0330 |
| L | −148.2576 | 1244.3137 | 7422.0095 | 2861.3558 |
Estimates of parameters in the state space model.
| M | 4320 | 2880 | 1440 | 288 |
|---|---|---|---|---|
| Panel A: previous tick sampling scheme | ||||
| Δ k | 20 s | 30 s | 1 min | 5 mins |
|
|
0.3032 | 0.3355 | 0.3289 | 0.9328 |
|
|
0.0633 | 0.0632 | 0.0702 | 0.0093 |
|
|
0.2478 | 0.2508 | 0.2502 | 0.2679 |
|
|
0.0016 | 0.0013 | 0.0006 | 0.0000 |
|
|
7.1040 | 5.5000 | 3.4289 | 1.8440 |
|
|
0.0001 | 0.0001 | 0.0002 | 0.0007 |
|
|
0.1320 | 0.1058 | 0.0739 | 0.1024 |
|
|
0.0000 | 0.0000 | 0.0000 | 0.0001 |
| Panel B: pre-averaging sampling scheme | ||||
| Δ k | 20 s | 30 s | 1 min | 5 mins |
|
|
0.9999 | 0.9997 | 0.9998 | 1.0000 |
|
|
0.0000 | 0.0001 | 0.0001 | 0.0000 |
|
|
0.2681 | 0.2679 | 0.2680 | 0.3595 |
|
|
0.0011 | 0.0013 | 0.0001 | 0.0000 |
|
|
0.0684 | 0.0323 | 0.0136 | 0.0330 |
From the tables we observe that κ 1 is less than one in Model 1, which indicates that IV can be modelled as an AR(1) model. In Model 2, κ 1 is close to 1 and c IV is close to 0, which means IV follows a random walk process. Furthermore, c u is decreasing when the sampling frequency is getting lower. In addition, Table B.9 reports MSE & QLIKE loss functions computed for two models based on NCRVlobs (Panel A) and NCRVavg (Panel B) at four sampling frequencies. Based on MSE and QLIKE, we observe that model 2, which is based on the pre-averaging sampling scheme, outperforms the model based on the previous tick sampling scheme at all four sampling frequencies.
In-sample forecast evaluation. MSE and QLIKE loss functions are computed for two models based on NCRVlobs (Panel A) and NCRVavg (Panel B) at four sampling frequencies in the in-sample study.
| Panel A: previous tick sampling scheme | ||
|---|---|---|
| Δ k | MSE | QLIKE |
| 20 s | 2.8771 | 14.4314 |
| 30 s | 2.8639 | 13.6612 |
| 1 min | 2.8339 | 12.1419 |
| 5 mins | 2.7281 | 8.5102 |
| Panel B: pre-averaging sampling scheme | ||
|---|---|---|
| Sampling frequency | MSE | QLIKE |
| 20 s | 4.7524 | 0.33730 |
| 30 s | 1.9162 | 0.2081 |
| 1 min | 0.4381 | 0.0822 |
| 5 mins | 0.6592 | 0.1462 |
References
Alexeev, V., J. Chen, and K. Ignatieva. 2021. “From Previous-Tick to Pre-averaging: Spectra of Equidistant Transformations for Unevenly Spaced High Frequency Data.” PhD Thesis, School of Risk and Actuarial Studies, UNSW Business School, UNSW Sydney.Search in Google Scholar
Andersen, T. G. 1994. “Stochastic Autoregressive Volatility: A Framework for Volatility Modeling.” Mathematical Finance 4: 75–102.Search in Google Scholar
Andersen, T. G., and T. Bollerslev. 1997. “Intraday Periodicity and Volatility Persistence in Financial Markets.” Journal of Empirical Finance 4: 115–58. https://doi.org/10.1016/s0927-5398(97)00004-2.Search in Google Scholar
Andersen, T. G., and T. Bollerslev. 1998. “Deutsche Mark–Dollar Volatility: Intraday Activity Patterns, Macroeconomic Announcements, and Longer Run Dependencies.” The Journal of Finance 53: 219–65. https://doi.org/10.1111/0022-1082.85732.Search in Google Scholar
Andersen, T. G., T. Bollerslev, F. X. Diebold, and H. Ebens. 2001. “The Distribution of Realized Stock Return Volatility.” Journal of Financial Economics 61: 43–76. https://doi.org/10.1016/s0304-405x(01)00055-1.Search in Google Scholar
Andersen, T. G., T. Bollerslev, F. X. Diebold, and P. Labys. 2003. “Modeling and Forecasting Realized Volatility.” Econometrica 71: 579–625. https://doi.org/10.1111/1468-0262.00418.Search in Google Scholar
Andreou, E., and E. Ghysels. 2002. “Detecting Multiple Breaks in Financial Market Volatility Dynamics.” Journal of Applied Econometrics 17: 579–600. https://doi.org/10.1002/jae.684.Search in Google Scholar
Bandi, F. M., and J. R. Russell. 2008. “Microstructure Noise, Realized Variance, and Optimal Sampling.” The Review of Economic Studies 75: 339–69. https://doi.org/10.1111/j.1467-937x.2008.00474.x.Search in Google Scholar
Barndorff-Nielsen, O. E., and N. Shephard. 2002. “Econometric Analysis of Realized Volatility and its Use in Estimating Stochastic Volatility Models.” Journal of the Royal Statistical Society: Series B 64: 253–80. https://doi.org/10.1111/1467-9868.00336.Search in Google Scholar
Beine, M., J. Lahaye, S. Laurent, C. J. Neely, and F. C. Palm. 2007. “Central Bank Intervention and Exchange Rate Volatility, its Continuous and Jump Components.” International Journal of Finance & Economics 12: 201–23. https://doi.org/10.1002/ijfe.330.Search in Google Scholar
Campbell, J. Y., A. W. Lo, and A. MacKinlay. 1997. The Econometrics of Financial Markets, 2nd ed. Princeton University Press.10.1515/9781400830213Search in Google Scholar
Caporin, M. 2022. “The Role of Jumps in Realized Volatility Modeling and Forecasting.” Journal of Financial Econometrics 1–26. https://doi.org/10.1093/jjfinec/nbab030.Search in Google Scholar
Fukasawa, M. 2010. “Realized Volatility with Stochastic Sampling.” Stochastic Processes and their Applications 120: 829–52. https://doi.org/10.1016/j.spa.2010.02.006.Search in Google Scholar
Heston, S. L. 1993. “A Closed-form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options.” Review of Financial Studies 6: 327–43. https://doi.org/10.1093/rfs/6.2.327.Search in Google Scholar
Jacod, J., Y. Li, P. A. Mykland, M. Podolskij, and M. Vetter. 2009. “Microstructure Noise in the Continuous Case: The Pre-averaging Approach.” Stochastic Processes and their Applications 119: 2249–76. https://doi.org/10.1016/j.spa.2008.11.004.Search in Google Scholar
Jacod, J., P. Protter (1998): “Asymptotic Error Distributions for the Euler Method for Stochastic Differential Equations,” Annals of Probability, 26, 267–307, https://doi.org/10.1214/aop/1022855419.Search in Google Scholar
Meddahi, N. 2001. “An Eigenfunction Approach for Volatility Modeling.” In Working Paper. Université de Montréal. Département de sciences économiques.Search in Google Scholar
Meddahi, N. 2003. “ARMA Representation of Integrated and Realized Variances.” The Econometrics Journal 6: 335–56. https://doi.org/10.1111/1368-423x.t01-1-00112.Search in Google Scholar
Mykland, P. A., and L. Zhang. 2016. “Between Data Cleaning and Inference: Pre-averaging and Robust Estimators of the Efficient Price.” Journal of Econometrics 194: 242–62. https://doi.org/10.1016/j.jeconom.2016.05.005.Search in Google Scholar
Nagakura, D., and T. Watanabe. 2015. “A State Space Approach to Estimating the Integrated Variance under the Existence of Market Microstructure Noise.” Journal of Financial Econometrics 13: 45–82. https://doi.org/10.1093/jjfinec/nbt015.Search in Google Scholar
Nelson, D. B. 1990. “Arch Models as Diffusion Approximations.” Journal of Econometrics 45: 7–38. https://doi.org/10.1016/0304-4076(90)90092-8.Search in Google Scholar
Owens, J. P., and D. G. Steigerwald. 2006. “Noise Reduced Realized Volatility: A Kalman Filter Approach.” In Econometric Analysis of Financial and Economic Time Series, 211–27. Emerald Group Publishing Limited.10.1016/S0731-9053(05)20008-7Search in Google Scholar
Patton, A. J., and K. Sheppard. 2009. “Evaluating Volatility and Correlation Forecasts.” In Handbook of Financial Time Series, 801–38. Springer.10.1007/978-3-540-71297-8_36Search in Google Scholar
Wasserfallen, W., and H. Zimmermann. 1985. “The Behavior of Intra-daily Exchange Rates.” Journal of Banking & Finance 9: 55–72. https://doi.org/10.1016/0378-4266(85)90062-7.Search in Google Scholar
Supplementary Material
This article contains supplementary material (https://doi.org/10.1515/snde-2021-0093).
© 2023 Walter de Gruyter GmbH, Berlin/Boston
Articles in the same Issue
- Frontmatter
- Research Article
- Analysis of heterogeneous duopoly game with information asymmetry based on extrapolative mechanism
- Review
- Modelling volatility dependence with score copula models
- Research Articles
- A new test for non-linear hypotheses under distributional and local parametric misspecification
- Optimization study of momentum investment strategies under asymmetric power-law distribution of return rate
- Comparison of Score-Driven Equity-Gold Portfolios During the COVID-19 Pandemic Using Model Confidence Sets
- Integrated variance of irregularly spaced high-frequency data: A state space approach based on pre-averaging
Articles in the same Issue
- Frontmatter
- Research Article
- Analysis of heterogeneous duopoly game with information asymmetry based on extrapolative mechanism
- Review
- Modelling volatility dependence with score copula models
- Research Articles
- A new test for non-linear hypotheses under distributional and local parametric misspecification
- Optimization study of momentum investment strategies under asymmetric power-law distribution of return rate
- Comparison of Score-Driven Equity-Gold Portfolios During the COVID-19 Pandemic Using Model Confidence Sets
- Integrated variance of irregularly spaced high-frequency data: A state space approach based on pre-averaging