VEC-MSF models in Bayesian analysis of short- and long-run relationships

Anna Pajor; Justyna Wróblewska

doi:10.1515/snde-2016-0004

Artikel Öffentlich zugänglich

VEC-MSF models in Bayesian analysis of short- and long-run relationships

Anna Pajor und Justyna Wróblewska

Veröffentlicht/Copyright: 6. April 2017

Veröffentlicht von

Veröffentlichen auch Sie bei De Gruyter Brill

Manuskript einreichen Informationen für Autor*innen

Aus der Zeitschrift Studies in Nonlinear Dynamics & Econometrics Band 21 Heft 3

Abstract

The paper aims at developing new Bayesian Vector Error Correction – Stochastic Volatility (VEC-SV) models, which combine the VEC representation of a VAR structure with stochastic volatility, represented by either the multiplicative stochastic factor (MSF) process or the MSF-SBEKK specification. Appropriate numerical methods (MCMC-based algorithms) are adapted for estimation and comparison of these type of models. Based on data coming from the Polish economy (time series of unemployment, inflation, interest rates, and of PLN/EUR, PLN/USD and EUR/USD exchange rates) it is shown that the models and numerical methods proposed in our study work well in simultaneous modelling of volatility and long-run relationships.

Keywords: Bayesian analysis; cointegration; Markov Chain Monte Carlo methods; stochastic volatility

JEL Classification: C11; C32

Introduction

A frequently used model structure for macroeconomic time series is the vector autoregression with the error correction mechanism and with a constant conditional covariance matrix. In turn, conditional heteroscedasticity processes serve as typical models for the volatility of financial time series. However, some research has shown that the variability of macroeconomic time series (measured, for example, by the conditional covariance matrix) can vary over time, and also that on financial markets there exist long-term relationships. Therefore, it appears essential to formulate such models in which the possible presence of long-run relationships and time-variable volatility are simultaneously taken into account. A fast growing econometrics literature (see, e.g. Cogley and Sargent 2005; Primiceri 2005; Sims and Zha 2006; Koop, León-González, and Strachan 2009; Koop and Potter 2011; Koop 2012; Belmonte and Koop 2014) has mainly focused on two particular extensions of a basic vector autoregressive (VAR) model: specifications with time-varying parameters and the ones with time-varying conditional covariances. Our particular attention is paid to the vector error correction (VEC) representation of VAR with the foregoing extensions (see, e.g. Seo 2007; Herwartz and Lütkepohl 2011; Koop, León-González, and Strachan 2011; Cavaliere et al. 2015). However, due to the lack of identification of cointegrating vectors (only the space spanned by the cointegrating vectors can be identified), the literature on the VEC models is much more limited than that on VAR. Also, studies on the Bayesian inference within the corresponding VEC structures are quite rare. Therefore, it is desirable to fill this gap in the literature and to develop Bayesian methods for VEC models allowing for stochastic volatility (SV).

The paper aims at developing new Bayesian Vector Error Correction – Stochastic Volatility (VEC-SV) models, which combine the VEC representation of a VAR structure with stochastic volatility, represented by either the multiplicative stochastic factor (MSF) process or the MSF-SBEKK specification. The MSF structure is one of the simplest amongst the Multivariate Stochastic Volatility (MSV) models (since it features only one latent process, and the conditional covariance matrix does not depend on the past information set), and it can be used to describe time-varying volatility in a parsimonious manner. It is worth mentioning that this SV specification has been used in constructing hybrid models combining multivariate stochastic volatility and multivariate generalized conditional heteroscedasticity structures (MSF-SBEKK, among others), proposed by (Osiewalski 2009; Osiewalski and Pajor 2009), and further developed and extended by (Osiewalski and Pajor 2010; Pajor and Osiewalski 2012; Osiewalski and Osiewalski 2012).

Another objective of this paper is to develop Bayesian methodology for estimation and comparison of the models in question. Bayesian statistical analysis in the class of proposed models requires using some numerical methods, in particular Markov Chain Monte Carlo (MCMC) algorithms. Therefore, one of objectives of the research is to adapt relevant numerical methods to allow the estimation, prediction and comparison of such model structures. In particular, we employ the Gibbs sampler and the Metropolis-Hastings procedure.

The paper is organized as follows. In Section 2 the basic framework is set and the Bayesian models are formulated. In Section 3 we elaborate on the numerical methods. Finally, Section 4 is devoted to an empirical study, in which we use a standard set of the Polish macroeconomics variables (including the unemployment, inflation and interest rates), and show that relaxing the usual assumption of conditional homoscedasticity by introducing the MSF or the MSF-SBEKK structures is empirically valid. The resulting VEC-MSF and VEC-MSF-SBEKK models are also used for the analysis of relationships and volatility of selected main official Polish exchange rates.

Bayesian VEC-SV models

Consider a linear n-variate and k-order vector autoregressive [VAR(k)] process with deterministic terms and a stochastic volatility (SV) structure, represented in the form of a vector error correction (VEC) process, i.e. Vector Error Correction with Stochastic Volatility (VEC-SV) process:

(1)Δxt=Π˜xt − 1+∑i = 1k − 1ΓiΔxt − i+ΦDt+εt, t=1, 2, …, T,
(2)εt|ψt−1, Dt, qt, θ∼N(0, Σt),

where x_t is an n×1 random vector, Π˜ and Γ_i are n×n matrices of real coefficients (i=1, … k–1), matrix D_t is comprised of deterministic variables such as the constant and seasonal centred dummies, Φ is a parameter matrix, q_t is a latent variable, θ is a vector of parameters, ψ_t−1 denotes the past of the process {x_t } up to time t−1, and Σ_t =Σ(q_t , ψ_t−1). Moreover, Π˜=αβ˜′, with α and β˜′ being some n×r matrices, where r<n is the number of cointegration relationships (if they exist). The initial conditions x_−k+1, x_−k+2, …, x₀ are assumed to be known.

MSF and MSF-SBEKK structures in the VEC model

In this paper we consider two alternative stochastic volatility structures for matrix Σ_t : multiplicative stochastic factor (MSF) and hybrid MSF-SBEKK (type I; see Osiewalski and Pajor 2009).

The multiplicative stochastic factor structure for matrix Σt is as follows:

(3)Σt=qtΣ,

with ln q_t =ϕ ln q_t−1+σ_qη_t , {η_t }~iiN(0, 1).

Henceforth, this specification will be referred to as the VEC-MSF (VEC with Multiplicative Stochastic Factor) process. Although the VEC-MSF process features non-zero time-varying conditional covariances, the conditional correlations remain constant over time. Such a result is attributable to the fact that the very same q_t factor drives the dynamics of each element of Σ_t . The idea of the MSF structure dates back to (Quintana and West 1987), who specified the scaling factors to be some known constants. Under a different name of the stochastic discount factor (SDF) process, the MSF specification was then revived (though only conceptually, without being used in any application) by (Jacquier, Polson, and Rossi 1995) in modelling financial time series volatility. Later, the process was employed, e.g. by (Osiewalski and Pajor 2009; Pajor 2011; Pajor and Osiewalski 2012; Osiewalski and Osiewalski 2013).

As an alternative MSV specification we employ the MSF-SBEKK structure proposed by (Osiewalski 2009). Following (Osiewalski and Pajor 2009), for matrix Σ_t we assume the so-called type I hybrid MSF-SBEKK process:

(4)Σt=qtΣ˜t,

(5)Σ˜t=(1−a−b)Σ+b(εt−1εt−1′)+aΣ˜t−1,

with ln q_t =ϕ ln q_t−1+σ_qη_t , {η_t }~iiN(0, 1), a,b∈R.

Matrix Σ˜t is square, of order n and follows the scalar BEKK(1,1) structure. The specification resulting from introducing MSF-SBEKK error term (ε_t ) into the VEC model is further referred to as VEC-MSF-SBEKK. The presence of the scalar BEKK(1,1) structure in the conditional covariance matrix allows us to model time-varying conditional correlations without introducing more latent processes. In the hybrid MSF-SBEKK model two simple basic structures are nested. In the limiting case of σ_g →0 and ϕ=0 we obtain the VEC-SBEKK process, while setting b=0 and a=0 leads to the VEC-MSF case. As regards the initial conditions for Σ˜t, we assume ε₀=0, Σ˜0=s0,ΣIn, where s_0,Σ>0 and I_n denotes the identity matrix of size n.

Equation (1) can be decomposed and written as:

(6)Δxt=α[β˜′, Φ1′][xt − 1Dt(1)]+∑i = 1k − 1ΓiΔxt − i+Φ2Dt(2)+εt=αβ′z1,t+Γ′z2,t+Γs′z3,t+εt,

where β′=[β˜′, Φ1′],z_1,t =[x_t−1′, D_t⁽¹⁾′]′, z_2,t =(Δx_t−1′, Δx_t−2′, …, Δx_t−k+1′)′, z_3,t =D_t⁽²⁾, Γ=[Γ₁, Γ₂, …, Γ_k−1]′, Γ_s =Φ₂′, and α Φ1′Dt(1)+Φ2Dt(2)=ΦDt.

In order to simplify the notation let us write the basic model (6) in a matrix form:

(7)Z0=Z1Π′+Z2Γ+Z3Γs+E,

where Π=αβ′, Z₀=[Δx₁, Δx₂ … Δx_T ]′=[z_0,1z_0,2 … z_0,T ]′, Z₁=[z_1,1z_1,2 … z_1,T ]′, Z₂=[z_2,1z_2,2 … z_2,T ]′, Z₃=[z_3,1z_3,2 … z_3,T ]′, E=[ε₁, ε₂ … ε_T ]′.

The conditional distribution of x_t (given the past of the process, ψ_t−1, the deterministic variables, D_t , the parameters and the latent variable q_t ) is n-variate Normal with the mean μt=xt−1+Π˜xt−1+∑i=1k−1ΓiΔxt−i+ΦDt and the covariance matrix Σ_t :

(8)p(xt|ψt−1, Dt, α, β, Γ, Γs, qt, Σ, θΣ, q0,Σ)=fN,n(xt|μt, Σt),

where θ_Σ and q_0,Σ are the vectors of the stochastic volatility parameters: in the VEC-MSF model θΣ=(ϕ, σq2)′, and q_0,Σ=ln q₀, whereas in the VEC-MSF-SBEKK model we have θΣ=(ϕ, σq2, a, b)′ and q_0,Σ=(ln q₀, s_0,Σ)′. The vector q_0,Σ is treated as an additional vector of parameters and is estimated jointly with other parameters. The density of the data (given the parameters) is the mixture (over q=(q₁, q₂, …, q_T )′) distribution:

(9)p(x|D, α, β, Γ, Γs, Σ, θΣ, q0,Σ)=∫p(x|D, α, β, Γ, Γs, Σ, θΣ, q0,Σ, q)p(q|θΣ, q0,Σ)dq,

where x=[x₁x₂ … x_T ]′ denotes the full data set and D=[D₁D₂ … D_T ]. The two densities on the right hand side of (9) are given as:

p(x|D, α, β, Γ1, Γs, Σ, θΣ, q0,Σ, q)=∏t=1TfN,n(xt|μt,Σt),

and

p(q|θΣ, q0,Σ)=∏t=1Tqt−1fN,1(lnqt|ϕlnqt−1, σq2).

The prior distribution and Bayesian VEC-MSF model

We are interested in making inference about both the parameter vector θ=(vecα′, vecβ′, vecΓ′, vecΓ_s ′, vechΣ′, θ_Σ′, q_0,Σ)′ and the latent variables vector q. Within the Bayesian approach, parameters and latent variables are treated as random variables. The joint density of the vector of observations, latent variables and parameters, which determines the Bayesian model, can be written as follows:

where p(θ) denotes the density of prior distribution. This density is also conditioned by certain initial observations, which are omitted from our notation. To complete the Bayesian models, we specify the prior distribution of the parameter vector θ:

(11)p(θ)=p(α|β)p(β)p(Γ)p(Γs)p(Σ)p(θΣ)p(q0,Σ)I[0,1](|λ|max),

where I_{[a, b]}(.) denotes the indicator function of the interval [a, b] and λ is the vector of the eigenvalues of the companion matrix, that is the matrix of the form:

[Π˜+In+Γ1Γ2−Γ1⋯Γk−1−Γk−2−Γk−1In0⋯000In00⋮⋱⋮⋮00⋯In0],

which makes it possible to write the analysed process in the VAR(1) form.

According to (11), we assume that certain blocks of the parameters are a priori independent, and that the stability condition is imposed on the parameters of the VEC process. Furthermore, for matrix Π the following parameterisations are used (as proposed by Koop, León-González, and Strachan 2010):

αβ′=(αMΠ)(αMΠ−1)′≡AB′,

where M_Π is an r×r symmetric positive-definite matrix, A and B are unrestricted matrices. Moreover, α=A(B′B)12, and β=B(B′B)− 12, so β has orthonormal columns, and it is an element of the Stiefel manifold V_r,m (represented by the matrix space of m×r matrices with orthonormal columns). The data inform only about the cointegration space, which is the element of the Grassmann manifold G_r,m−r , i.e. the space of r-dimensional hyperplanes in ℝ^m . In order to take into account the many-to-one relationship between the Stiefel and the Grassmann manifolds, we normalise the columns of β to have positive first elements (with the help of a diagonal matrix whose elements are equal to either 1 or −1).

Next, we state the prior distributions (see Koop, León-González, and Strachan 2010 for the discussion).

For matrix B the matrix normal distribution is used:
p(B|τ, r)=f_mN (B|0, I_r , P_τ ), which leads us to the matrix angular central Gaussian (MACG) distribution for β: p(β)=f_MACG (β|P_τ ) (see e.g. Chikuse 2002).
The prior information for the space spanned by β may be incorporated into the model via matrix P_τ , which is constructed as follows:
Pτ=HBHB′+τHB⊥HB⊥′,
where H_B is a matrix with orthonormal columns containing prior information about the cointegration space and HB⊥ represents its orthogonal complement. If we assume that parameter matrix P_τ in the MACG distribution is an identity matrix, we obtain the uniform distribution over the Stiefel manifold V_r,m and also the uniform distribution over the Grassmann manifold G_r,m−r . Additionally, it is worth recalling that unless the prior information about the cointegration space of the highest considered rank is available, the researcher willing to perform Bayesian model selection (including inference about the cointegration rank) should use the uniform prior over the Grassmann manifold.
On matrix A we also impose the matrix normal distribution:
p(A|μA, G, v, r)=fmN(A|μA, vIr, G).
Parameters ν and τ control the degrees of informativeness of the distributions stated above, and may be either set arbitrarily by the researcher or estimated. In the latter case, usually inverse gamma prior distributions are used: p(v/s_v , n_v )=f_IG(v|s_v , n_v ) and p(τ|s_τ , n_τ )=f_IG (τ|s_τ , n_τ ) (see e.g. Koop, León-González, and Strachan 2010). Such values of s_τ and n_τ should be set that almost all the resulting prior probability is allocated close to zero. For τ close to zero one imposes most of the prior probability to the spaces close to those spanned by H_B , whereas for τ equal 1 one gets noninformative priors for the estimated spaces (see Koop, León-González, and Strachan 2010 for the discussion).

The priors for the remaining parameters are as follows:

p(Γ|μ_Γ, H, h)=f_mN (Γ|μ_Γ, H, hI_l ), where l=n(k−1), H – an n×n positive-definite symmetric matrix,
p(Γs|μΓs, Hs, hs)=fmN(Γs|μΓs, Hs, hsIls), where H_s – an n×n positive-definite symmetric matrix, l_s – the number of deterministic terms in D_t ,
where p(h|n_h , s_h )=f_IG (h|n_h , s_h ) and p(h_s |n_hs , s_hs )=f_IG (h_s |n_hs , s_hs ).
p(ϕ)∝f_N,1(ϕ|μ_φ , Ω_φ ) I_(−1,1)(ϕ), – the normal density with mean μ_φ and variance Ω_φ , truncated by the restriction |ϕ|<1;
p(σq2)=fIG(σq2|aσ, bσ) – the density of the inverse gamma distribution with the mean b_σ /(a_σ −1) and variance equal to bσ2/[(aσ−1)(aσ−2)2];
p(Σ)=f_IW (Σ|Ω_Σ, μ_Σ, n) – the inverted Wishart distribution with mean Ω_Σ/(μ_Σ−n−1), μ_Σ>n+1;
p(a,b)∝I_(0,1)(a+b) – the uniform distribution over the unit simplex;
p(lnq₀)=f_N,1 (lnq₀|μ_q , Ω_q );
p(s_0,Σ)=f_Exp(s_0,Σ|μ_q,Σ) – the exponential distribution with mean 1/μ_q .
Finally, for θ_Σ we use the same priors as in (Osiewalski and Pajor 2009; Pajor and Osiewalski 2012).

MCMC methods for Bayesian estimation and model comparison

The joint posterior distribution, represented by p(q, θ|x), is too complicated to obtain a closed-form expression for any of its characteristics. To estimate these, we resort to MCMC methods, which allow one to generate a (pseudo-) random sample from the considered distribution.

Gibbs and Metropolis-Hastings algorithms

In order to draw from the joint posterior distribution of unknown parameters and latent variables we use a MCMC method, specifically, the Metropolis and Hastings algorithm within the Gibbs sampler. The procedure generates drawings from a joint distribution (as a stationary distribution) by sequentially sampling from the full conditional distributions (see, e.g. Gamerman 1997). Thus, we construct a Gibbs sampler with limiting distribution equal to the joint posterior distribution p(q, θ|x). The Gibbs sampler for the univariate stochastic volatility models, presented by (Jacquier, Polson, and Rossi 1994), can be easily generalised to the VEC-SV models.

In the VEC-MSF model one has to draw the [T+nr+mr+l_sn+n²(k−1)+n(n+1)/2]-dimensional vector (θ′, q′)′, which is split up into T+12 separate Gibbs steps, 11 of which are quite straightforward, while the remaining one poses more of a challenge.

Before presenting the conditional posterior distributions for the model parameters, let us introduce some additional notation:

Wi=[wi,1wi,2⋮wi,T] with wi,t=Σt−1/2z′i,t for i=0, 1, 2, 3,

where Σ_t^−1/2 denotes the matrix inverse of the Cholesky factor of Σ_t .

The equation for the observation t may be written as follows:

w0,t=w1,tvec(BA′)+w2,tvec(Γ)+w3,tvec(Γs)+ut, {ut}∼iiN(0, In).

We can now move on to presenting the conditional distributions of the VEC form parameters. They are either Normal (for the vectorisations of the matrix parameters) or inverse Gamma for the scalar ones:

(12)p(vec(A′)|⋅, x)=fN(vec(A′)|μ¯A′, Ω¯A′),
where
Ω¯A′=[(G−1⊗1νIr)+(In⊗B′)W1′W1(In⊗B)]−1,μ¯A′=Ω¯A′[(G−1⊗1νIr)vec(μ′A)+(In⊗B′)W1′[W0−W2vec(Γ)−W3vec(Γs)]],
(13)p(vec(B)|⋅, x)=fN(vec(B)|μ¯B, Ω¯B),
where
Ω¯B=[(mIr⊗P1/τ)+(A′⊗Im)W1′W1(A⊗Im)]−1,
μ¯B=Ω¯B(A′⊗Im)W′1 [W0−W2vec(Γ)−W3vec(Γs)],
(14)p(vec(Γ)|⋅, x)=fN(vec(Γ)|μ¯Γ, Ω¯Γ),
where Ω¯Γ=[(H−1⊗1hIl)+W2′W2]−1,
μ¯Γ=Ω¯Γ{(H−1⊗1hIl)vec(μΓ)+W′2 [W0−W1vec(BA′)−W3vec(Γs)]},
(15)p(vec(Γs)|⋅, x)=fN(vec(Γs)|μ¯Γs, Ω¯Γs),
where Ω¯Γs=[(Hs−1⊗1hsIls)+W3′W3]−1,
μ¯Γs=Ω¯Γs{(Hs−1⊗1hsIls)vec(μΓs)+W′3 [W0−W1vec(BA′)−W2vec(Γ)]},
(16)p(v|⋅, x)=fIG(v|nv+nr2, sv+12tr[(A−μA)′G−1(A−μA)]),
(17)p(h|⋅, x)=fIG(h|nh+nl2, sh+12tr[H−1(Γ−μΓ)′(Γ−μΓ)]),
(18)p(hs|⋅, x)=fIG(hs|nhs+nls2, shs+12tr[Hs−1(Γs−μΓs)′(Γs−μΓs)]),
additionally, in the case of estimated τ with an inverse gamma prior imposed [τ~IG(n_τ , s_τ )], the full conditional posterior distribution is of the form:
(19)p(τ|⋅, x)∝|Pτ|−r/2τ−nτ − 1exp{−1τ[sτ+12tr(mB′HB⊥HB⊥′B)]}.
Distribution (19) does not belong to any standard distribution families. To obtain a sample from it, a Metropolis-Hastings algorithm can be employed with the proposed values drawn from the inverse gamma distribution: IG(nτ+mr2, sτ+12tr(mB′HB⊥HB⊥′B)). For a discussion about the meaning of τ and further recommendations for its estimation see e.g. (Koop, León-González, and Strachan 2010).

The conditional posteriors of the MSF parameters are defined by the following inverse gamma, truncated normal, inverted Wishart, and normal distributions:

(20)p(σq2|⋅, x)=fIG(σq2|μσ+T/2, s∗),
with s∗=bσ+0.5∑t=1T(lnqt−ϕlnqt−1)2;
(21)p(ϕ|⋅, x)∝fN,1(ϕ|α∗, σq2A∗−1)I(−1, 1)(ϕ),
where
α∗=A∗−1(W′Q+σq2Ωϕμϕ), A∗ij=W′W+σq2Ωφ,
W = (lnq₀,....,lnq_T–1)^′, Q = (lnq₁,.....,lnq_t)^′;
(22)p(Σ|⋅, x)∝fIW(Σ|ΩΣ+TS0, μΣ+T),
where S0=T−1∑t = 1Tqt−1(xt−μt)(xt−μt)′;
(23)p(lnq0|⋅, x)=fN,1(lnq0|b2b1−1, b1−1),
where b1=ϕ2/σq2+1/Ωq, b2=ϕlnq1/σq2+μq/Ωq.
Sampling from all these conditional distributions is straightforward (also in the case of the truncated ones, for which rejection sampling is applied).
The conditional posterior distributions of the unobserved variables in the VEC-MSF model are the following:
(24)p(qt|⋅, x)∝qt−(n+2)/2exp[−(2qt)−1(xt−μt)′Σ−1(xt−μt)]××exp[−(2σ∗q2)−1(lnqt−s∗t)2],
where for 1≤t≤T−1: s∗t=[ϕ(lnqt−1+lnqt+1)](1+ϕ2)−1,σ∗q2=σq2(1+ϕ2)−1, and for t=T: s∗T=ϕlnqT−1,σ∗q2=σq2.

Density kernel (24) is not standard, so in order to sample from p(q_t |·, x) we use an acception-rejection Metropolis-Hastings algorithm with the candidate generating distribution obtained by approximating the log-normal kernel in (24) by the kernel of the inverse gamma distribution with the same mean and variance as this log-normal distribution (see Jacquier, Polson, and Rossi, 1994). This yields an inverse Gamma distribution from which direct draws are easily generated (the product of two inverse Gamma distributions is still an inverse Gamma distribution). The candidate generating density is:

(25)q(qt|⋅)∝qt−(φ+1)exp(−θtqqt−1),

where φ=(1−2eσ∗q2)(1−eσ∗q2)+n2,θtq=(φ−n2−1)exp(s∗t+12σ∗q2)+(xt−μt)′Σ−1(xt−μt)/2.

Let us now describe the Gibbs sampler used in the VEC-MSF-SBEKK model. The numerical procedure is much more computationally demanding and time-consuming. Because matrix Σ˜t in the SBEKK structure is a function of the past information set, the full conditional posterior distribution for the VEC structure is very complicated:

(26)p(vecA′, vecB′, vecΓ′, vecΓs′, v, h, hs, τ|., x)∝∝p(vecA′, vecB′, vecΓ′, vecΓs′, v, h, hs, τ)∏t=1TfN,n(xt|μt, qtΣ˜t).

Drawing from the above full conditional posterior is riddled with serious numerical obstacles, unless its dimension (n) is small. In order to sample from (26) we implement the random walk Metropolis-Hastings algorithm with normal and truncated normal distributions centred at the previous values of the chain. The covariance matrix of the proposal distribution is a form of approximation of a specific part of the posterior covariance matrix, and it is determined by initial draws. For data sets used in our empirical illustrations the acceptance rate amounts to about 27%.

The conditional posterior distributions of ϕ, σq2 and lnq₀ in the MSF-SBEKK structure are also truncated normal, inverse gamma and normal, respectively, as in (19), (21) and (23). On the other hand, the conditional posterior of the parameters featured by the SBEKK part of the model is very complicated:

(27)p(a, b, s0,Σ, Σ|⋅, x)∝p(a, b, s0,Σ, Σ)∏t=1TfN,n(xt|μt, Σt).
Parameters a, b, s_0,Σ, and elements of matrix Σ can be sampled using the Metropolis-Hastings steps within the Gibbs sampler. We implement the random walk Metropolis-Hastings algorithm with truncated Student’s t distribution (with 3 degrees of freedom) centred at the previous values of the chain (similarly to Osiewalski and Pajor 2009). The covariance matrix of the latter is determined by initial draws of the algorithm. For large n sampling from this full conditional posterior distribution may be numerically more demanding.

The conditional posterior distributions of the unobserved variables in the VEC-MSF-SBEKK model are of the following form:

(28)p(qt|⋅, x)∝qt−1fN,1(lnqt|ϕlnqt−1, σq2)fN,1(lnqt+1|ϕlnqt, σq2)fN,n(xt|μt, qtΣ˜t),
for 1≤t≤T−1,
(29)p(qT|⋅, x)∝qT−1fN,1(lnqT|ϕlnqT−1, σq2)fN,n(xT|μT, qTΣ˜T).

In order to sample from (28) and (29) we also use the Metropolis-Hastings algorithm. Following (Osiewalski and Pajor 2009), the candidate generating density is an inverse Gamma density:

(30)q(qt|⋅)∝qt−(φ+1)exp(−θtq,Bqt−1),

where φ=(1−2eσ∗q2)(1−eσ∗q2)+n2,θtq,B=(φ−n2−1)exp(s∗t+12σ∗q2)+12(xt−μt)′Σ˜t−1(xt−μt).

We stress that the Metropolis-Hastings steps are implemented within the Gibbs sampler. Thus we create Metropolis subchains within the Markov chain constructed by the Gibbs sampler to sample from the non-standard distributions.

To summarise, Bayesian VEC-MSF or VEC-MSF-SBEKK models are analysed, using Gibbs sampling as a tool for simulating samples from the posterior distributions. In the case of the VEC-MSF-SBEKK structure the numerics is far more demanding even for small n. On the other hand, the forms of the full conditional posterior distributions in the VEC-MSF model make steps of the Gibbs sampler easy even for large n. However, the MSF specification is very restrictive since it assumes the same dynamics for all entries of Σ_t . This assumption is a price to be paid for the ease of numerical calculations.

Harmonic mean estimator with Lenk’s correction

The marginal data densities are essential in the formal Bayesian model selection and model averaging. (Newton and Raftery 1994) proposed a simple (and hence popular) method of estimation of the marginal data density based on the harmonic mean. The harmonic mean estimator (HME) is given as:

(31)p^HME(x)=[1m∑p=1m1p(x|θ(p), q(p))]−1

where {θ(p), q(p)}p=1m are the draws generated from the posterior distribution of the parameters and latent variables, using the MCMC methods.

Even though the HME is consistent (see Newton and Raftery 1994), the computed HME suffers from some serious shortcomings, for example it overestimates the marginal data density. In the paper (Lenk 2009) the source of this “simulation pseudo-bias” of the HME is identified, and several methods of estimating the “bias” adjustment factor are proposed. The adjusted HME for the marginal data density, proposed by (Lenk 2009), is given by the formula:

(32)p^AHME(x)=P^(C)[1m∑p=1m1p(x|θ(p), q(p))]−1,

where P^(C) is an evaluation of the prior probability of subset C⊆Θ [i.e. P(C)], where Θ denotes the space of parameters θ and latent variables q. In Equation (32) {θ(p), q(p)}p=1m are drawn from the posterior distribution of the vector (θ′, q′)′ restricted to the subset C. Following (Lenk 2009) we assume that C={(θ′, q′)}:p(x|θ, q)≥L}, where L=min(θ′,q′)′∈{(θ′(p),q′(p))′}p=1mp(x|θ, q).P(C) is approximated using importance sampling, drawing from the truncated [to the support of p(θ)] multivariate normal distribution for all parameters and the prior distribution for the latent variables. Starting from the identity: P(C)=∫Θp(q|θ)p(θ)IC(θ, q) dθ dq, we obtain: P(C)=∫Θp(θ)s(θ)IC(θ, q)s(θ)p(q|θ)dθ dq, where p(θ) is the probability density function of the prior distribution of parameters, p(q|θ) is the density function of the prior distribution for the latent variables (given the parameters), s(θ) is the probability density function of the truncated multivariate normal distribution. The last identity yields the importance sampling estimator of P(C) with the importance function of the form: s(θ) p(q|θ). The parameters of the truncated normal distribution (the mean and covariances) are obtained basing on the MCMC draws from the posterior distribution.

The estimate of P(C) is:

(33)P^(C)=1J∑j=1Jp(θ(j)s)IC(θ(j)s, q(j)s)s(θ(j)s),

where {θ(j)s, q(j)s}j=1J are drawn from the importance sampling distribution defined by the probability density function p(q|θ) s(θ). The normalising constants of probability density functions s(θ) and p(θ) are not known, thus they are calculated, using the direct sampling method.

Empirical illustrations

Data used in the paper come from the Polish economy. In the first example we analyse financial data of daily quotations on three major exchange rates: PLN/EUR, PLN/USD and EUR/USD. The second one focuses on macroeconomic data of unemployment, inflation and interest rates.

Long-run relationships among exchange rates

We start by modelling the exchange rates (where, on the ground of lack of arbitrage opportunities, the cointegrating vector may be assumed to be known a priori) to show that our model and numerical methods are suitable for modelling a long-run relationship in financial data. Let us consider two average daily main Polish official exchange rates: the zloty (PLN) values of the US dollar, and the zloty (PLN) values of the euro, over the period from January 3, 2005 to September 30, 2011 (downloaded from the website of the National Bank of Poland). The dataset of the daily logarithmic growth rates (expressed in percentage terms) consists of 1662 observations (for each series). The first observation is sacrificed as an initial condition, thus T=1661. The Polish official exchange rates are linked to the exchange rates quoted on Forex, the international currencies market. Thus, while building time-series models for the two Polish exchange rates [EUR/PLN (x_2,t ) and USD/PLN (x_3,t )], we introduce some extra variable from the international market – the euro value of the US dollar, EUR/USD (x_1,t , downloaded from http://stooq.com.).

We use the same dataset as in (Pajor 2011), where exogeneity in models with latent variables was considered, and the cointegrating vector was assumed to be known. The relationship: (EUR/PLN)/(USD/PLN)≈EUR/USD was introduced by assuming that this relation (in logarithmic terms) is a cointegration equation in the sense of (Engle and Granger 1987). This yielded the cointegrating vector (1, −1, 1) and the long-run (equilibrium) relationship: ln x_2,t −ln x_3,t =ln x_1,t . The assumption that lnx_1,t , lnx_2,t and lnx_3,t are cointegrated has been checked informally using the Phillips and Perron test applied to the series ECM_t =100(ln x_1,t −ln x_2,t +ln x_3,t ). In this paper the cointegrating vector is formally estimated.

The Polish official exchange rates with the addition of certain variables related to the foreign exchange market were previously modelled by (Osiewalski and Pipień 2004) within the VEC-GARCH structure, and under assumption that the cointegrating vector is known.

The data are plotted in Figure 1, depicting the graphs of logarithms (multiplied by 100) of exchange rates in terms of their levels and of their increments. It can be seen from the graphs that the growth rates are centred around zero, and feature time-varying volatility and outliers. The series are more volatile in the period of the financial crisis 2008–2009. Moreover, the official exchange rates of the National Bank of Poland appear more volatile than the EUR/USD exchange rates.

Figure 1:

Exchange rates of USD/PLN, EUR/PLN and EUR/USD (January 3, 2005 – September 30, 2011).

A visual inspection of the data reveals that they do not satisfy the constant conditional variance assumption, and therefore the conditional covariance matrix of the respective VAR models need to be stochastic. Summary statistics for the analysed series are shown in Table 1. The arithmetic means of the growth rates of the official Polish exchange rates are positive, but the standard deviations are relatively high. A much higher than 3 value of the kurtosis pronounced leptokurtosis of the empirical distribution. As expected, the EUR/PLN and USD/PLN exchange rates are positively correlated.

Table 1:

Descriptive statistics of analyzed exchange rates (x_i,t ) and of daily growth rates (y_i,t ).

	y_i,t			xi,t
	USD/PLN	EUR/PLN	EUR/USD	USD/PLN	EUR/PLN	EUR/USD
Mean	0.0047	0.0050	−0.0003	2.9319	3.9343	1.3513
SD	0.0280	0.0182	0.0172	0.0081	0.0067	0.0025
Skewness	0.4802	0.2530	−0.0113	−0.6198	−0.0419	0.3538
Kurtosis	7.6101	8.8404	4.9594	3.3365	3.8930	2.4680
Minimum	−6.7485	−4.5895	−3.1163	2.0220	3.2026	1.1670
Maximum	6.2677	4.1467	3.4157	3.8978	4.8999	1.5995
Empirical correlations
USD/PLN	1	0.7502	−0.3082	1	0.8041	−0.8452
EUR/PLN	1	−0.1313	1	−0.3742
EUR/USD			1			1

Due to the typical properties of daily growth rates of exchange rates, we consider the VEC-MSF and the VEC-MSF-SBEKK models with one cointegrating vector (r=1), containing an unrestricted vector of constants, and the number of lags equal to 2 (k=2). ^[1] We set the following values of hyperparameters:

for matrix B: P_τ =I₃ (the uniform distribution over the Stiefel manifold), τ=1;
for matrix A: G=I₃, μ_A =0_3×1, s_v =2, n_v =3;
for matrix Γ: μ_Γ=0_3×3, H=I₃, s_h =2, n_h =3;
for matrix Γ_s : μ_Γs =0_3×1, H_s =I₃, s_hs =2, n_hs =3;
for MSF parameters: μ_φ =0, Ω_φ =100, a_σ =1, b_σ =0.005, μ_q =0, Ω_q =100, μ_Σ=5, Ω_Σ=I₃,
for SBEKK parameters: μ_q,Σ=1.

As mentioned above, the relationship: (EUR/PLN)/(USD/PLN)≈EUR/USD indicates that the posterior cointegrating space can be spanned by the vector (1, −1, 1). One of the objectives of the study is to check whether formal Bayesian estimation of the cointegrating space leads to the same cointegrating vector, which was a priori assumed in our previous research.

The posterior means and standard deviations (in parentheses) of the cointegrating vector as well as the adjustment coefficients are presented bellow. As suggested by (Villani 2006), the point estimates of the cointegrating vectors are obtained with the use of a loss function based on the projective Frobenius distance β^=argminβ˜ ∈ Gr,m − rE[l(β, β˜)], where l(β, β˜)=2(r−tr(ββ′β˜β′˜)). This function is minimised at β^=(v1, v2, …, vr), where v_i (i=1, 2, …, r) are the eigenvectors corresponding to the r largest eigenvalues of the matrix E(ββ′). The measures of cointegration space variation are calculated as τsp(β)=r−∑i = 1rλir(m−r)/m, where λ_i is the i-th largest eigenvalue of E(ββ′). This measure takes values from 0 to 1. The lower is the value, the tighter is the posterior distribution (see Villani 2006). To make comparison with the theoretical cointegrating vector more readable, the point estimates of the cointegrating and the adjustment coefficient vectors are normalised in such a manner that the first element of β equals 1. Note that in the case of one cointegrating vector such a procedure does not alter the cointegration space. We obtained the following posterior means (and standard deviations) of β and α:

E(β|x, MSF)=[1.000−0.9970.999], τsp(β)=0.0007,

E(α|x, MSF)(D(α|x,MSF))=[0.0800.017−0.961(0.097)(0.094)(0.116)],

E(β|x, MSF−SBEKK)=[1.000−0.9981.000], τsp(β)=0.0006,

E(α|x, MSF−SBEKK)(D(α|x,MSF−SBEKK))=[0.0540.314−0.624(0.081)(0.070)(0.090)].

Overall, the results confirm that exchange rates of USD/PLN, EUR/PLN and EUR/USD are cointegrated with coefficients (1, −1, 1).

In the VEC-MSF model the posterior means and standard deviations (in parentheses) of the VAR parameters are:

[Δxt,1Δxt,2Δxt,3]=[0.0880.020−1.602(0.119)(0.107)(0.624)][1.000−0.997 (0.0007)0.999][xt − 1,1xt − 1,2xt − 1,3]++[−0.076−0.040−0.013(0.050)(0.041)(0.030)−0.115−0.0720.020(0.047)(0.039)(0.028)−0.135−0.0420.006(0.060)(0.048)(0.035)][Δxt − 1,1Δxt − 1,2Δxt − 1,3]+[0.003(0.028)−0.032(0.022)0.206(0.201)]+ε^t,

and

lnqt=0.987lnqt − 1+0.0207ηt+η^t,(0.0063)(0.0063)

E(lnq0|x, MSF)=0.055, D(lnq0|x, MSF)=0.488,

E(Σ|x, MSF)(D(Σ|x,MSF))=[0.552−0.041−0.230(0.223)(0.022)(0.095)−0.0410.5030.534(0.022)(0.204)(0.217)−0.2300.5340.780(0.095)(0.217)(0.315)],

where (ε^t′, η^t)′ is the vector of relevant residuals.

In turn, in the VEC-MSF-SBEKK model the posterior means and standard deviations of the VAR parameters are:

[Δxt,1Δxt,2Δxt,3]=[0.0540.314−0.624(0.081)(0.070)(0.090)][1.000−0.998 (0.0006)1.000][xt − 1,1xt − 1,2xt − 1,3]+[−0.080−0.022−0.027(0.042)(0.040)(0.030)−0.295−0.0490.014(0.033)(0.028)(0.017)−0.312−0.0210.006(0.044)(0.043)(0.030)][Δxt − 1,1Δxt − 1,2Δxt − 1,3]+[0.014(0.022)−0.100(0.064)0.109(0.121)]+ε^t,

and

lnqt=0.664lnqt − 1+0.162 ηt+η^t,(0.086)(0.043)

E(lnq0|x, MSF−SBEKK)=5.760, D(lnq0|MSF−SBEKK)=1.753,

E(a|x, MSF−SBEKK)=0.029, D(a|MSF−SBEKK)=0.003,

E(b|x, MSF−SBEKK)=0.969, D(b|MSF−SBEKK)=0.004,

E(q0,Σ|x, MSF−SBEKK)=0.044, D(q0,Σ|x, MSF−SBEKK)=0.026,

E(Σ|x, MSF−SBEKK)(D(Σ|x,MSF−SBEKK))=[0.622−0.018−0.186(0.460)(0.134)(0.271)−0.0180.2190.138(0.134)(0.099)(0.124)−0.1860.1380.414(0.271)(0.124)(0.225)].

Similar characteristics of posterior distributions were obtained in the VEC-SV models with the known cointegration relation (therefore they are not presented here), with the marginal posterior distributions being only less dispersed. The inference about the individual volatility of each time series (measured by the conditional standard deviations) is also very similar in both considered VEC-SV models: with either the known or estimated cointegrating vector. The time plots of conditional standard deviations (for each t=1, 2, …, T; T=1661) are presented in Figure 2, where the upper line represents the posterior mean plus two standard deviations, and the lower one – the posterior mean minus two standard deviations. The models produce volatility peaks at the same time moments. Note that the dynamic pattern of the volatility for all returns is the same in the VEC-MSF model, which is due to the fact that the dynamics of volatilities is governed by only one latent process. However, in spite of the simplicity of the structure of the conditional covariance matrix, we allow the variances of multivariate returns to vary over time. The plots of the main posterior characteristics of conditional correlations are presented in Figure 3. One can easily notice that the constant conditional correlation hypothesis (represented by the MSF specification) is not supported be the data, which is consistent with the findings presented by (Osiewalski and Osiewalski 2013). Due to numerical problems in calculating the marginal likelihoods, we do not present formal Bayesian model comparison of the MSF and MSF-SBEKK specifications. The use of the Lindley-type test (based on the highest posterior density region, see, e.g. Box and Tiao 1973; Bauwens, Lubrano, and Richard 1999) would lead to rejection of the hypothesis. However, the assumption of constant conditional correlations does not influence the inference about the cointegration relationship. The distance measures between the estimated cointegrated spaces and the theoretical one [i.e. spanned by the vector (1, −1, 1)]^[2] obtained in the VEC-MSF and the VEC-MSF-SBEKK models are very close to zero and almost the same (although they are slightly lower in the VEC-MSF models). In the set of the VEC-MSF-SBEKK models the distance measure takes the highest value in the model with five lags (5.5*10⁻⁶) and they fall to 3.3*10⁻⁶ in the model with two lags.

Figure 2:

Conditional standard deviations (posterior mean±2 standard deviations). The results are obtained in the VEC-MSF and VEC-MSF-SBEKK models with r=1.

Figure 3:

Conditional correlations (posterior mean±2 standard deviations). The results are obtained in the VEC-MSF and VEC-MSF-SBEKK models with r=1 (cointegrating vector is estimated).

Our application of the VEC-SV models to exchange rates yields results, which are consistent with the theory. The cointegrating space appears indeed to be spanned by the vector (1, −1, 1). This illustration shows that our methods can accurately estimate the long-run relationship in the VEC model with a time-variable conditional covariance matrix.

A small model for monetary policy

In the second example the methods developed in the paper are applied to a system with three Polish macroeconomic variables. Before presenting the results, it is worth emphasizing that the papers cited in the introduction pertain predominantly to the US economy. To the best of our knowledge, similar analyses for the Polish economy, with the use of the (Bayesian) VEC-SV models, have not been carried out as yet. Therefore, the paper aims to fill this gap.

In this study a so called small model of monetary policy is considered. The model is based on three major macroeconomic variables: inflation of consumer prices, Δp_t , unemployment rate, U_t , and short-term interest rates, r_t . The latter can be regarded as a proxy for the monetary policy.

The seasonally unadjusted data cover the period from 1995Q1 to 2012Q4. The seasonality of the analysed series is modelled in a deterministic manner, via zero-mean seasonal dummies. The first 3 years are used as a training sample to determine the hyperparameters of the prior distribution for the cointegration space (the Johansen procedure in the model with two lags and r cointegrating vectors β has been used, which further have been utilized to calculate Pτ=ββ′+0.5β⊥β′⊥). The next five quarters are sacrificed as initial conditions. Eventually, there are 55 modelled observations (see Figure 4).

Figure 4:

The analysed macro-data series (1995Q1 – 2012Q4).

As the analysed time series are rather short, it is very important to test whether the variances and covariances are constant or not. A visual inspection of the time paths of the series suggests that one could expect a time-varying covariance structure. The empirical validity of this presumption is to be formally tested via Bayesian model comparison. The set of competing models include 72 non-nested specifications. Along with the number of latent processes driving covariances (l=0 in the a VEC model, l=1 in the VEC-MSF and VEC-MSF-SBEKK forms), the models can differ in the VAR order (k ϵ {2, 3, 4, 5}), the type of incorporated deterministic term (d=4 denotes a restricted constant, d=3 – an unrestricted one) and the cointegration rank (r can be equal to 1 or 2). Additionally, we consider VAR models for the first differences of the analysed processes (r=0) and VAR models for the levels of these processes (r=3). We assume that the prior probability of each of the specifications compared is equal to 0.0139, which is the reciprocal of the total number of models considered.

Table 2 summarizes the results of the model comparison. In all three considered cases of the covariance structure (homoscedastic, MSF and MSF-SBEKK), the model that ranks first is the one that features two lags, one cointegrating relation and a constant restricted to the cointegration space. Generally, among the models with the highest posterior probability are almost exclusively those with either no or only one cointegrating relation, a time-varying covariance matrix and a short lag structure. Note that the number of lags seems to be less important than the cointegration rank. Due to high values of numerical standard errors of HME the ranking positions of some models cannot be clearly identified, but even if we take into account the error for the VEC-MSF-SBEKK model with k=2, d=4 and r=1, there is no doubt that the marginal data density for this specification is the highest. We can also observe high positive correlations between ordering of the models in all considered cases of the covariance structure. Table 3 presents models with the highest posterior probability. Among them there are only the VEC-MSF-SBEKK and VEC-MSF specifications, so there is a strong evidence that the covariance matrix is time-varying. Only the first three models obtained higher posterior probability than the assumed prior probability, and almost all posterior probability is gathered by the VEC-MSF-SBEKK model with two lags in the VAR form, one cointegration relation and a constant restricted to the cointegration space. The second one is the model with the same VEC structure but with the MSF covariance structure, and the third specification is the VAR-MSF-SBEKK model for the first differences.

Table 2:

Natural logarithm of estimates of marginal likelihoods, numerical standard errors and ranks.

k, d, r	VEC			VEC-MSF			VEC-MSF-SBEKK
k, d, r	ln p(x)	NSE	Rank (under all models)	ln p(x)	NSE	Rank (under all models)	ln p(x)	NSE	Rank (under all models)
2,3,0	−196.94	0.46	24 (23)	−182.06	0.56	24 (5)	−177.58	0.32	24 (3)
2,3,1	−198.69	0.50	4 (26)	−185.61	0.50	3 (6)	−181.39	0.50	3 (4)
2,3,2	−203.16	0.98	7 (37)	−187.89	0.75	5 (8)	−202.69	1.17	12 (36)
2,3,3	−214.66	1.03	19 (54)	−204.34	0.89	20 (40)	−225.80	1.30	17 (63)
2,4,1	−191.89	0.49	1 (15)	−177.38	0.42	1 (2)	−174.29	0.87	1 (1)
2,4,2	−196.88	0.60	2 (22)	−190.41	1.04	7 (13)	−202.45	1.03	11 (34)
3,3,0	−204.55	1.03	3 (41)	−190.55	0.87	2 (14)	−187.38	0.60	2 (7)
3,3,1	−206.53	0.36	12 (45)	−196.56	0.85	12 (21)	−201.64	1.02	10 (32)
3,3,2	−211.16	0.84	16 (49)	−200.30	1.06	15 (30)	−223.44	1.10	15 (61)
3,3,3	−220.83	1.09	23 (60)	−212.93	0.88	22 (50)	−238.38	1.16	21 (67)
3,4,1	−199.04	0.52	5 (27)	−187.59	0.51	4 (8)	−189.79	0.64	6 (12)
3,4,2	−206.57	0.74	13 (46)	−193.21	0.70	10 (18)	−231.97	0.68	19 (65)
4,3,0	−210.72	0.92	9 (48)	−197.66	1.03	8 (24)	−192.89	0.64	4 (17)
4,3,1	−213.70	0.67	18 (52)	−201.14	1.05	16 (31)	−216.74	0.79	14 (57)
4,3,2	−216.67	0.66	21 (56)	−206.40	1.23	21 (44)	−247.65	1.37	22 (68)
4,3,3	−219.61	1.40	22 (59)	−219.12	0.96	23 (58)	−237.05	1.00	20 (66)
4,4,1	−203.21	0.81	8 (38)	−192.00	0.80	9 (16)	−193.77	0.55	8 (20)
4,4,2	−215.67	0.89	20 (55)	−199.62	0.73	14 (28)	−225.09	1.05	16 (62)
5,3,0	−205.24	1.15	15 (42)	−193.25	1.04	13 (19)	−188.09	1.05	7 (9)
5,3,1	−209.38	0.81	14 (47)	−202.39	1.03	17 (33)	−213.94	0.88	13 (53)
5,3,2	−212.96	0.75	17 (51)	−204.08	0.75	19 (39)	−253.58	1.18	23 (69)
5,4,1	−199.91	1.03	6 (29)	−188.61	0.60	6 (11)	−197.88	0.79	9 (25)
5,4,2	−205.82	1.25	11 (43)	−202.46	1.11	18 (35)	−231.00	1.11	18 (64)

k, VAR order; d, deterministic term; r, cointegration rank; r=3 – VAR for levels, r=0 – VAR for first differences. The P_τ matrix is determined with the help of the Johansen procedure employed in the model with r cointegrating relations.

Table 3:

The most probable models.

VAR order (k)	Deterministic term (d)	Cointegration rank (r)	Model (l)	P(M_{(_k,_d,_r,_l)}\|x)
2	4	1	VEC-MSF-SBEKK	0.92261
2	4	1	VEC-MSF	0.04172
2	3	0	VEC-MSF-SBEKK	0.03435
2	3	1	VEC-MSF-SBEKK	0.00076
2	3	0	VEC-MSF	0.00039
2	3	1	VEC-MSF	0.00001
3	3	0	VEC-MSF-SBEKK	0.000002
3	4	1	VEC-MSF	0.000002
2	3	2	VEC-MSF	0.000001

The P_τ matrix is determined with the help of the Johansen procedure employed in the model with r cointegrating relations.

Figure 5 displays the time plots of conditional standard deviation and the cross correlations of the innovation processes in each of the three equations of the model with the highest posterior probability. Black lines represent posterior means, and the grey ones – the means plus/minus two posterior standard deviations. The posterior means of the conditional standard deviations lead us to the conclusion that all the error terms are heteroscedastic, but the areas between grey lines representing the uncertainty suggest that the Lindley-type test would probably indicate that the error term in the inflation equation could be homoscedastic. In the unemployment equation we can observe a high pick of volatility in year 2003, when the unemployment rate was the highest. The volatility of the interest rate shocks was clearly higher at the beginning of the analysed sample (in years 1999–2002), when the interest rates were high.

Figure 5:

Conditional standard deviations and correlations (posterior mean±2 standard deviations). The results are obtained in the VEC-MSF-SBEKK model with r=2, k=1, d=4.

The posteriors of conditional correlations are quite tight, and the presented figures clearly indicate that these correlations keep changing during the analysed period. It can be seen that all the conditional correlations of error terms for interest rate and for inflation are positive and relatively high (Figure 5, graph 5). The conditional correlations between the error terms for the inflation and unemployment as well as for the unemployment and interest rate change their signs (Figure 5, graphs 4 and 6). Also, they share similar patterns, which probably results from positive correlations of the error terms in the equations of the interest rate and inflation.

As has been mentioned before, the analysed series is rather short, therefore we have decided to perform a sensitivity analysis of the Bayesian model comparison for two different prior specifications for the cointegration space. Contrary to the previous approach of setting the P_τ matrix, now it is determined with the help of the Johansen procedure employed in the model with n−1 (rather than r) cointegrating relations. Tables 4 and 5 summarize the obtained results. The ranking of the most probable models with the homoscedastic covariance structure remains almost unchanged, whereas in the two other cases under study (i.e. VEC-MSF and VEC-MSF-SBEKK) we can observe some major differences in their performance. It emerges now that the models with two lags and no cointegrating relations are superior. As for the covariance structure we still have strong evidence for its time-variability, and almost all posterior model probability is gained by the MSF-SBEKK specification. Summing up, our sensitivity analysis shows that, as it might have been expected, in such richly parameterised models and with samples of so small sizes the prior assumptions affect the posterior results quite considerably.

Table 4:

Natural logarithm of estimates of marginal likelihoods, numerical standard errors and ranks.

k, d, r	VEC			VEC-MSF			VEC-MSF-SBEKK
k, d, r	ln p(x)	NSE	Rank (under all models)	ln p(x)	NSE	Rank (under all models)	ln p(x)	NSE	Rank (under all models)
2,3,0	−196.94	0.46	3 (20)	−182.06	0.56	1 (2)	−177.58	0.32	1 (1)
2,3,1	−199.18	0.81	4 (23)	−188.74	0.56	4 (8)	−196.39	0.35	7 (17)
2,3,2	−203.16	0.98	5 (30)	−187.89	0.75	3 (6)	−202.69	1.17	10 (29)
2,3,3	−214.66	1.03	17 (50)	−204.34	0.89	18 (33)	−225.80	1.30	17 (63)
2,4,1	−195.25	1.09	1 (15)	−184.21	0.79	2 (3)	−187.70	0.88	3 (5)
2,4,2	−196.88	0.60	2 (19)	−190.41	1.04	5 (9)	−202.45	1.03	9 (27)
3,3,0	−204.55	1.03	6 (34)	−190.55	0.87	6 (10)	−187.38	0.60	2 (4)
3,3,1	−209.50	0.83	13 (45)	−198.11	0.74	12 (22)	−206.59	0.68	11 (40)
3,3,2	−211.16	0.84	15 (47)	−200.30	1.06	14 (25)	−223.44	1.10	14 (60)
3,3,3	−220.83	1.09	23 (58)	−212.93	0.88	22 (48)	−238.38	1.16	21 (67)
3,4,1	−208.46	0.72	11 (43)	−195.70	0.49	9 (16)	−193.88	0.80	6 (14)
3,4,2	−206.57	0.74	9 (39)	−193.21	0.70	7 (12)	−231.97	0.68	19 (65)
4,3,0	−210.72	0.92	14 (46)	−197.66	1.03	11 (21)	−192.89	0.64	5 (11)
4,3,1	−215.76	0.82	19 (52)	−207.44	1.06	21 (41)	−222.85	0.88	13 (59)
4,3,2	−216.67	0.66	21 (54)	−206.40	1.23	20 (38)	−247.65	1.37	22 (68)
4,3,3	−219.61	1.40	22 (57)	−219.12	0.96	23 (56)	−237.05	1.00	20 (66)
4,4,1	−216.63	1.11	20 (53)	−203.45	1.14	16 (31)	−217.02	1.04	12 (55)
4,4,2	−215.67	0.89	18 (51)	−199.62	0.73	13 (24)	−225.09	1.05	16 (62)
5,3,0	−205.24	1.15	7 (35)	−193.25	1.04	8 (13)	−188.09	1.05	4 (7)
5,3,1	−208.91	0.83	12 (44)	−205.88	1.16	19 (37)	−224.54	0.79	15 (61)
5,3,2	−212.96	0.75	16 (49)	−204.08	0.75	17 (32)	−253.58	1.18	23 (69)
5,4,1	−207.99	0.77	10 (42)	−196.73	1.03	10 (18)	−202.04	0.74	8 (26)
5,4,2	−205.82	1.25	8 (36)	−202.46	1.11	15 (28)	−231.00	1.11	18 (64)

k, VAR order; d, deterministic term; r, cointegration rank; r=3 – VAR for levels, r=0−VAR for first differences. The P_τ matrix is determined with the help of the Johansen procedure employed in the model with n−1 cointegrating relations.

Table 5:

The most probable models.

VAR order (k)	Deterministic term (d)	Cointegration rank (r)	Model (l)	P(M_{(_k,_d,_r,_l)}\|x)
2	3	0	VEC-MSF-SBEKK	0.98737
2	3	0	VEC-MSF	0.01116
2	4	1	VEC-MSF	0.00130
3	3	0	VEC-MSF-SBEKK	0.00005
2	4	1	VEC-MSF-SBEKK	0.00004
2	3	2	VEC-MSF	0.00003
5	3	0	VEC-MSF-SBEKK	0.00003
2	3	1	VEC-MSF	0.000014
2	4	2	VEC-MSF	0.000003
3	3	0	VEC-MSF	0.000002

The P_τ matrix is determined with the help of the Johansen procedure employed in the model with n−1 cointegrating relations.

Concluding remarks

The paper is focused on the construction and estimation of Bayesian Vector Error Correction -Stochastic Volatility (VEC-SV) models. As regards the stochastic volatility part of the specification we start with the MSF structure, which is the simplest among the MSV structures, thereby allowing a parsimonious way of modelling time-variability of volatility. Such a framework enables us to capture the long-run relationships among processes, and also to formally examine the presence of time-variation in the conditional covariance matrix.

The main conclusion of this paper is that both the heteroskedasticity and the long-run relationships have to be taken into account simultaneously in macroeconomic models as well as in financial analyses. Our application of the VEC-MSF and VEC-MSF-SBEKK models to exchange rates demonstrates that our methods can accurately estimate the long-run relationships in the VEC model for daily market prices. The results of Bayesian model comparison performed in the macroeconomic model example indicate that the ranking of different VEC structures (covering various numbers of lags and cointegrating relations, and two types of incorporated deterministic terms) is rather stable across different covariance matrix specifications.

The MSF structure allows conditional variances of the joint distribution of variables to change over time, but conditional correlations remain constant. In the MSF-SBEKK structure the conditional covariance matrix depends on the past of the process, as in the MGARCH models, which can lead to serious numerical obstacles. Thus, we deem that further research should rather include an MSV specification with stochastic conditional correlations based on more than one latent process.

On the other hand, time-variability of shocks has a potential effect on the propagation mechanism of the innovations (see, e.g. Koop, León-González, and Strachan 2009), so allowing parameters in the conditional mean of the process to be time-varying appear one of the most wanted generalisations.

Acknowledgments

We would like to thank the anonymous referees for very useful comments and suggestions that helped us to prepare the final version of our paper, and Łukasz Kwiatkowski for language verification of the manuscript. Research was partly financed from the funds granted to the Faculty of Management at Cracow University of Economics, within the framework of the subsidy for the maintenance of research potential. Research was partly financed from the funds granted to the Faculty of Finance and Law at Cracow University of Economics, within the framework of the subsidy for the maintenance of research potential.

References

Bauwens, Luc, Michel Lubrano, and Jean-François Richard. 1999. Bayesian Inference in Dynamic Econometric Models. Oxford University Press, New York.10.1093/acprof:oso/9780198773122.001.0001Suche in Google Scholar

Belmonte, Miguel, and Gary Koop. 2014. “Model Switching and Model Averaging in Time-Varying Parameter Regression Models.” In Bayesian Model Comparison, edited by Ivan Jeliazkov and Dale J. Poirier, Advances in Econometrics, Vol. 34, 45–69. Bingley: Emerald Group Publishing Limited. Accessed June 12, 2015. doi: 10.1108/S0731-905320140000034004.10.1108/S0731-905320140000034004Suche in Google Scholar

Box, George E.P., and George C. Tiao. 1973. Bayesian Inference in Statistical Analysis. Massachusetts: Addison-Wesley Publishing Company.Suche in Google Scholar

Cavaliere, Giuseppe, Luca De Angelis, Anders Rahbek, and A.M. Robert Taylor. 2015. “A Comparison of Sequential and Information-based Methods for Determining the Co-integration Rank in Heteroskedastic VAR Models.” Oxford Bulletin of Economics and Statistics 77: 106–128.10.1111/obes.12051Suche in Google Scholar

Chikuse, Yasuko. 2002. “Statistics on Special Manifolds.” Lecture Notes in Statistics 174. Springer-Verlag, New York.10.1007/978-0-387-21540-2Suche in Google Scholar

Cogley, Timothy, and Thomas J. Sargent. 2005. “Drifts and Volatilities: Monetary Policies and Outcomes in the Post WWII U.S.” Review of Economic Dynamics 8: 262–302.10.2139/ssrn.2482531Suche in Google Scholar

Engle, Robert F., and C.W.J. Granger. 1987. “Co-integration and Error Correction: Representation, Estimation and Testing.” Econometrica 55: 251–276.10.2307/1913236Suche in Google Scholar

Gamerman, Dani. 1997. Markov Chain Monte Carlo. Statistic Simulation for Bayesian Inference. London: Chapman and Hall.Suche in Google Scholar

Herwartz, Helmut, and Helmut Lütkepohl. 2011. “Generalized Least Squares Estimation for Cointegration Parameters Under Conditional Heteroskedasticity.” Journal of Time Series Analysis 32 (3): 281–291.10.1111/j.1467-9892.2010.00698.xSuche in Google Scholar

Jacquier, Eric, Nicholas G. Polson, and Peter E. Rossi. 1994. “Bayesian Analysis of Stochastic Volatility Models.” Journal of Business and Economic Statistics 12 (4): 371–389.10.1080/07350015.1994.10524553Suche in Google Scholar

Jacquier, Eric, Nicholas G. Polson, and Peter E. Rossi. 1995. “Priors and Models for Multivariate Stochastic Volatility Models.” CIRANO Working Paper No. 95s-18, Montréal. Accessed September 19, 2016. http://www.cirano.qc.ca/files/publications/95s-18.pdf.Suche in Google Scholar

Koop, Gary. 2012. “Using VARs and TVP-VARs with Many Macroeconomic Variables.” Central European Journal of Economic Modelling and Econometric 4 (3): 43–167.Suche in Google Scholar

Koop, Gary, Roberto León-González, and Rodney W. Strachan. 2009. “On the Evolution of Monetary Policy.” Journal of Economic Dynamics and Control 33: 997–1017.10.1016/j.jedc.2008.11.003Suche in Google Scholar

Koop, Gary, Roberto León-González, and Rodney W. Strachan. 2010. “Efficient Posterior Simulation for Cointegrated Models with Priors on the Cointegration Space.” Econometric Reviews 29: 224–242.10.1080/07474930903382208Suche in Google Scholar

Koop, Gary, Roberto León-González, and Rodney W. Strachan. 2011. “Bayesian Inference in the Time Varying Cointegration Model.” Journal of Econometrics 165: 210–220.10.1016/j.jeconom.2011.07.007Suche in Google Scholar

Koop, Gary, and Simon M. Potter. 2011. “Time Varying VARs with Inequality Restrictions.” Journal of Economic Dynamics and Control 35: 1126–1138.10.1016/j.jedc.2011.02.001Suche in Google Scholar

Larsson, Rolf, and Mattias Villani. 2001. “A Distance Measure between Cointegration Spaces.” Economics Letters 70: 21–27.10.1016/S0165-1765(00)00349-9Suche in Google Scholar

Lenk, Peter. 2009. “Simulation Pseudo-bias Correction to the Harmonic Mean Estimator of Integrated Likelihoods.” Journal of Computational and Graphical Statistics 18: 941–960.10.1198/jcgs.2009.08022Suche in Google Scholar

Newton, Michael A., and Adrian E. Raftery. 1994. “Approximate Bayesian Inference by the Weighted Likelihood Bootstrap.” Journal of the Royal Statistical Society B 56 (1): 3–48.10.1111/j.2517-6161.1994.tb01956.xSuche in Google Scholar

Osiewalski, Jacek. 2009. “New Hybrid Models of Multivariate Volatility (a Bayesian Perspective).” Przegląd Statystyczny (Statistical Review) 56 (1): 15–22.Suche in Google Scholar

Osiewalski, Jacek, and Anna Pajor. 2009. “Bayesian Analysis for Hybrid MSF–SBEKK Models of Multivariate Volatility.” Central European Journal of Economic Modelling and Econometrics 1 (2): 179–202.Suche in Google Scholar

Osiewalski, Jacek, and Anna Pajor. 2010. “Bayesian Value-at-Risk for a Portfolio: Multi-and Univariate Approaches using MSF–SBEKK Models.” Central European Journalof Economic Modelling and Econometrics 2 (4): 253–277.Suche in Google Scholar

Osiewalski, Krzysztof, and Jacek Osiewalski. 2012. “Missing Observations in Daily Returns –Bayesian Inference within the MSF-SBEKK Model.” Central European Journal of Economic Modelling and Econometrics 4 (3): 169–197.Suche in Google Scholar

Osiewalski, Krzysztof, and Jacek Osiewalski. 2013. “A Long-Run Relationship between Daily Prices on Two Markets: The Bayesian VAR(2)–MSF-SBEKK Model.” Central European Journal of Economic Modelling and Econometrics 5 (1): 65–83.Suche in Google Scholar

Osiewalski, Jacek, and Mateusz Pipień. 2004. “Bayesian comparison of bivariate GARCH processes. The Role of the Conditional Mean Specification”. In New Directions in Macromodelling, edited by Aleksander Welfe, 173–196, Amsterdam: Elsevier.10.1016/S0573-8555(04)69007-9Suche in Google Scholar

Pajor, Anna. 2011. “A Bayesian Analysis of Exogeneity in Models with Latent Variables.” Central European Journal of Economic Modelling and Econometrics 3 (2): 49–73.Suche in Google Scholar

Pajor, Anna, and Jacek Osiewalski. 2012. “Bayesian Value-at-Risk and Expected Shortfall for a Large Portfolio (Multi- and Univariate Approaches).” Acta Physica Polonica A 121, (2-B): B-101-B-109.10.12693/APhysPolA.121.B-101Suche in Google Scholar

Primiceri, Giorgio E. 2005. “Time Varying Structural Vector Autoregressions and Monetary Policy.” Review of Economic Studies 72: 821–852.10.1111/j.1467-937X.2005.00353.xSuche in Google Scholar

Quintana, Jose M., and Michael West. 1987. “An analysis of international exchange rates using multivariate DLMs.” The Statistician 36: 275–281.10.2307/2348524Suche in Google Scholar

Seo, Byeongseon. 2007. “Asymptotic Distribution of the Cointegrating Vector Estimator in Error Correction Models with Conditional Heteroskedasticity.” Journal of Econometrics 137: 68–111.10.1016/j.jeconom.2006.03.008Suche in Google Scholar

Sims, Christopher A., and Tao Zha. 2006. “Were there Regime Switches in Macroeconomic Policy?” American Economic Review 96: 54–81.10.1257/000282806776157678Suche in Google Scholar

Villani, Mattias. 2006. “Bayesian Point Estimation of the Cointegration Space” Journal of Econometrics 134: 645–664.10.1016/j.jeconom.2005.07.008Suche in Google Scholar

Supplemental Material:

The online version of this article (DOI: https://doi.org/10.1515/snde-2016-0004) offers supplementary material, available to authorized users.

Published Online: 2017-4-6

Supplementary material

Artikel in diesem Heft

https://doi.org/10.1515/snde-2016-0004

Schlagwörter für diesen Artikel

Bayesian analysis; cointegration; Markov Chain Monte Carlo methods; stochastic volatility

VEC-MSF models in Bayesian analysis of short- and long-run relationships

Artikel

Abstract

Introduction

Bayesian VEC-SV models

MSF and MSF-SBEKK structures in the VEC model

The prior distribution and Bayesian VEC-MSF model

MCMC methods for Bayesian estimation and model comparison

Gibbs and Metropolis-Hastings algorithms

Harmonic mean estimator with Lenk’s correction

Empirical illustrations

Long-run relationships among exchange rates

A small model for monetary policy

Concluding remarks

Acknowledgments

References

Supplemental Material:

Zusatzmaterial

Artikel in diesem Heft

Artikel in diesem Heft

Artikel in diesem Heft