Startseite VEC-MSF models in Bayesian analysis of short- and long-run relationships
Artikel Öffentlich zugänglich

VEC-MSF models in Bayesian analysis of short- and long-run relationships

  • Anna Pajor EMAIL logo und Justyna Wróblewska
Veröffentlicht/Copyright: 6. April 2017
Veröffentlichen auch Sie bei De Gruyter Brill

Abstract

The paper aims at developing new Bayesian Vector Error Correction – Stochastic Volatility (VEC-SV) models, which combine the VEC representation of a VAR structure with stochastic volatility, represented by either the multiplicative stochastic factor (MSF) process or the MSF-SBEKK specification. Appropriate numerical methods (MCMC-based algorithms) are adapted for estimation and comparison of these type of models. Based on data coming from the Polish economy (time series of unemployment, inflation, interest rates, and of PLN/EUR, PLN/USD and EUR/USD exchange rates) it is shown that the models and numerical methods proposed in our study work well in simultaneous modelling of volatility and long-run relationships.

JEL Classification: C11; C32

Introduction

A frequently used model structure for macroeconomic time series is the vector autoregression with the error correction mechanism and with a constant conditional covariance matrix. In turn, conditional heteroscedasticity processes serve as typical models for the volatility of financial time series. However, some research has shown that the variability of macroeconomic time series (measured, for example, by the conditional covariance matrix) can vary over time, and also that on financial markets there exist long-term relationships. Therefore, it appears essential to formulate such models in which the possible presence of long-run relationships and time-variable volatility are simultaneously taken into account. A fast growing econometrics literature (see, e.g. Cogley and Sargent 2005; Primiceri 2005; Sims and Zha 2006; Koop, León-González, and Strachan 2009; Koop and Potter 2011; Koop 2012; Belmonte and Koop 2014) has mainly focused on two particular extensions of a basic vector autoregressive (VAR) model: specifications with time-varying parameters and the ones with time-varying conditional covariances. Our particular attention is paid to the vector error correction (VEC) representation of VAR with the foregoing extensions (see, e.g. Seo 2007; Herwartz and Lütkepohl 2011; Koop, León-González, and Strachan 2011; Cavaliere et al. 2015). However, due to the lack of identification of cointegrating vectors (only the space spanned by the cointegrating vectors can be identified), the literature on the VEC models is much more limited than that on VAR. Also, studies on the Bayesian inference within the corresponding VEC structures are quite rare. Therefore, it is desirable to fill this gap in the literature and to develop Bayesian methods for VEC models allowing for stochastic volatility (SV).

The paper aims at developing new Bayesian Vector Error Correction – Stochastic Volatility (VEC-SV) models, which combine the VEC representation of a VAR structure with stochastic volatility, represented by either the multiplicative stochastic factor (MSF) process or the MSF-SBEKK specification. The MSF structure is one of the simplest amongst the Multivariate Stochastic Volatility (MSV) models (since it features only one latent process, and the conditional covariance matrix does not depend on the past information set), and it can be used to describe time-varying volatility in a parsimonious manner. It is worth mentioning that this SV specification has been used in constructing hybrid models combining multivariate stochastic volatility and multivariate generalized conditional heteroscedasticity structures (MSF-SBEKK, among others), proposed by (Osiewalski 2009; Osiewalski and Pajor 2009), and further developed and extended by (Osiewalski and Pajor 2010; Pajor and Osiewalski 2012; Osiewalski and Osiewalski 2012).

Another objective of this paper is to develop Bayesian methodology for estimation and comparison of the models in question. Bayesian statistical analysis in the class of proposed models requires using some numerical methods, in particular Markov Chain Monte Carlo (MCMC) algorithms. Therefore, one of objectives of the research is to adapt relevant numerical methods to allow the estimation, prediction and comparison of such model structures. In particular, we employ the Gibbs sampler and the Metropolis-Hastings procedure.

The paper is organized as follows. In Section 2 the basic framework is set and the Bayesian models are formulated. In Section 3 we elaborate on the numerical methods. Finally, Section 4 is devoted to an empirical study, in which we use a standard set of the Polish macroeconomics variables (including the unemployment, inflation and interest rates), and show that relaxing the usual assumption of conditional homoscedasticity by introducing the MSF or the MSF-SBEKK structures is empirically valid. The resulting VEC-MSF and VEC-MSF-SBEKK models are also used for the analysis of relationships and volatility of selected main official Polish exchange rates.

Bayesian VEC-SV models

Consider a linear n-variate and k-order vector autoregressive [VAR(k)] process with deterministic terms and a stochastic volatility (SV) structure, represented in the form of a vector error correction (VEC) process, i.e. Vector Error Correction with Stochastic Volatility (VEC-SV) process:

  1. (1)Δxt=Π˜xt1+i=1k1ΓiΔxti+ΦDt+εt,   t=1,2,,T,
  2. (2)εt|ψt1,Dt,qt,θN(0,Σt),

where xt is an n×1 random vector, Π˜ and Γi are n×n matrices of real coefficients (i=1, … k–1), matrix Dt is comprised of deterministic variables such as the constant and seasonal centred dummies, Φ is a parameter matrix, qt is a latent variable, θ is a vector of parameters, ψt−1 denotes the past of the process {xt } up to time t−1, and Σt =Σ(qt , ψt−1). Moreover, Π˜=αβ˜, with α and β˜ being some n×r matrices, where r<n is the number of cointegration relationships (if they exist). The initial conditions xk+1, xk+2, …, x0 are assumed to be known.

MSF and MSF-SBEKK structures in the VEC model

In this paper we consider two alternative stochastic volatility structures for matrix Σt : multiplicative stochastic factor (MSF) and hybrid MSF-SBEKK (type I; see Osiewalski and Pajor 2009).

The multiplicative stochastic factor structure for matrix Σt is as follows:

(3)Σt=qtΣ,

with ln qt =ϕ ln qt−1+σqηt , {ηt }~iiN(0, 1).

Henceforth, this specification will be referred to as the VEC-MSF (VEC with Multiplicative Stochastic Factor) process. Although the VEC-MSF process features non-zero time-varying conditional covariances, the conditional correlations remain constant over time. Such a result is attributable to the fact that the very same qt factor drives the dynamics of each element of Σt . The idea of the MSF structure dates back to (Quintana and West 1987), who specified the scaling factors to be some known constants. Under a different name of the stochastic discount factor (SDF) process, the MSF specification was then revived (though only conceptually, without being used in any application) by (Jacquier, Polson, and Rossi 1995) in modelling financial time series volatility. Later, the process was employed, e.g. by (Osiewalski and Pajor 2009; Pajor 2011; Pajor and Osiewalski 2012; Osiewalski and Osiewalski 2013).

As an alternative MSV specification we employ the MSF-SBEKK structure proposed by (Osiewalski 2009). Following (Osiewalski and Pajor 2009), for matrix Σt we assume the so-called type I hybrid MSF-SBEKK process:

(4)Σt=qtΣ˜t,
(5)Σ˜t=(1ab)Σ+b(εt1εt1)+aΣ˜t1,

with ln qt =ϕ ln qt−1+σqηt , {ηt }~iiN(0, 1), a,b∈R.

Matrix Σ˜t is square, of order n and follows the scalar BEKK(1,1) structure. The specification resulting from introducing MSF-SBEKK error term (εt ) into the VEC model is further referred to as VEC-MSF-SBEKK. The presence of the scalar BEKK(1,1) structure in the conditional covariance matrix allows us to model time-varying conditional correlations without introducing more latent processes. In the hybrid MSF-SBEKK model two simple basic structures are nested. In the limiting case of σg →0 and ϕ=0 we obtain the VEC-SBEKK process, while setting b=0 and a=0 leads to the VEC-MSF case. As regards the initial conditions for Σ˜t, we assume ε0=0, Σ˜0=s0,ΣIn, where s0,Σ>0 and In denotes the identity matrix of size n.

Equation (1) can be decomposed and written as:

(6)Δxt=α[β˜,Φ1][xt1Dt(1)]+i=1k1ΓiΔxti+Φ2Dt(2)+εt=αβz1,t+Γz2,t+Γsz3,t+εt,

where β=[β˜,Φ1],z1,t =[xt−1′, Dt(1)′]′, z2,t =(Δxt−1′, Δxt−2′, …, Δxtk+1′)′, z3,t =Dt(2), Γ=[Γ1, Γ2, …, Γk−1]′, Γs2′, and αΦ1Dt(1)+Φ2Dt(2)=ΦDt.

In order to simplify the notation let us write the basic model (6) in a matrix form:

(7)Z0=Z1Π+Z2Γ+Z3Γs+E,

where Π=αβ′, Z0=[Δx1, Δx2 … ΔxT ]′=[z0,1z0,2z0,T ]′, Z1=[z1,1z1,2z1,T ]′, Z2=[z2,1z2,2z2,T ]′, Z3=[z3,1z3,2z3,T ]′, E=[ε1, ε2εT ]′.

The conditional distribution of xt (given the past of the process, ψt−1, the deterministic variables, Dt , the parameters and the latent variable qt ) is n-variate Normal with the mean μt=xt1+Π˜xt1+i=1k1ΓiΔxti+ΦDt and the covariance matrix Σt :

(8)p(xt|ψt1,Dt,α,β,Γ,Γs,qt,Σ,θΣ,q0,Σ)=fN,n(xt|μt,Σt),

where θΣ and q0,Σ are the vectors of the stochastic volatility parameters: in the VEC-MSF model θΣ=(ϕ,σq2), and q0,Σ=ln q0, whereas in the VEC-MSF-SBEKK model we have θΣ=(ϕ,σq2,a,b) and q0,Σ=(ln q0, s0,Σ)′. The vector q0,Σ is treated as an additional vector of parameters and is estimated jointly with other parameters. The density of the data (given the parameters) is the mixture (over q=(q1, q2, …, qT )′) distribution:

(9)p(x|D,α,β,Γ,Γs,Σ,θΣ,q0,Σ)=p(x|D,α,β,Γ,Γs,Σ,θΣ,q0,Σ,q)p(q|θΣ,q0,Σ)dq,

where x=[x1x2xT ]′ denotes the full data set and D=[D1D2DT ]. The two densities on the right hand side of (9) are given as:

p(x|D,α,β,Γ1,Γs,Σ,θΣ,q0,Σ,q)=t=1TfN,n(xt|μt,Σt),

and

p(q|θΣ,q0,Σ)=t=1Tqt1fN,1(lnqt|ϕlnqt1,σq2).

The prior distribution and Bayesian VEC-MSF model

We are interested in making inference about both the parameter vector θ=(vecα′, vecβ′, vecΓ′, vecΓs ′, vechΣ′, θΣ′, q0,Σ)′ and the latent variables vector q. Within the Bayesian approach, parameters and latent variables are treated as random variables. The joint density of the vector of observations, latent variables and parameters, which determines the Bayesian model, can be written as follows:

(10)p(x,q,θ|D)=p(x|D,q,θ)p(q|θ)p(θ)==p(θ)[t=1TfN,n(xt|μt,Σt)][t=1Tqt1fN,1(lnqt|ϕlnqt1,σq2)],

where p(θ) denotes the density of prior distribution. This density is also conditioned by certain initial observations, which are omitted from our notation. To complete the Bayesian models, we specify the prior distribution of the parameter vector θ:

(11)p(θ)=p(α|β)p(β)p(Γ)p(Γs)p(Σ)p(θΣ)p(q0,Σ)I[0,1](|λ|max),

where I[a, b](.) denotes the indicator function of the interval [a, b] and λ is the vector of the eigenvalues of the companion matrix, that is the matrix of the form:

[Π˜+In+Γ1Γ2Γ1Γk1Γk2Γk1In0000In0000In0],

which makes it possible to write the analysed process in the VAR(1) form.

According to (11), we assume that certain blocks of the parameters are a priori independent, and that the stability condition is imposed on the parameters of the VEC process. Furthermore, for matrix Π the following parameterisations are used (as proposed by Koop, León-González, and Strachan 2010):

αβ=(αMΠ)(αMΠ1)AB,

where MΠ is an r×r symmetric positive-definite matrix, A and B are unrestricted matrices. Moreover, α=A(BB)12, and β=B(BB)12, so β has orthonormal columns, and it is an element of the Stiefel manifold Vr,m (represented by the matrix space of m×r matrices with orthonormal columns). The data inform only about the cointegration space, which is the element of the Grassmann manifold Gr,mr , i.e. the space of r-dimensional hyperplanes in ℝm . In order to take into account the many-to-one relationship between the Stiefel and the Grassmann manifolds, we normalise the columns of β to have positive first elements (with the help of a diagonal matrix whose elements are equal to either 1 or −1).

Next, we state the prior distributions (see Koop, León-González, and Strachan 2010 for the discussion).

  • For matrix B the matrix normal distribution is used:

    p(B|τ, r)=fmN (B|0, Ir , Pτ ), which leads us to the matrix angular central Gaussian (MACG) distribution for β: p(β)=fMACG (β|Pτ ) (see e.g. Chikuse 2002).

    The prior information for the space spanned by β may be incorporated into the model via matrix Pτ , which is constructed as follows:

    Pτ=HBHB+τHBHB,

    where HB is a matrix with orthonormal columns containing prior information about the cointegration space and HB represents its orthogonal complement. If we assume that parameter matrix Pτ in the MACG distribution is an identity matrix, we obtain the uniform distribution over the Stiefel manifold Vr,m and also the uniform distribution over the Grassmann manifold Gr,mr . Additionally, it is worth recalling that unless the prior information about the cointegration space of the highest considered rank is available, the researcher willing to perform Bayesian model selection (including inference about the cointegration rank) should use the uniform prior over the Grassmann manifold.

  • On matrix A we also impose the matrix normal distribution:

    p(A|μA,G,v,r)=fmN(A|μA,vIr,G).

    Parameters ν and τ control the degrees of informativeness of the distributions stated above, and may be either set arbitrarily by the researcher or estimated. In the latter case, usually inverse gamma prior distributions are used: p(v/sv , nv )=fIG(v|sv , nv ) and p(τ|sτ , nτ )=fIG (τ|sτ , nτ ) (see e.g. Koop, León-González, and Strachan 2010). Such values of sτ and nτ should be set that almost all the resulting prior probability is allocated close to zero. For τ close to zero one imposes most of the prior probability to the spaces close to those spanned by HB , whereas for τ equal 1 one gets noninformative priors for the estimated spaces (see Koop, León-González, and Strachan 2010 for the discussion).

The priors for the remaining parameters are as follows:

  • p(Γ|μΓ, H, h)=fmN (Γ|μΓ, H, hIl ), where l=n(k−1), H – an n×n positive-definite symmetric matrix,

  • p(Γs|μΓs,Hs,hs)=fmN(Γs|μΓs,Hs,hsIls), where Hs – an n×n positive-definite symmetric matrix, ls – the number of deterministic terms in Dt ,

  • where p(h|nh , sh )=fIG (h|nh , sh ) and p(hs |nhs , shs )=fIG (hs |nhs , shs ).

  • p(ϕ)∝fN,1(ϕ|μφ , Ωφ ) I(−1,1)(ϕ), – the normal density with mean μφ and variance Ωφ , truncated by the restriction |ϕ|<1;

  • p(σq2)=fIG(σq2|aσ,bσ) – the density of the inverse gamma distribution with the mean bσ /(aσ −1) and variance equal to bσ2/[(aσ1)(aσ2)2];

  • p(Σ)=fIW (Σ|ΩΣ, μΣ, n) – the inverted Wishart distribution with mean ΩΣ/(μΣn−1), μΣ>n+1;

  • p(a,b)∝I(0,1)(a+b) – the uniform distribution over the unit simplex;

  • p(lnq0)=fN,1 (lnq0|μq , Ωq );

  • p(s0,Σ)=fExp(s0,Σ|μq,Σ) – the exponential distribution with mean 1/μq .

    Finally, for θΣ we use the same priors as in (Osiewalski and Pajor 2009; Pajor and Osiewalski 2012).

MCMC methods for Bayesian estimation and model comparison

The joint posterior distribution, represented by p(q, θ|x), is too complicated to obtain a closed-form expression for any of its characteristics. To estimate these, we resort to MCMC methods, which allow one to generate a (pseudo-) random sample from the considered distribution.

Gibbs and Metropolis-Hastings algorithms

In order to draw from the joint posterior distribution of unknown parameters and latent variables we use a MCMC method, specifically, the Metropolis and Hastings algorithm within the Gibbs sampler. The procedure generates drawings from a joint distribution (as a stationary distribution) by sequentially sampling from the full conditional distributions (see, e.g. Gamerman 1997). Thus, we construct a Gibbs sampler with limiting distribution equal to the joint posterior distribution p(q, θ|x). The Gibbs sampler for the univariate stochastic volatility models, presented by (Jacquier, Polson, and Rossi 1994), can be easily generalised to the VEC-SV models.

In the VEC-MSF model one has to draw the [T+nr+mr+lsn+n2(k−1)+n(n+1)/2]-dimensional vector (θ′, q′)′, which is split up into T+12 separate Gibbs steps, 11 of which are quite straightforward, while the remaining one poses more of a challenge.

Before presenting the conditional posterior distributions for the model parameters, let us introduce some additional notation:

Wi=[wi,1wi,2wi,T] with wi,t=Σt1/2zi,t for i=0,1,2,3,

where Σt−1/2 denotes the matrix inverse of the Cholesky factor of Σt .

The equation for the observation t may be written as follows:

w0,t=w1,tvec(BA)+w2,tvec(Γ)+w3,tvec(Γs)+ut,   {ut}iiN(0,In).

We can now move on to presenting the conditional distributions of the VEC form parameters. They are either Normal (for the vectorisations of the matrix parameters) or inverse Gamma for the scalar ones:

  • (12)p(vec(A)|,x)=fN(vec(A)|μ¯A,Ω¯A),

    where

    Ω¯A=[(G11νIr)+(InB)W1W1(InB)]1,μ¯A=Ω¯A[(G11νIr)vec(μA)+(InB)W1[W0W2vec(Γ)W3vec(Γs)]],

  • (13)p(vec(B)|,x)=fN(vec(B)|μ¯B,Ω¯B),

    where

    Ω¯B=[(mIrP1/τ)+(AIm)W1W1(AIm)]1,

    μ¯B=Ω¯B(AIm)W1[W0W2vec(Γ)W3vec(Γs)],

  • (14)p(vec(Γ)|,x)=fN(vec(Γ)|μ¯Γ,Ω¯Γ),

    where Ω¯Γ=[(H11hIl)+W2W2]1,

    μ¯Γ=Ω¯Γ{(H11hIl)vec(μΓ)+W2[W0W1vec(BA)W3vec(Γs)]},

  • (15)p(vec(Γs)|,x)=fN(vec(Γs)|μ¯Γs,Ω¯Γs),

    where Ω¯Γs=[(Hs11hsIls)+W3W3]1,

    μ¯Γs=Ω¯Γs{(Hs11hsIls)vec(μΓs)+W3[W0W1vec(BA)W2vec(Γ)]},

  • (16)p(v|,x)=fIG(v|nv+nr2,sv+12tr[(AμA)G1(AμA)]),
  • (17)p(h|,x)=fIG(h|nh+nl2,sh+12tr[H1(ΓμΓ)(ΓμΓ)]),
  • (18)p(hs|,x)=fIG(hs|nhs+nls2,shs+12tr[Hs1(ΓsμΓs)(ΓsμΓs)]),
  • additionally, in the case of estimated τ with an inverse gamma prior imposed [τ~IG(nτ , sτ )], the full conditional posterior distribution is of the form:

    (19)p(τ|,x)|Pτ|r/2τnτ1exp{1τ[sτ+12tr(mBHBHBB)]}.

    Distribution (19) does not belong to any standard distribution families. To obtain a sample from it, a Metropolis-Hastings algorithm can be employed with the proposed values drawn from the inverse gamma distribution: IG(nτ+mr2,sτ+12tr(mBHBHBB)). For a discussion about the meaning of τ and further recommendations for its estimation see e.g. (Koop, León-González, and Strachan 2010).

The conditional posteriors of the MSF parameters are defined by the following inverse gamma, truncated normal, inverted Wishart, and normal distributions:

  • (20)p(σq2|,x)=fIG(σq2|μσ+T/2,s),

    with s=bσ+0.5t=1T(lnqtϕlnqt1)2;

  • (21)p(ϕ|,x)fN,1(ϕ|α,σq2A1)I(1,1)(ϕ),

    where

    α=A1(WQ+σq2Ωϕμϕ),Aij=WW+σq2Ωφ,

    W = (lnq0,....,lnqT–1), Q = (lnq1,.....,lnqt);

  • (22)p(Σ|,x)fIW(Σ|ΩΣ+TS0,μΣ+T),

    where S0=T1t=1Tqt1(xtμt)(xtμt);

  • (23)p(lnq0|,x)=fN,1(lnq0|b2b11,b11),

    where b1=ϕ2/σq2+1/Ωq,   b2=ϕlnq1/σq2+μq/Ωq.

    Sampling from all these conditional distributions is straightforward (also in the case of the truncated ones, for which rejection sampling is applied).

    The conditional posterior distributions of the unobserved variables in the VEC-MSF model are the following:

  • (24)p(qt|,x)qt(n+2)/2exp[(2qt)1(xtμt)Σ1(xtμt)]××exp[(2σq2)1(lnqtst)2],

    where for 1≤tT−1: st=[ϕ(lnqt1+lnqt+1)](1+ϕ2)1,σq2=σq2(1+ϕ2)1, and for t=T: sT=ϕlnqT1,σq2=σq2.

Density kernel (24) is not standard, so in order to sample from p(qt |·, x) we use an acception-rejection Metropolis-Hastings algorithm with the candidate generating distribution obtained by approximating the log-normal kernel in (24) by the kernel of the inverse gamma distribution with the same mean and variance as this log-normal distribution (see Jacquier, Polson, and Rossi, 1994). This yields an inverse Gamma distribution from which direct draws are easily generated (the product of two inverse Gamma distributions is still an inverse Gamma distribution). The candidate generating density is:

(25)q(qt|)qt(φ+1)exp(θtqqt1),

where φ=(12eσq2)(1eσq2)+n2,θtq=(φn21)exp(st+12σq2)+(xtμt)Σ1(xtμt)/2.

Let us now describe the Gibbs sampler used in the VEC-MSF-SBEKK model. The numerical procedure is much more computationally demanding and time-consuming. Because matrix Σ˜t in the SBEKK structure is a function of the past information set, the full conditional posterior distribution for the VEC structure is very complicated:

(26)p(vecA, vecB, vecΓ, vecΓs,v,h,hs,τ|.,x)p(vecA, vecB, vecΓ, vecΓs,v,h,hs,τ)t=1TfN,n(xt|μt,qtΣ˜t).

Drawing from the above full conditional posterior is riddled with serious numerical obstacles, unless its dimension (n) is small. In order to sample from (26) we implement the random walk Metropolis-Hastings algorithm with normal and truncated normal distributions centred at the previous values of the chain. The covariance matrix of the proposal distribution is a form of approximation of a specific part of the posterior covariance matrix, and it is determined by initial draws. For data sets used in our empirical illustrations the acceptance rate amounts to about 27%.

The conditional posterior distributions of ϕ, σq2 and lnq0 in the MSF-SBEKK structure are also truncated normal, inverse gamma and normal, respectively, as in (19), (21) and (23). On the other hand, the conditional posterior of the parameters featured by the SBEKK part of the model is very complicated:

  • (27)p(a,b,s0,Σ,Σ|,x)p(a,b,s0,Σ,Σ)t=1TfN,n(xt|μt,Σt).

    Parameters a, b, s0,Σ, and elements of matrix Σ can be sampled using the Metropolis-Hastings steps within the Gibbs sampler. We implement the random walk Metropolis-Hastings algorithm with truncated Student’s t distribution (with 3 degrees of freedom) centred at the previous values of the chain (similarly to Osiewalski and Pajor 2009). The covariance matrix of the latter is determined by initial draws of the algorithm. For large n sampling from this full conditional posterior distribution may be numerically more demanding.

The conditional posterior distributions of the unobserved variables in the VEC-MSF-SBEKK model are of the following form:

  • (28)p(qt|,x)qt1fN,1(lnqt|ϕlnqt1,σq2)fN,1(lnqt+1|ϕlnqt,σq2)fN,n(xt|μt,qtΣ˜t),

    for 1≤tT−1,

  • (29)p(qT|,x)qT1fN,1(lnqT|ϕlnqT1,σq2)fN,n(xT|μT,qTΣ˜T).

In order to sample from (28) and (29) we also use the Metropolis-Hastings algorithm. Following (Osiewalski and Pajor 2009), the candidate generating density is an inverse Gamma density:

(30)q(qt|)qt(φ+1)exp(θtq,Bqt1),

where φ=(12eσq2)(1eσq2)+n2,θtq,B=(φn21)exp(st+12σq2)+12(xtμt)Σ˜t1(xtμt).

We stress that the Metropolis-Hastings steps are implemented within the Gibbs sampler. Thus we create Metropolis subchains within the Markov chain constructed by the Gibbs sampler to sample from the non-standard distributions.

To summarise, Bayesian VEC-MSF or VEC-MSF-SBEKK models are analysed, using Gibbs sampling as a tool for simulating samples from the posterior distributions. In the case of the VEC-MSF-SBEKK structure the numerics is far more demanding even for small n. On the other hand, the forms of the full conditional posterior distributions in the VEC-MSF model make steps of the Gibbs sampler easy even for large n. However, the MSF specification is very restrictive since it assumes the same dynamics for all entries of Σt . This assumption is a price to be paid for the ease of numerical calculations.

Harmonic mean estimator with Lenk’s correction

The marginal data densities are essential in the formal Bayesian model selection and model averaging. (Newton and Raftery 1994) proposed a simple (and hence popular) method of estimation of the marginal data density based on the harmonic mean. The harmonic mean estimator (HME) is given as:

(31)p^HME(x)=[1mp=1m1p(x|θ(p),q(p))]1

where {θ(p),q(p)}p=1m are the draws generated from the posterior distribution of the parameters and latent variables, using the MCMC methods.

Even though the HME is consistent (see Newton and Raftery 1994), the computed HME suffers from some serious shortcomings, for example it overestimates the marginal data density. In the paper (Lenk 2009) the source of this “simulation pseudo-bias” of the HME is identified, and several methods of estimating the “bias” adjustment factor are proposed. The adjusted HME for the marginal data density, proposed by (Lenk 2009), is given by the formula:

(32)p^AHME(x)=P^(C)[1mp=1m1p(x|θ(p),q(p))]1,

where P^(C) is an evaluation of the prior probability of subset C⊆Θ [i.e. P(C)], where Θ denotes the space of parameters θ and latent variables q. In Equation (32) {θ(p),q(p)}p=1m are drawn from the posterior distribution of the vector (θ′, q′)′ restricted to the subset C. Following (Lenk 2009) we assume that C={(θ′, q′)}:p(x|θ, q)≥L}, where L=min(θ,q){(θ(p),q(p))}p=1mp(x|θ,q).P(C) is approximated using importance sampling, drawing from the truncated [to the support of p(θ)] multivariate normal distribution for all parameters and the prior distribution for the latent variables. Starting from the identity: P(C)=Θp(q|θ)p(θ)IC(θ,q)dθdq, we obtain: P(C)=Θp(θ)s(θ)IC(θ,q)s(θ)p(q|θ)dθdq, where p(θ) is the probability density function of the prior distribution of parameters, p(q|θ) is the density function of the prior distribution for the latent variables (given the parameters), s(θ) is the probability density function of the truncated multivariate normal distribution. The last identity yields the importance sampling estimator of P(C) with the importance function of the form: s(θ) p(q|θ). The parameters of the truncated normal distribution (the mean and covariances) are obtained basing on the MCMC draws from the posterior distribution.

The estimate of P(C) is:

(33)P^(C)=1Jj=1Jp(θ(j)s)IC(θ(j)s,q(j)s)s(θ(j)s),

where {θ(j)s,q(j)s}j=1J are drawn from the importance sampling distribution defined by the probability density function p(q|θ) s(θ). The normalising constants of probability density functions s(θ) and p(θ) are not known, thus they are calculated, using the direct sampling method.

Empirical illustrations

Data used in the paper come from the Polish economy. In the first example we analyse financial data of daily quotations on three major exchange rates: PLN/EUR, PLN/USD and EUR/USD. The second one focuses on macroeconomic data of unemployment, inflation and interest rates.

Long-run relationships among exchange rates

We start by modelling the exchange rates (where, on the ground of lack of arbitrage opportunities, the cointegrating vector may be assumed to be known a priori) to show that our model and numerical methods are suitable for modelling a long-run relationship in financial data. Let us consider two average daily main Polish official exchange rates: the zloty (PLN) values of the US dollar, and the zloty (PLN) values of the euro, over the period from January 3, 2005 to September 30, 2011 (downloaded from the website of the National Bank of Poland). The dataset of the daily logarithmic growth rates (expressed in percentage terms) consists of 1662 observations (for each series). The first observation is sacrificed as an initial condition, thus T=1661. The Polish official exchange rates are linked to the exchange rates quoted on Forex, the international currencies market. Thus, while building time-series models for the two Polish exchange rates [EUR/PLN (x2,t ) and USD/PLN (x3,t )], we introduce some extra variable from the international market – the euro value of the US dollar, EUR/USD (x1,t , downloaded from http://stooq.com.).

We use the same dataset as in (Pajor 2011), where exogeneity in models with latent variables was considered, and the cointegrating vector was assumed to be known. The relationship: (EUR/PLN)/(USD/PLN)≈EUR/USD was introduced by assuming that this relation (in logarithmic terms) is a cointegration equation in the sense of (Engle and Granger 1987). This yielded the cointegrating vector (1, −1, 1) and the long-run (equilibrium) relationship: ln x2,t −ln x3,t =ln x1,t . The assumption that lnx1,t , lnx2,t and lnx3,t are cointegrated has been checked informally using the Phillips and Perron test applied to the series ECMt =100(ln x1,t −ln x2,t +ln x3,t ). In this paper the cointegrating vector is formally estimated.

The Polish official exchange rates with the addition of certain variables related to the foreign exchange market were previously modelled by (Osiewalski and Pipień 2004) within the VEC-GARCH structure, and under assumption that the cointegrating vector is known.

The data are plotted in Figure 1, depicting the graphs of logarithms (multiplied by 100) of exchange rates in terms of their levels and of their increments. It can be seen from the graphs that the growth rates are centred around zero, and feature time-varying volatility and outliers. The series are more volatile in the period of the financial crisis 2008–2009. Moreover, the official exchange rates of the National Bank of Poland appear more volatile than the EUR/USD exchange rates.

Figure 1: Exchange rates of USD/PLN, EUR/PLN and EUR/USD (January 3, 2005 – September 30, 2011).
Figure 1:

Exchange rates of USD/PLN, EUR/PLN and EUR/USD (January 3, 2005 – September 30, 2011).

A visual inspection of the data reveals that they do not satisfy the constant conditional variance assumption, and therefore the conditional covariance matrix of the respective VAR models need to be stochastic. Summary statistics for the analysed series are shown in Table 1. The arithmetic means of the growth rates of the official Polish exchange rates are positive, but the standard deviations are relatively high. A much higher than 3 value of the kurtosis pronounced leptokurtosis of the empirical distribution. As expected, the EUR/PLN and USD/PLN exchange rates are positively correlated.

Table 1:

Descriptive statistics of analyzed exchange rates (xi,t ) and of daily growth rates (yi,t ).

yi,txi,t
USD/PLNEUR/PLNEUR/USDUSD/PLNEUR/PLNEUR/USD
Mean0.00470.0050−0.00032.93193.93431.3513
SD0.02800.01820.01720.00810.00670.0025
Skewness0.48020.2530−0.0113−0.6198−0.04190.3538
Kurtosis7.61018.84044.95943.33653.89302.4680
Minimum−6.7485−4.5895−3.11632.02203.20261.1670
Maximum6.26774.14673.41573.89784.89991.5995
Empirical correlations
USD/PLN10.7502−0.308210.8041−0.8452
EUR/PLN1−0.13131−0.3742
EUR/USD11

Due to the typical properties of daily growth rates of exchange rates, we consider the VEC-MSF and the VEC-MSF-SBEKK models with one cointegrating vector (r=1), containing an unrestricted vector of constants, and the number of lags equal to 2 (k=2). [1] We set the following values of hyperparameters:

  • for matrix B: Pτ =I3 (the uniform distribution over the Stiefel manifold), τ=1;

  • for matrix A: G=I3, μA =03×1, sv =2, nv =3;

  • for matrix Γ: μΓ=03×3, H=I3, sh =2, nh =3;

  • for matrix Γs : μΓs =03×1, Hs =I3, shs =2, nhs =3;

  • for MSF parameters: μφ =0, Ωφ =100, aσ =1, bσ =0.005, μq =0, Ωq =100, μΣ=5, ΩΣ=I3,

  • for SBEKK parameters: μq,Σ=1.

As mentioned above, the relationship: (EUR/PLN)/(USD/PLN)≈EUR/USD indicates that the posterior cointegrating space can be spanned by the vector (1, −1, 1). One of the objectives of the study is to check whether formal Bayesian estimation of the cointegrating space leads to the same cointegrating vector, which was a priori assumed in our previous research.

The posterior means and standard deviations (in parentheses) of the cointegrating vector as well as the adjustment coefficients are presented bellow. As suggested by (Villani 2006), the point estimates of the cointegrating vectors are obtained with the use of a loss function based on the projective Frobenius distance β^=argminβ˜Gr,mrE[l(β,β˜)], where l(β,β˜)=2(rtr(βββ˜β˜)). This function is minimised at β^=(v1,v2,,vr), where vi (i=1, 2, …, r) are the eigenvectors corresponding to the r largest eigenvalues of the matrix E(ββ′). The measures of cointegration space variation are calculated as τsp(β)=ri=1rλir(mr)/m, where λi is the i-th largest eigenvalue of E(ββ′). This measure takes values from 0 to 1. The lower is the value, the tighter is the posterior distribution (see Villani 2006). To make comparison with the theoretical cointegrating vector more readable, the point estimates of the cointegrating and the adjustment coefficient vectors are normalised in such a manner that the first element of β equals 1. Note that in the case of one cointegrating vector such a procedure does not alter the cointegration space. We obtained the following posterior means (and standard deviations) of β and α:

E(β|x,MSF)=[1.0000.9970.999],τsp(β)=0.0007,

E(α|x,MSF)(D(α|x,MSF))=[0.0800.0170.961(0.097)(0.094)(0.116)],

E(β|x,MSFSBEKK)=[1.0000.9981.000],τsp(β)=0.0006,

E(α|x,MSFSBEKK)(D(α|x,MSFSBEKK))=[0.0540.3140.624(0.081)(0.070)(0.090)].

Overall, the results confirm that exchange rates of USD/PLN, EUR/PLN and EUR/USD are cointegrated with coefficients (1, −1, 1).

In the VEC-MSF model the posterior means and standard deviations (in parentheses) of the VAR parameters are:

[Δxt,1Δxt,2Δxt,3]=[0.0880.0201.602(0.119)(0.107)(0.624)][1.0000.997(0.0007)0.999][xt1,1xt1,2xt1,3]++[0.0760.0400.013(0.050)(0.041)(0.030)0.1150.0720.020(0.047)(0.039)(0.028)0.1350.0420.006(0.060)(0.048)(0.035)][Δxt1,1Δxt1,2Δxt1,3]+[0.003(0.028)0.032(0.022)0.206(0.201)]+ε^t,

and

lnqt=0.987lnqt1+0.0207ηt+η^t,(0.0063)(0.0063)

E(lnq0|x,MSF)=0.055,   D(lnq0|x,MSF)=0.488,

E(Σ|x,MSF)(D(Σ|x,MSF))=[0.5520.0410.230(0.223)(0.022)(0.095)0.0410.5030.534(0.022)(0.204)(0.217)0.2300.5340.780(0.095)(0.217)(0.315)],

where (ε^t,η^t) is the vector of relevant residuals.

In turn, in the VEC-MSF-SBEKK model the posterior means and standard deviations of the VAR parameters are:

[Δxt,1Δxt,2Δxt,3]=[0.0540.3140.624(0.081)(0.070)(0.090)][1.0000.998(0.0006)1.000][xt1,1xt1,2xt1,3]+[0.0800.0220.027(0.042)(0.040)(0.030)0.2950.0490.014(0.033)(0.028)(0.017)0.3120.0210.006(0.044)(0.043)(0.030)][Δxt1,1Δxt1,2Δxt1,3]+[0.014(0.022)0.100(0.064)0.109(0.121)]+ε^t,

and

lnqt=0.664lnqt1+0.162ηt+η^t,(0.086)(0.043)

E(lnq0|x,MSFSBEKK)=5.760,   D(lnq0|MSFSBEKK)=1.753,

E(a|x,MSFSBEKK)=0.029,   D(a|MSFSBEKK)=0.003,

E(b|x,MSFSBEKK)=0.969,   D(b|MSFSBEKK)=0.004,

E(q0,Σ|x,MSFSBEKK)=0.044,   D(q0,Σ|x,MSFSBEKK)=0.026,

E(Σ|x,MSFSBEKK)(D(Σ|x,MSFSBEKK))=[0.6220.0180.186(0.460)(0.134)(0.271)0.0180.2190.138(0.134)(0.099)(0.124)0.1860.1380.414(0.271)(0.124)(0.225)].

Similar characteristics of posterior distributions were obtained in the VEC-SV models with the known cointegration relation (therefore they are not presented here), with the marginal posterior distributions being only less dispersed. The inference about the individual volatility of each time series (measured by the conditional standard deviations) is also very similar in both considered VEC-SV models: with either the known or estimated cointegrating vector. The time plots of conditional standard deviations (for each t=1, 2, …, T; T=1661) are presented in Figure 2, where the upper line represents the posterior mean plus two standard deviations, and the lower one – the posterior mean minus two standard deviations. The models produce volatility peaks at the same time moments. Note that the dynamic pattern of the volatility for all returns is the same in the VEC-MSF model, which is due to the fact that the dynamics of volatilities is governed by only one latent process. However, in spite of the simplicity of the structure of the conditional covariance matrix, we allow the variances of multivariate returns to vary over time. The plots of the main posterior characteristics of conditional correlations are presented in Figure 3. One can easily notice that the constant conditional correlation hypothesis (represented by the MSF specification) is not supported be the data, which is consistent with the findings presented by (Osiewalski and Osiewalski 2013). Due to numerical problems in calculating the marginal likelihoods, we do not present formal Bayesian model comparison of the MSF and MSF-SBEKK specifications. The use of the Lindley-type test (based on the highest posterior density region, see, e.g. Box and Tiao 1973; Bauwens, Lubrano, and Richard 1999) would lead to rejection of the hypothesis. However, the assumption of constant conditional correlations does not influence the inference about the cointegration relationship. The distance measures between the estimated cointegrated spaces and the theoretical one [i.e. spanned by the vector (1, −1, 1)][2] obtained in the VEC-MSF and the VEC-MSF-SBEKK models are very close to zero and almost the same (although they are slightly lower in the VEC-MSF models). In the set of the VEC-MSF-SBEKK models the distance measure takes the highest value in the model with five lags (5.5*10−6) and they fall to 3.3*10−6 in the model with two lags.

Figure 2: Conditional standard deviations (posterior mean±2 standard deviations). The results are obtained in the VEC-MSF and VEC-MSF-SBEKK models with r=1.
Figure 2:

Conditional standard deviations (posterior mean±2 standard deviations). The results are obtained in the VEC-MSF and VEC-MSF-SBEKK models with r=1.

Figure 3: Conditional correlations (posterior mean±2 standard deviations). The results are obtained in the VEC-MSF and VEC-MSF-SBEKK models with r=1 (cointegrating vector is estimated).
Figure 3:

Conditional correlations (posterior mean±2 standard deviations). The results are obtained in the VEC-MSF and VEC-MSF-SBEKK models with r=1 (cointegrating vector is estimated).

Our application of the VEC-SV models to exchange rates yields results, which are consistent with the theory. The cointegrating space appears indeed to be spanned by the vector (1, −1, 1). This illustration shows that our methods can accurately estimate the long-run relationship in the VEC model with a time-variable conditional covariance matrix.

A small model for monetary policy

In the second example the methods developed in the paper are applied to a system with three Polish macroeconomic variables. Before presenting the results, it is worth emphasizing that the papers cited in the introduction pertain predominantly to the US economy. To the best of our knowledge, similar analyses for the Polish economy, with the use of the (Bayesian) VEC-SV models, have not been carried out as yet. Therefore, the paper aims to fill this gap.

In this study a so called small model of monetary policy is considered. The model is based on three major macroeconomic variables: inflation of consumer prices, Δpt , unemployment rate, Ut , and short-term interest rates, rt . The latter can be regarded as a proxy for the monetary policy.

The seasonally unadjusted data cover the period from 1995Q1 to 2012Q4. The seasonality of the analysed series is modelled in a deterministic manner, via zero-mean seasonal dummies. The first 3 years are used as a training sample to determine the hyperparameters of the prior distribution for the cointegration space (the Johansen procedure in the model with two lags and r cointegrating vectors β has been used, which further have been utilized to calculate Pτ=ββ+0.5ββ). The next five quarters are sacrificed as initial conditions. Eventually, there are 55 modelled observations (see Figure 4).

Figure 4: The analysed macro-data series (1995Q1 – 2012Q4).
Figure 4:

The analysed macro-data series (1995Q1 – 2012Q4).

As the analysed time series are rather short, it is very important to test whether the variances and covariances are constant or not. A visual inspection of the time paths of the series suggests that one could expect a time-varying covariance structure. The empirical validity of this presumption is to be formally tested via Bayesian model comparison. The set of competing models include 72 non-nested specifications. Along with the number of latent processes driving covariances (l=0 in the a VEC model, l=1 in the VEC-MSF and VEC-MSF-SBEKK forms), the models can differ in the VAR order (k ϵ {2, 3, 4, 5}), the type of incorporated deterministic term (d=4 denotes a restricted constant, d=3 – an unrestricted one) and the cointegration rank (r can be equal to 1 or 2). Additionally, we consider VAR models for the first differences of the analysed processes (r=0) and VAR models for the levels of these processes (r=3). We assume that the prior probability of each of the specifications compared is equal to 0.0139, which is the reciprocal of the total number of models considered.

Table 2 summarizes the results of the model comparison. In all three considered cases of the covariance structure (homoscedastic, MSF and MSF-SBEKK), the model that ranks first is the one that features two lags, one cointegrating relation and a constant restricted to the cointegration space. Generally, among the models with the highest posterior probability are almost exclusively those with either no or only one cointegrating relation, a time-varying covariance matrix and a short lag structure. Note that the number of lags seems to be less important than the cointegration rank. Due to high values of numerical standard errors of HME the ranking positions of some models cannot be clearly identified, but even if we take into account the error for the VEC-MSF-SBEKK model with k=2, d=4 and r=1, there is no doubt that the marginal data density for this specification is the highest. We can also observe high positive correlations between ordering of the models in all considered cases of the covariance structure. Table 3 presents models with the highest posterior probability. Among them there are only the VEC-MSF-SBEKK and VEC-MSF specifications, so there is a strong evidence that the covariance matrix is time-varying. Only the first three models obtained higher posterior probability than the assumed prior probability, and almost all posterior probability is gathered by the VEC-MSF-SBEKK model with two lags in the VAR form, one cointegration relation and a constant restricted to the cointegration space. The second one is the model with the same VEC structure but with the MSF covariance structure, and the third specification is the VAR-MSF-SBEKK model for the first differences.

Table 2:

Natural logarithm of estimates of marginal likelihoods, numerical standard errors and ranks.

k, d, rVECVEC-MSFVEC-MSF-SBEKK
ln p(x)NSERank (under all models)ln p(x)NSERank (under all models)ln p(x)NSERank (under all models)
2,3,0−196.940.4624 (23)−182.060.5624 (5)−177.580.3224 (3)
2,3,1−198.690.504 (26)−185.610.503 (6)−181.390.503 (4)
2,3,2−203.160.987 (37)−187.890.755 (8)−202.691.1712 (36)
2,3,3−214.661.0319 (54)−204.340.8920 (40)−225.801.3017 (63)
2,4,1−191.890.491 (15)−177.380.421 (2)−174.290.871 (1)
2,4,2−196.880.602 (22)−190.411.047 (13)−202.451.0311 (34)
3,3,0−204.551.033 (41)−190.550.872 (14)−187.380.602 (7)
3,3,1−206.530.3612 (45)−196.560.8512 (21)−201.641.0210 (32)
3,3,2−211.160.8416 (49)−200.301.0615 (30)−223.441.1015 (61)
3,3,3−220.831.0923 (60)−212.930.8822 (50)−238.381.1621 (67)
3,4,1−199.040.525 (27)−187.590.514 (8)−189.790.646 (12)
3,4,2−206.570.7413 (46)−193.210.7010 (18)−231.970.6819 (65)
4,3,0−210.720.929 (48)−197.661.038 (24)−192.890.644 (17)
4,3,1−213.700.6718 (52)−201.141.0516 (31)−216.740.7914 (57)
4,3,2−216.670.6621 (56)−206.401.2321 (44)−247.651.3722 (68)
4,3,3−219.611.4022 (59)−219.120.9623 (58)−237.051.0020 (66)
4,4,1−203.210.818 (38)−192.000.809 (16)−193.770.558 (20)
4,4,2−215.670.8920 (55)−199.620.7314 (28)−225.091.0516 (62)
5,3,0−205.241.1515 (42)−193.251.0413 (19)−188.091.057 (9)
5,3,1−209.380.8114 (47)−202.391.0317 (33)−213.940.8813 (53)
5,3,2−212.960.7517 (51)−204.080.7519 (39)−253.581.1823 (69)
5,4,1−199.911.036 (29)−188.610.606 (11)−197.880.799 (25)
5,4,2−205.821.2511 (43)−202.461.1118 (35)−231.001.1118 (64)

k, VAR order; d, deterministic term; r, cointegration rank; r=3 – VAR for levels, r=0 – VAR for first differences. The Pτ matrix is determined with the help of the Johansen procedure employed in the model with r cointegrating relations.

Table 3:

The most probable models.

VAR order (k)Deterministic term (d)Cointegration rank (r)Model (l)P(M(k,d,r,l)|x)
241VEC-MSF-SBEKK0.92261
241VEC-MSF0.04172
230VEC-MSF-SBEKK0.03435
231VEC-MSF-SBEKK0.00076
230VEC-MSF0.00039
231VEC-MSF0.00001
330VEC-MSF-SBEKK0.000002
341VEC-MSF0.000002
232VEC-MSF0.000001

The Pτ matrix is determined with the help of the Johansen procedure employed in the model with r cointegrating relations.

Figure 5 displays the time plots of conditional standard deviation and the cross correlations of the innovation processes in each of the three equations of the model with the highest posterior probability. Black lines represent posterior means, and the grey ones – the means plus/minus two posterior standard deviations. The posterior means of the conditional standard deviations lead us to the conclusion that all the error terms are heteroscedastic, but the areas between grey lines representing the uncertainty suggest that the Lindley-type test would probably indicate that the error term in the inflation equation could be homoscedastic. In the unemployment equation we can observe a high pick of volatility in year 2003, when the unemployment rate was the highest. The volatility of the interest rate shocks was clearly higher at the beginning of the analysed sample (in years 1999–2002), when the interest rates were high.

Figure 5: Conditional standard deviations and correlations (posterior mean±2 standard deviations). The results are obtained in the VEC-MSF-SBEKK model with r=2, k=1, d=4.
Figure 5:

Conditional standard deviations and correlations (posterior mean±2 standard deviations). The results are obtained in the VEC-MSF-SBEKK model with r=2, k=1, d=4.

The posteriors of conditional correlations are quite tight, and the presented figures clearly indicate that these correlations keep changing during the analysed period. It can be seen that all the conditional correlations of error terms for interest rate and for inflation are positive and relatively high (Figure 5, graph 5). The conditional correlations between the error terms for the inflation and unemployment as well as for the unemployment and interest rate change their signs (Figure 5, graphs 4 and 6). Also, they share similar patterns, which probably results from positive correlations of the error terms in the equations of the interest rate and inflation.

As has been mentioned before, the analysed series is rather short, therefore we have decided to perform a sensitivity analysis of the Bayesian model comparison for two different prior specifications for the cointegration space. Contrary to the previous approach of setting the Pτ matrix, now it is determined with the help of the Johansen procedure employed in the model with n−1 (rather than r) cointegrating relations. Tables 4 and 5 summarize the obtained results. The ranking of the most probable models with the homoscedastic covariance structure remains almost unchanged, whereas in the two other cases under study (i.e. VEC-MSF and VEC-MSF-SBEKK) we can observe some major differences in their performance. It emerges now that the models with two lags and no cointegrating relations are superior. As for the covariance structure we still have strong evidence for its time-variability, and almost all posterior model probability is gained by the MSF-SBEKK specification. Summing up, our sensitivity analysis shows that, as it might have been expected, in such richly parameterised models and with samples of so small sizes the prior assumptions affect the posterior results quite considerably.

Table 4:

Natural logarithm of estimates of marginal likelihoods, numerical standard errors and ranks.

k, d, rVECVEC-MSFVEC-MSF-SBEKK
ln p(x)NSERank (under all models)ln p(x)NSERank (under all models)ln p(x)NSERank (under all models)
2,3,0−196.940.463 (20)−182.060.561 (2)−177.580.321 (1)
2,3,1−199.180.814 (23)−188.740.564 (8)−196.390.357 (17)
2,3,2−203.160.985 (30)−187.890.753 (6)−202.691.1710 (29)
2,3,3−214.661.0317 (50)−204.340.8918 (33)−225.801.3017 (63)
2,4,1−195.251.091 (15)−184.210.792 (3)−187.700.883 (5)
2,4,2−196.880.602 (19)−190.411.045 (9)−202.451.039 (27)
3,3,0−204.551.036 (34)−190.550.876 (10)−187.380.602 (4)
3,3,1−209.500.8313 (45)−198.110.7412 (22)−206.590.6811 (40)
3,3,2−211.160.8415 (47)−200.301.0614 (25)−223.441.1014 (60)
3,3,3−220.831.0923 (58)−212.930.8822 (48)−238.381.1621 (67)
3,4,1−208.460.7211 (43)−195.700.499 (16)−193.880.806 (14)
3,4,2−206.570.749 (39)−193.210.707 (12)−231.970.6819 (65)
4,3,0−210.720.9214 (46)−197.661.0311 (21)−192.890.645 (11)
4,3,1−215.760.8219 (52)−207.441.0621 (41)−222.850.8813 (59)
4,3,2−216.670.6621 (54)−206.401.2320 (38)−247.651.3722 (68)
4,3,3−219.611.4022 (57)−219.120.9623 (56)−237.051.0020 (66)
4,4,1−216.631.1120 (53)−203.451.1416 (31)−217.021.0412 (55)
4,4,2−215.670.8918 (51)−199.620.7313 (24)−225.091.0516 (62)
5,3,0−205.241.157 (35)−193.251.048 (13)−188.091.054 (7)
5,3,1−208.910.8312 (44)−205.881.1619 (37)−224.540.7915 (61)
5,3,2−212.960.7516 (49)−204.080.7517 (32)−253.581.1823 (69)
5,4,1−207.990.7710 (42)−196.731.0310 (18)−202.040.748 (26)
5,4,2−205.821.258 (36)−202.461.1115 (28)−231.001.1118 (64)

k, VAR order; d, deterministic term; r, cointegration rank; r=3 – VAR for levels, r=0−VAR for first differences. The Pτ matrix is determined with the help of the Johansen procedure employed in the model with n−1 cointegrating relations.

Table 5:

The most probable models.

VAR order (k)Deterministic term (d)Cointegration rank (r)Model (l)P(M(k,d,r,l)|x)
230VEC-MSF-SBEKK0.98737
230VEC-MSF0.01116
241VEC-MSF0.00130
330VEC-MSF-SBEKK0.00005
241VEC-MSF-SBEKK0.00004
232VEC-MSF0.00003
530VEC-MSF-SBEKK0.00003
231VEC-MSF0.000014
242VEC-MSF0.000003
330VEC-MSF0.000002

The Pτ matrix is determined with the help of the Johansen procedure employed in the model with n−1 cointegrating relations.

Concluding remarks

The paper is focused on the construction and estimation of Bayesian Vector Error Correction -Stochastic Volatility (VEC-SV) models. As regards the stochastic volatility part of the specification we start with the MSF structure, which is the simplest among the MSV structures, thereby allowing a parsimonious way of modelling time-variability of volatility. Such a framework enables us to capture the long-run relationships among processes, and also to formally examine the presence of time-variation in the conditional covariance matrix.

The main conclusion of this paper is that both the heteroskedasticity and the long-run relationships have to be taken into account simultaneously in macroeconomic models as well as in financial analyses. Our application of the VEC-MSF and VEC-MSF-SBEKK models to exchange rates demonstrates that our methods can accurately estimate the long-run relationships in the VEC model for daily market prices. The results of Bayesian model comparison performed in the macroeconomic model example indicate that the ranking of different VEC structures (covering various numbers of lags and cointegrating relations, and two types of incorporated deterministic terms) is rather stable across different covariance matrix specifications.

The MSF structure allows conditional variances of the joint distribution of variables to change over time, but conditional correlations remain constant. In the MSF-SBEKK structure the conditional covariance matrix depends on the past of the process, as in the MGARCH models, which can lead to serious numerical obstacles. Thus, we deem that further research should rather include an MSV specification with stochastic conditional correlations based on more than one latent process.

On the other hand, time-variability of shocks has a potential effect on the propagation mechanism of the innovations (see, e.g. Koop, León-González, and Strachan 2009), so allowing parameters in the conditional mean of the process to be time-varying appear one of the most wanted generalisations.

Acknowledgments

We would like to thank the anonymous referees for very useful comments and suggestions that helped us to prepare the final version of our paper, and Łukasz Kwiatkowski for language verification of the manuscript. Research was partly financed from the funds granted to the Faculty of Management at Cracow University of Economics, within the framework of the subsidy for the maintenance of research potential. Research was partly financed from the funds granted to the Faculty of Finance and Law at Cracow University of Economics, within the framework of the subsidy for the maintenance of research potential.

References

Bauwens, Luc, Michel Lubrano, and Jean-François Richard. 1999. Bayesian Inference in Dynamic Econometric Models. Oxford University Press, New York.10.1093/acprof:oso/9780198773122.001.0001Suche in Google Scholar

Belmonte, Miguel, and Gary Koop. 2014. “Model Switching and Model Averaging in Time-Varying Parameter Regression Models.” In Bayesian Model Comparison, edited by Ivan Jeliazkov and Dale J. Poirier, Advances in Econometrics, Vol. 34, 45–69. Bingley: Emerald Group Publishing Limited. Accessed June 12, 2015. doi: 10.1108/S0731-905320140000034004.10.1108/S0731-905320140000034004Suche in Google Scholar

Box, George E.P., and George C. Tiao. 1973. Bayesian Inference in Statistical Analysis. Massachusetts: Addison-Wesley Publishing Company.Suche in Google Scholar

Cavaliere, Giuseppe, Luca De Angelis, Anders Rahbek, and A.M. Robert Taylor. 2015. “A Comparison of Sequential and Information-based Methods for Determining the Co-integration Rank in Heteroskedastic VAR Models.” Oxford Bulletin of Economics and Statistics 77: 106–128.10.1111/obes.12051Suche in Google Scholar

Chikuse, Yasuko. 2002. “Statistics on Special Manifolds.” Lecture Notes in Statistics 174. Springer-Verlag, New York.10.1007/978-0-387-21540-2Suche in Google Scholar

Cogley, Timothy, and Thomas J. Sargent. 2005. “Drifts and Volatilities: Monetary Policies and Outcomes in the Post WWII U.S.” Review of Economic Dynamics 8: 262–302.10.2139/ssrn.2482531Suche in Google Scholar

Engle, Robert F., and C.W.J. Granger. 1987. “Co-integration and Error Correction: Representation, Estimation and Testing.” Econometrica 55: 251–276.10.2307/1913236Suche in Google Scholar

Gamerman, Dani. 1997. Markov Chain Monte Carlo. Statistic Simulation for Bayesian Inference. London: Chapman and Hall.Suche in Google Scholar

Herwartz, Helmut, and Helmut Lütkepohl. 2011. “Generalized Least Squares Estimation for Cointegration Parameters Under Conditional Heteroskedasticity.” Journal of Time Series Analysis 32 (3): 281–291.10.1111/j.1467-9892.2010.00698.xSuche in Google Scholar

Jacquier, Eric, Nicholas G. Polson, and Peter E. Rossi. 1994. “Bayesian Analysis of Stochastic Volatility Models.” Journal of Business and Economic Statistics 12 (4): 371–389.10.1080/07350015.1994.10524553Suche in Google Scholar

Jacquier, Eric, Nicholas G. Polson, and Peter E. Rossi. 1995. “Priors and Models for Multivariate Stochastic Volatility Models.” CIRANO Working Paper No. 95s-18, Montréal. Accessed September 19, 2016. http://www.cirano.qc.ca/files/publications/95s-18.pdf.Suche in Google Scholar

Koop, Gary. 2012. “Using VARs and TVP-VARs with Many Macroeconomic Variables.” Central European Journal of Economic Modelling and Econometric 4 (3): 43–167.Suche in Google Scholar

Koop, Gary, Roberto León-González, and Rodney W. Strachan. 2009. “On the Evolution of Monetary Policy.” Journal of Economic Dynamics and Control 33: 997–1017.10.1016/j.jedc.2008.11.003Suche in Google Scholar

Koop, Gary, Roberto León-González, and Rodney W. Strachan. 2010. “Efficient Posterior Simulation for Cointegrated Models with Priors on the Cointegration Space.” Econometric Reviews 29: 224–242.10.1080/07474930903382208Suche in Google Scholar

Koop, Gary, Roberto León-González, and Rodney W. Strachan. 2011. “Bayesian Inference in the Time Varying Cointegration Model.” Journal of Econometrics 165: 210–220.10.1016/j.jeconom.2011.07.007Suche in Google Scholar

Koop, Gary, and Simon M. Potter. 2011. “Time Varying VARs with Inequality Restrictions.” Journal of Economic Dynamics and Control 35: 1126–1138.10.1016/j.jedc.2011.02.001Suche in Google Scholar

Larsson, Rolf, and Mattias Villani. 2001. “A Distance Measure between Cointegration Spaces.” Economics Letters 70: 21–27.10.1016/S0165-1765(00)00349-9Suche in Google Scholar

Lenk, Peter. 2009. “Simulation Pseudo-bias Correction to the Harmonic Mean Estimator of Integrated Likelihoods.” Journal of Computational and Graphical Statistics 18: 941–960.10.1198/jcgs.2009.08022Suche in Google Scholar

Newton, Michael A., and Adrian E. Raftery. 1994. “Approximate Bayesian Inference by the Weighted Likelihood Bootstrap.” Journal of the Royal Statistical Society B 56 (1): 3–48.10.1111/j.2517-6161.1994.tb01956.xSuche in Google Scholar

Osiewalski, Jacek. 2009. “New Hybrid Models of Multivariate Volatility (a Bayesian Perspective).” Przegląd Statystyczny (Statistical Review) 56 (1): 15–22.Suche in Google Scholar

Osiewalski, Jacek, and Anna Pajor. 2009. “Bayesian Analysis for Hybrid MSF–SBEKK Models of Multivariate Volatility.” Central European Journal of Economic Modelling and Econometrics 1 (2): 179–202.Suche in Google Scholar

Osiewalski, Jacek, and Anna Pajor. 2010. “Bayesian Value-at-Risk for a Portfolio: Multi-and Univariate Approaches using MSF–SBEKK Models.” Central European Journalof Economic Modelling and Econometrics 2 (4): 253–277.Suche in Google Scholar

Osiewalski, Krzysztof, and Jacek Osiewalski. 2012. “Missing Observations in Daily Returns –Bayesian Inference within the MSF-SBEKK Model.” Central European Journal of Economic Modelling and Econometrics 4 (3): 169–197.Suche in Google Scholar

Osiewalski, Krzysztof, and Jacek Osiewalski. 2013. “A Long-Run Relationship between Daily Prices on Two Markets: The Bayesian VAR(2)–MSF-SBEKK Model.” Central European Journal of Economic Modelling and Econometrics 5 (1): 65–83.Suche in Google Scholar

Osiewalski, Jacek, and Mateusz Pipień. 2004. “Bayesian comparison of bivariate GARCH processes. The Role of the Conditional Mean Specification”. In New Directions in Macromodelling, edited by Aleksander Welfe, 173–196, Amsterdam: Elsevier.10.1016/S0573-8555(04)69007-9Suche in Google Scholar

Pajor, Anna. 2011. “A Bayesian Analysis of Exogeneity in Models with Latent Variables.” Central European Journal of Economic Modelling and Econometrics 3 (2): 49–73.Suche in Google Scholar

Pajor, Anna, and Jacek Osiewalski. 2012. “Bayesian Value-at-Risk and Expected Shortfall for a Large Portfolio (Multi- and Univariate Approaches).” Acta Physica Polonica A 121, (2-B): B-101-B-109.10.12693/APhysPolA.121.B-101Suche in Google Scholar

Primiceri, Giorgio E. 2005. “Time Varying Structural Vector Autoregressions and Monetary Policy.” Review of Economic Studies 72: 821–852.10.1111/j.1467-937X.2005.00353.xSuche in Google Scholar

Quintana, Jose M., and Michael West. 1987. “An analysis of international exchange rates using multivariate DLMs.” The Statistician 36: 275–281.10.2307/2348524Suche in Google Scholar

Seo, Byeongseon. 2007. “Asymptotic Distribution of the Cointegrating Vector Estimator in Error Correction Models with Conditional Heteroskedasticity.” Journal of Econometrics 137: 68–111.10.1016/j.jeconom.2006.03.008Suche in Google Scholar

Sims, Christopher A., and Tao Zha. 2006. “Were there Regime Switches in Macroeconomic Policy?” American Economic Review 96: 54–81.10.1257/000282806776157678Suche in Google Scholar

Villani, Mattias. 2006. “Bayesian Point Estimation of the Cointegration Space” Journal of Econometrics 134: 645–664.10.1016/j.jeconom.2005.07.008Suche in Google Scholar


Supplemental Material:

The online version of this article (DOI: https://doi.org/10.1515/snde-2016-0004) offers supplementary material, available to authorized users.


Published Online: 2017-4-6

©2017 Walter de Gruyter GmbH, Berlin/Boston

Heruntergeladen am 27.10.2025 von https://www.degruyterbrill.com/document/doi/10.1515/snde-2016-0004/html
Button zum nach oben scrollen