Abstract
This paper is concerned with testing for the varying coefficient spatial autoregressive models. Based on the profile likelihood estimation procedure, a profile generalized likelihood ratio test procedure is proposed to test spatial lag effects, and a residual-based bootstrap procedure is used to derive the p-value of the test. Some simulations are conducted to assess the performance of the test and the results are satisfactory.
1 Introduction
In the last three decades, spatial autoregressive models as a popular spatial econometric method have received much attention in the literature. For a long time, however, its main theory has centered around parametric models in which the relationship between the response and the covariates is assumed to be linear. In recent years, many useful semiparametric spatial autoregressive models have been proposed to relax traditional parametric models. Su and Jin[1] proposed a partially linear spatial autoregressive model and studied the properties of the profile quasi-maximum likelihood estimators for the parameters in the model. Li and Mei[2] proposed a statistical test procedure to check a polynomial relationship of the non-parametric component in partially linear spatial autoregressive models. Su[3] studied a nonprametric spatial autoregressive model that the spatially lagged response variable enters the model linearly while the covariates enter the model nonparametrically.
Like parametric models, semiparametric models have various forms. An alternative approach to relax the conditions imposed on traditional parametric models and explore the hidden structure is varying coefficient models, which was introduced by Cleveland et al.[4] and popularized by Hastie and Tibshirani[5]. Varying coefficient model is a useful extension of linear regression model, by allowing all the regression coefficients to vary as unknown functions of other factors. Due to its flexibility, varying coefficient model has been studied in many different contexts and has been successfully applied to nonlinear time series analysis, longitudinal and functional data analysis, and time-varying models in finance, see Fan and Zhang[6] for a comprehensive survey. In this paper, we consider the following varying coefficient spatial autoregressive model by combining the spatial autoregressive model and the varying coefficient regression model,
where
For model (1), Li and Chen[12] proposed a profile likelihood approach based on the local linear method to estimate the unknown coefficient functions α(·) and the spatial lag parameter ρ. As we all know, an important question of spatial autoregressive model is to test the existence of the spatial effects. This leads to the following testing problem
For the standard linear spatial autoregressive models, Likelihood ratio test and Rao’s Score (Lagrange Multiplier ) test can be applied to solve this problem, details can be found in Anselin[8]. However, the test problem (2) is a semiparametric hypothesis versus another semiparametric hypothesis testing problem. Many traditional tests cannot be directly applied to the above hypothesis. For this kind of testing problem, Fan and Huang[13] proposed a profile generalized likelihood ratio (PGLR) test based on the partially linear varying coefficient model. Thus, we are motivated to extend the PGLR test procedure for the testing problem (2) of model (1).
The rest of this paper is organized as follows. In Section 2, the profile maximum likelihood method to fit the varying coefficient spatial autoregressive model is briefly described to facilitate our consequent discussions. In Section 3, the test statistic is proposed and the residual-based bootstrap procedure is suggested to derive the p-value of the test. Simulations are conducted in Section 4 to examine the finite sample performance of the proposed test procedure. Conclusion is presented in Section 5.
2 Profile Likelihood Estimation Procedure
For the need of constructing the test statistic, we first introduce the profile likelihood estimating approach of model (1) proposed by Li and Chen[12]. Let us work with the matrix notation. Denote
Then model (1) can be written as
Assume that ε ∼ N(0, σ2In), let θ = (ρ, σ2)T, the log-likelihood function of model (3) is
where In is the identity matrix of order n.
If the parameter ρ is known, then model (1) can be written as
where
This leads to the following weighted local least-squares problems: find
where K is a kernel function, h is a bandwidth and Kh(·) = K(·/h)/h.
Let
with 01×p is the 1 × p zero matrix, and
The solution of the problem (61) is given by
Then, we have
where 0p is the p × p zero matrix.
We take u0 to be each of U1, U2, · · ·, Un, then we can obtain the estimators of α̂(Uj), j = 1, 2, · · ·, p. Then we can define the estimator for M as
Replacing M of (4) by M̄, we obtain the following profile log-likelihood function
Given ρ, by differentiating log Ln(Y | θ) with respect to σ2, we obtain the following equation:
Then, the profile maximum likelihood estimator of σ2 can be obtained as
Furthermore, the concentrated log-likelihood function of ρ is
Maximizing log Ln(ρ) leads to the estimator ρ̂ of ρ. Then, we can define the final estimator of σ2 as σ̂2 = σ̂2(ρ̂). Substituting ρ̂ into (8), we can obtain the estimator of M as
Finally, the residual vector is
3 Profile Generalized Likelihood Ratio Test
3.1 Construction of the Test Statistic
On one hand, under the alternative hypothesis H1, we can obtain the profile log-likelihood as
On the other hand, if the null hypothesis H0 is true, model (1) reduces to the following standard varying coefficient model
As a special case of model (3), we can obtain the profile log-likelihood of model (16) as
with
Following Fan and Huang[13], we define the profile generalized likelihood ratio test statistic
Intuitively, if H0 is true, there should not be significant difference between RSS(H0) and RSS(H1). Otherwise, RSS(H0) – RSS(H1) will tend to take a large positive value as RSS(H0) should become systematically larger than RSS(H1) under H1. Hence, a large value of the test statistic Tn indicates that the null hypothesis should be rejected. If we denote the observed value of T by t, the p-value of the test is
where
3.2 Calculation of the p-Value by the Residual Based Bootstrap Approach
For the proposed test statistic T, it is difficult to obtain its asymptotic null distribution as the presence of the spatial lag term in model (1). To solve this problem, the bootstrap approach was suggested to obtain the p-value by many researchers (Cai et al.[14], Fan and Jiang[15], Hall and Hart[16], Herwartz and Xu[17]). In the following, we propose here a residual based bootstrap procedure to derive the p-value of the test.
Step 1 Based on the data set
Step 2 Generate the bootstrap residuals
Step 3 Calculate the bootstrap test statistic
Step 4 Repeat Steps 2 and 3 k times and obtain a bootstrap sample of the test statistic T as
where t is the observed value of the test statistic T obtained in Step 1 and ♯A denotes the number of the elements in set A.
For the sake of simplicity, we use the same bandwidth in calculating
4 Simulation Studies
In this section, we shall conduct some simulations to examine the performance of the proposed estimate procedure.
Assume that the observations are collected from a uniform, two-dimensional grid consisting of m × m lattice points with unit distance between any two neighboring points along the horizontal and vertical axes. These m2 points are arranged in an orthogonal coordinate system. We generate the spatial weight matrix W according to the principle of Rook contiguity.
The data are generated from the following varying coefficient spatial autoregressive model
where x1i ∼ U(–2, 2), x2i ∼ N(1, 1), ui ∼ U(0, 1), and
For each given value of ρ and each type of error distribution and bandwidth, 1000 replications with n = 102, 152 were run and the rejection rate at the significance level α = 0.05 was computed as the simulated power of our proposed test procedure. And for each replication, the p-value was computed based on m = 500 bootstrap samples. The results are shown in Table 1.
The Rejection frequencies for H0 : ρ = 0 at the significance level α = 0.05
| ρ | ε ∼ N(0, 1) | |||
|---|---|---|---|---|
| m = 10 | m = 15 | m = 10 | m = 15 | |
| –0.15 | 0.944 | 0.999 | 0.954 | 1.000 |
| –0.1 | 0.692 | 0.964 | 0.691 | 0.956 |
| –0.05 | 0.229 | 0.453 | 0.245 | 0.464 |
| 0 | 0.055 | 0.059 | 0.049 | 0.054 |
| 0.05 | 0.256 | 0.549 | 0.237 | 0.538 |
| 0.1 | 0.768 | 0.977 | 0.737 | 0.986 |
| 0.15 | 0.980 | 1.000 | 0.998 | 1.000 |
We summarize our findings as follows. When the null hypothesis is true (that is ρ = 0), the rejection frequencies (estimated sizes) of our proposed test are quite good and close to their nominal level 0.05 under different error distributions. Under the alternative hypothesis, the rejection rate seems very robust to the variation of the type of error distribution, and increases rapidly as the alternative hypothesis deviates from the null hypothesis.
5 Conclusion
In this paper, a test approach is proposed to check the existence of spatial effects in varying coefficient spatial autoregressive models, in which a residual-based bootstrap procedure is suggested to derive the p-value of the test. The simulation experiment demonstrates that the proposed test performs satisfactorily.
References
[1] Su L J, Jin S N. Profile quasi-maximum likelihood estimation of partially linear spatial autoregressive models. Journal of Econometrics, 2010, 157(1): 18–33.10.1016/j.jeconom.2009.10.033Search in Google Scholar
[2] Li T Z, Mei C L. Testing a polynomial relationship of the non-parametric component in partially linear spatial autoregressive models. Papers in Regional Science, 2013, 92(3): 633–649.10.1111/j.1435-5957.2012.00428.xSearch in Google Scholar
[3] Su L J. Semiparametric GMM estimation of spatial autoregressive models. Journal of Econometrics, 2012, 167(2): 543–560.10.1016/j.jeconom.2011.09.034Search in Google Scholar
[4] Cleveland W S, Grosse E, Shyu W M. Local regression models. In Statistical Models in S, Eds. by Chambers J M and Hastie T J. Wadsworth and Brooks, Pacific Grove, 1991, 309–376.10.1201/9780203738535-8Search in Google Scholar
[5] Hastie T, Tibshirani R. Varying-coefficient models (with discussion). Journal of the Royal Statistical Society Series B, 1993, 55(4): 757–796.Search in Google Scholar
[6] Fan J Q, Zhang W Y. Statistical methods with varying coefficient models. Statistics and its Interface, 2008, 1: 179–195.10.4310/SII.2008.v1.n1.a15Search in Google Scholar
[7] Cliff A D, Ord J K. Spatial Autocorrelation. London: Pion Ltd., 1973.Search in Google Scholar
[8] Anselin L. Spatial econometrics: Methods and models. Dordrecht: Kluwer Academic Publishers, 1988.10.1007/978-94-015-7799-1Search in Google Scholar
[9] Kelejian H H, Prucha I R. A generalized moments estimator for the autoregressive parameter in a spatial model. International Economic Review, 1999, 40(2): 509–533.10.1111/1468-2354.00027Search in Google Scholar
[10] Lee L F. Asymptotic distributions of quasi-maximum likelihood estimators for spatial autoregressive models. Econometrica, 2004, 72(6): 1899–1925.10.1111/j.1468-0262.2004.00558.xSearch in Google Scholar
[11] Lee L F. GMM and 2SLS estimation of mixed regressive spatial autoregressive models. Journal of Econometrics, 2007, 137(2): 489–514.10.1016/j.jeconom.2005.10.004Search in Google Scholar
[12] Li K M, Chen J B. Profile maximum likelihood estimation of semiparametric varying coefficient spatial lag model. The Journal of Quantitative Technical Economics, 2013, 4: 85-98 (in Chinese).Search in Google Scholar
[13] Fan J Q, Huang T. Profile likelihood inferences on semiparametric varying-coefficient partially linear models. Bernoulli, 2005, 11(6): 1031–1057.10.3150/bj/1137421639Search in Google Scholar
[14] Cai Z W, Fan J Q, Yao Q W. Functional-coefficient regression models for nonlinear times series. Journal of the American Statistical Association, 2000, 95(451): 941–956.10.1080/01621459.2000.10474284Search in Google Scholar
[15] Fan J Q, Jiang J C. Nonparametric inferences for additive models. Journal of the American Statistical Association, 2005, 100(471): 890–907.10.1198/016214504000001439Search in Google Scholar
[16] Hall P, Hart J D. Bootstrap test for difference between means in nonparametric regression. Journal of the American Statistical Association, 1990, 85(412): 1039–1049.10.1080/01621459.1990.10474974Search in Google Scholar
[17] Herwartz H, Xu F. A new approach to bootstrap inference in functional coefficient models. Computational Statistics and Data Analysis, 2009, 53(6): 2155–2167.10.1016/j.csda.2008.09.014Search in Google Scholar
© 2015 Walter de Gruyter GmbH, Berlin/Boston
Articles in the same Issue
- Domino Effect Analysis, Assessment and Prevention in Process Industries
- Carbon Emissions and Carbon Intensity in China’s Exports: A Contrast of SRIO and GIRIO Methods
- Game Analysis in a Dual Channels System with Different Power Structures and Service Provision
- Uniform Parallel Machine Scheduling Problem with Controllable Delivery Times
- Dealing with Interval DEA Based on Error Propagation and Entropy: A Case Study of Energy Efficiency of Regions in China Considering Environmental Factors
- A Data-Centric Approach for Model-Based Systems Engineering
- Testing for Spatial Lag Effects in Varying Coefficient Spatial Autoregressive Models
- A Generic Statistics-Based Tessellation Method of Voronoi Diagram
Articles in the same Issue
- Domino Effect Analysis, Assessment and Prevention in Process Industries
- Carbon Emissions and Carbon Intensity in China’s Exports: A Contrast of SRIO and GIRIO Methods
- Game Analysis in a Dual Channels System with Different Power Structures and Service Provision
- Uniform Parallel Machine Scheduling Problem with Controllable Delivery Times
- Dealing with Interval DEA Based on Error Propagation and Entropy: A Case Study of Energy Efficiency of Regions in China Considering Environmental Factors
- A Data-Centric Approach for Model-Based Systems Engineering
- Testing for Spatial Lag Effects in Varying Coefficient Spatial Autoregressive Models
- A Generic Statistics-Based Tessellation Method of Voronoi Diagram