+ All Categories
Home > Documents > Forecasting With Spatial Data Panel

Forecasting With Spatial Data Panel

Date post: 14-Apr-2018
Category:
Upload: umbaranwisnu
View: 225 times
Download: 0 times
Share this document with a friend
37
   D    I    S    C    U    S    S    I    O    N     P    A    P    E    R     S    E    R    I    E    S Forschungsinstitut zur Zukunft der Arbeit Institute for the Study of Labor Forecasting with Spatial Panel Data IZA DP No. 4242 June 2009 Badi H. Baltagi Georges Bresson Alain Pirotte
Transcript
Page 1: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 1/37

D

I S

C

U S

S

I O

N

P

A

P

E

R

S

E

R

I E

S

Forecasting with Spatial Panel Data

IZA DP No. 4242

June 2009

Badi H. Baltagi

Georges Bresson

Alain Pirotte

Page 2: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 2/37

Forecasting with Spatial Panel Data

Badi H. BaltagiSyracuse University

and IZA

Georges BressonERMES (CNRS), Université Panthéon-Assas Paris II

Alain PirotteERMES (CNRS), Université Panthéon-Assas Paris II

and INRETS-DEST

Discussion Paper No. 4242 J une 2009

IZA

P.O. Box 724053072 BonnGermany

Phone: +49-228-3894-0Fax: +49-228-3894-180

E-mail: [email protected]

Any opinions expressed here are those of the author(s) and not those of IZA Research published in

Page 3: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 3/37

IZA Discussion Paper No. 4242 J une 2009

ABSTRACT

Forecasting with Spatial Panel Data*

This paper compares various forecasts using panel data with spatial error correlation. Thetrue data generating process is assumed to be a simple error component regression modelwith spatial remainder disturbances of the autoregressive or moving average type. The bestlinear unbiased predictor is compared with other forecasts ignoring spatial correlation, orignoring heterogeneity due to the individual effects, using Monte Carlo experiments. Inaddition, we check the performance of these forecasts under misspecification of the spatialerror process, various spatial weight matrices, and heterogeneous rather than homogeneouspanel data models.

J EL Classification: C33

Keywords: forecasting, BLUP, panel data, spatial dependence, heterogeneity

Corresponding author:

Badi H. BaltagiDepartment of Economics and Center for Policy Research426 Eggers HallSyracuse UniversitySyracuse, NY 13244-1020USA

Page 4: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 4/37

1 Introduction

The literature on forecasting is rich with time series applications, but thisis not the case for spatial panel data applications. Exceptions are Baltagiand Li (2004, 2006) with applications to forecasting sales of cigarettes and

liquor per capita for U.S. states over time.

1

Best linear unbiased prediction(BLUP) in panel data using an error component model have been consid-ered by Taub (1979), Baltagi and Li (1992), and Baillie and Baltagi (1999)to mention a few. Applications include Baltagi and Griffin (1997), Hsiaoand Tahmiscioglu (1997), Schmalensee, Stoker and Judson (1998), Baltagi,Griffin and Xiong (2000), Hoogstrate, Palm and Pfann (2000), Baltagi, Bres-son and Pirotte (2002, 2004), Frees and Miller (2004), Rapach and Wohar(2004), and Brucker and Siliverstovs (2006), see Baltagi (2008) for a recentsurvey. However, these panel forecasting applications do not deal with spatialdependence across the panel units. Spatial dependence models – popularin regional science and urban economics – deal with spatial interaction andspatial heterogeneity (see Anselin (1988) and Anselin and Bera (1998)). Thestructure of the dependence can be related to location and distance, both in ageographic space as well as a more general economic or social network space.Some commonly used spatial error processes include the spatial autoregres-

sive (SAR) and the spatial moving average (SMA) error processes. Twodiff erent variants of these models for spatial panels are considered, one dis-cussed in Anselin (1988) and another in Kapoor, Kelejian and Prucha (2007)and Fingleton (2007). The best linear unbiased predictors for the Anselintype model was derived by Baltagi and Li (2004). This paper derives the bestlinear unbiased predictors for the Kapoor, Kelejian and Prucha (2007) andFingleton (2007) variants. More importantly, it compares the performance of

sixteen various forecasts of the spatial panel data using Monte Carlo exper-iments. These include homogeneous as well as heterogeneous estimators of the spatial panel model and their corresponding forecasts. The true data gen-erating process is assumed to be a simple error component regression modelwith spatial remainder disturbances of the autoregressive or moving average

Page 5: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 5/37

type. The best linear unbiased predictor is compared with other forecastsignoring spatial correlation, or ignoring heterogeneity due to the individualeff ects. In addition, we check the performance of these forecasts under mis-specification of the spatial error process, diff erent spatial weight matrices,and various sample sizes. Section 2 introduces the error component model

with spatially autocorrelated residuals of the SAR and SMA type. Section3 describes the forecasts using the estimators considered in Section 2, whileSection 4 gives the Monte Carlo design. Section 5 reports the results of theMonte Carlo simulations and Section 6 gives our summary and conclusion.

2 The Error Component Model with Spatially

Autocorrelated ResidualsConsider a linear panel data regression model:

yit = X itβ + εit , i = 1,...,N ; t = 1,...,T (1)

where the disturbance term follows an error component model with spatiallyautocorrelated residuals. The disturbance vector for time t is given by:

εt = µ + φt (2)

where εt = (ε1t,...,εNt)0, µ = (µ1,...,µN )

0 denotes the vector of specific ef-fects assumed to be iid

¡0,σ2µ

¢and φt = (φ1t,...,φNt)

0 are the remainderdisturbances which are independent of µ. We let the φt’s follow a spatialautoregressive (SAR) or a spatial moving average (SMA) error model. TheSAR process is known to transmit the shocks globally while the SMA processtransmits these shocks locally, see Anselin, Le Gallo and Jayet (2008).The SAR specification for the (N × 1) error vector φt at time t can be ex-pressed as:

φt = ρW N φt + vt = (I N − ρW N )−1 vt = B−1

N vt (3)

where WN is an (N ×N) known spatial weights matrix2 ρ is the spatial au-

Page 6: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 6/37

tributed independently across cross-sectional dimension with constant vari-ance σ2vI N . BN = (I N − ρW N ) and is assumed to be non-singular. The errorcovariance matrix for the cross-section at time t becomes:

Ωt = E [εtε0

t] = σ2µI N + σ2v (B0

N BN )−1

(4)

For the full (NT × 1) vector of disturbances:

ε = (ιT ⊗ I N ) µ +¡

I T ⊗B−1N

¢v (5)

the corresponding (NT ×NT ) covariance matrix is given by:

Ω = σ2µ (J T ⊗ I N ) + σ2v hI T ⊗ (B0

N BN )−1

i (6)

where ιT is a (T × 1) vector of ones and J T = ιT ι0

T is a (T × T ) matrix of ones.The spatial moving average (SMA) specification for the (N × 1) error vectorφt at time t can be expressed as:

φt = λW N vt + vt = (I N + λW N ) vt = DN vt (7)

where DN = (I N + λW N ) . The error covariance matrix for the cross-sectionat time t becomes:

Ωt = E [εtε0

t] = σ2µI N + σ2v (DN D0

N ) (8)

For the full (NT × 1) vector of disturbances:

ε = (ιT ⊗

I N ) µ + (I T ⊗

DN ) v (9)

the corresponding (NT ×NT ) covariance matrix is given by:

Ω = σ2µ (J T ⊗ I N ) + σ2v [I T ⊗ (DN D0

N )] (10)

MLE d lit f th di t b i th t d

Page 7: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 7/37

where

ε = y −X β , Ω = σ2vΣ

Σ =

½(J T ⊗ θI N ) +

£I T ⊗ (B0

N BN )−1¤ for SAR

(J T ⊗ θI N ) + [I T ⊗ (DN D0

N )] for SMA(12)

with θ = σ2µ/σ2v.

Regression models containing spatially correlated disturbance terms basedon the SAR or SMA models are typically estimated using MLE, where thelikelihood function corresponds to the normal distribution. However, this canbe computationally demanding for large N . Kelejian and Prucha (1999) sug-gested a generalized moments (GM) estimation method for the SAR modelin a cross-section setting, and Fingleton (2007) extended this generalizedmoments estimator to the SMA model. Kapoor, Kelejian and Prucha (2007)generalized this GM procedure from cross-section to panel data and derivedits large sample properties when T is fixed and N →∞. However, their SARrandom eff ects model (SAR-RE) diff ers from that described in (2) which wewill call (RE-SAR). In fact, in their specification, the disturbance term εtitself follows a SAR process and the remainder term follows an error compo-nent structure. This allows the individual eff ects, i.e., the µ’s themselves to

be spatially correlated but with the same ρ. In particular, the disturbancevector for time t is given by:

εt = ρW N εt + ut (13)

where ut follows an error component structure :

ut = µ + vt (14)

The SAR-RE specification for the (N × 1) error vector εt at time t can beexpressed as:

εt = (I N − ρW N )−1 ut = B−1

N ut (15)

where BN = (I N − ρW N ) . For the full (NT × 1) vector of disturbances:

Page 8: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 8/37

Kapoor, et al. (2007) proposed three generalized moments (GM) estimatorsof ρ, σ2v and σ21

¡= σ2v + T σ2µ

¢based on the following six moment conditions:

E

1N (T −1)

u0

N Q0,N uN 1

N (T −1)u

0

N Q0,N uN 1

N (T −1)u

0

N Q0,N uN 1N

u0

N Q1,N uN 1N

u0

N Q1,N uN 1N

u0

N Q1,N uN

=

σ2vσ2v

1N

tr ¡W 0

N W N ¢0σ21

σ211N

tr¡

W 0

N W N ¢

0

(18)

where

uN = εN − ρεN (19)

uN = εN − ρεN (20)

εN = (I T ⊗W N ) εN (21)

εN = (I T ⊗W N ) εN (22)

Q0,N =

µI T −

J T T

¶⊗ I N (23)

Q1,N =

J T

T ⊗ I N (24)

Under the random eff ects specification considered, the OLS estimator of β isconsistent. Using bβ OLS one gets a consistent estimator of the disturbances bε = y − X bβ OLS . The GM estimators of σ21, σ2

ν and ρ are the solution of

the sample counterpart of the six equations given above. Kapoor, et al.(2007) suggest three GM estimators. The first involves only the first three

moments which do not involve σ

2

1 and yield estimates of ρ and σ

2ν . The fourthmoment condition is then used to solve for σ21 given estimates of ρ and σ2

ν . The

second GM estimator is based upon weighing the moment equations by theinverse of a properly normalized variance-covariance matrix of the samplemoments evaluated at the true parameter values. A simple version of thisweighting matrix is derived under normality of the disturbances. The third

Page 9: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 9/37

similar. The feasible GLS estimator of β is then obtained by replacing ρ, σ2vand σ21 by their GM estimators.3

Recently, Fingleton (2007) extended this GM estimator for the SMA paneldata model with random eff ects. We call this SMA-RE to distinguish itfrom the RE-SMA procedure described in Anselin, et al. (2008). In fact,

for the Fingleton (2007) SMA-RE, the disturbance term εt in (2) follows aSMA process and the remainder term follows an error component structure.Unlike the Anselin, et al. (2008) RE-SMA, the individual eff ects, i.e., theµ’s themselves are allowed to be spatially correlated but with the same λ. Inparticular, the disturbance vector for time t is given by:

εt = (I N + λW N ) ut = DN ut (25)

where DN = (I N + λW N ), and ut follows an error component structure (14).So, the full SMA-RE (NT × 1) vector of disturbances is given by:

ε = (ιT ⊗DN ) µ + (I T ⊗DN ) v (26)

and the corresponding (NT ×NT ) covariance matrix is given by:

Ω = σ2µ (J T ⊗ (DN D0

N )) + σ2v [I T ⊗ (DN D0

N )] (27)

The moment conditions for SMA-RE are similar to those derived by Kapoor,et al. (2007), see Fingleton (2007).

3 Prediction

Goldberger (1962) has shown that, for a given Ω, the best linear unbiased

predictor (BLUP) for the ith individual at a future period T + τ is given by:

byi,T +τ = X i,T +τ

bβ GLS + ω0Ω−1 bεGLS (28)

where ω = E [εi,T +τ ε] is the covariance between the future disturbance εi,T +τ

and the sample disturbances ε β is the GLS estimator of β from equation

Page 10: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 10/37

For the error component without spatial autocorrelation (λ = 0), this BLUPreduces to: byi,T +τ

= X i,T +τ

bβ GLS + σ2µ

σ21(ι0T ⊗ l0

i) bεGLS (29)

where σ21 = T σ2µ + σ2v and li is the ith column of I N . This predictor was

considered by Wansbeek and Kapteyn (1978), Lee and Griffiths (1979) andTaub (1979). The typical element of the last term of equation (29) is¡

T σ2µ/σ21¢εi.,GLS where εi.,GLS =

PT t=1 bεti,GLS /T . Therefore, the BLUP of

yi,T +τ for the RE model modifies the usual GLS forecasts by adding a frac-tion of the mean of the GLS residuals corresponding to the ith individual.In order to make this forecast operational,

bβ GLS is replaced by its feasible

GLS estimate and the variance components are replaced by their feasible

estimates.Baltagi and Li (2004, 2006) derived the BLUP correction term whenboth error components and spatial autocorrelation are present and φt followsa SAR process. So, the predictors for the SAR and the SMA are given by:

byi,T +τ =

X i,T +τ

bβ MLE + θ

¡ι0T ⊗ l0

iC −11

¢ bεMLE

= X i,T +τ bβ MLE

+ T θN

P j=1 c1,jε j.,MLE for SAR

X i,T +τ

bβ MLE + θ¡ι0T ⊗ l0

iC −12

¢ bεMLE

= X i,T +τ

bβ MLE + T θN P

j=1

c2,jε j.,MLE for SMA

(30)

where c1 j (resp. c2,j) is the jth element of the ith row of C −11 (resp. C −12 ) withC 1 =

£T θI N + (B0

N BN )−1

¤(resp. C 2 = [T θI N + (DN D

0

N )]) and ε j.,MLE =

PT t=1 bεtj,MLE /T . In other words, the BLUP of yi,T +τ adds to X i,T +τ bβ MLE

a weighted average of the MLE residuals for the N individuals averagedover time. The weights depend upon the spatial matrix W N and the spa-tial autoregressive (or moving average) coefficients ρ and λ. To make thesepredictors operational, we replace θ, ρ and λ by their estimates from theRE spatial MLE with SAR or SMA When there are no random individual

Page 11: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 11/37

For the Kapoor, et al. (2007) model, the BLUP of yi,T +τ for the SAR-REalso modifies the usual GLS forecasts by adding a fraction of the mean of the GLS residuals corresponding to the ith individual. More specifically, thepredictor is given by:

byi,T +τ = X i,T +τ bβ GLS + µσ2µ

σ21¶ bi (ι0

T ⊗BN ) bεGLS (31)

where bi is the ith row of the matrix B−1N . This is derived in the Appendix

of this paper which also shows the resulting predictor has the same form asthat of the RE model (29). This proof applies to both the Kapoor, et al.(2007) SAR-RE specification and the Fingleton (2007) SMA-RE specifica-tion. Therefore, the BLUP of yi,T +τ for the SAR-RE and the SMA-RE, like

the usual RE model with no spatial eff

ects, modifi

es the usual GLS forecastsby adding a fraction of the mean of the GLS residuals corresponding to theith individual. While the predictor formula is the same, the MLEs for thesespecifications yield diff erent estimates which in turn yield diff erent residualsand hence diff erent forecasts.

4 Monte Carlo Design

In this section, we consider the small sample performance of several predictorsfor an error component model with spatially autocorrelated residuals. Thedata generating process (DGP) consider two specifications on the remaindererrors, namely SAR and SMA:

yit = β 0 + β 1xit + εit , εit = µi + φit, i = 1,...,N ; t = 1,...,T (32)

where4

xit = δ i + ξ it

with

µi ∼ iid.N ¡

0,σ2µ¢

, δ i ∼ iid.U (−7.5, 7.5) ,

ξ d U ( ) β β

Page 12: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 12/37

φt =

½ρW N φt + vt for SARλW N vt + vt for SMA

with ρ,λ =

½0.80.4

(33)

andvit ∼ iid.N

¡0,σ2v

¢(34)

We consider the simple regressions (32) and (33) with N = (50, 100), T =(10, 20) and two cases for the residuals variances:½

σ2µ = 4, σ2v = 16σ2µ = 16, σ2v = 4

(35)

Following Kelejian and Prucha (1999), we use two weight matrices which es-sentially diff er in their degree of sparseness. The weight matrices are labelled

as “ j ahead and j behind” with the non-zero elements being 1/2 j, j = 1 and5. Even with this modest design we have 64 experiments.

For each experiment, we obtain the following 16 estimators:

1. Pooled OLS which ignores the individual heterogeneity and the spatialautocorrelation.

2. The average heterogeneous OLS which estimates the cross-sectionalequation using OLS for each time period and averages these heteroge-neous estimates to obtain a pooled estimator, see Pesaran and Smith(1995).

3. The fixed-eff ects (FE) estimator which accounts for fixed individualeff ects but does not take into account the spatial autocorrelation.

4. The random eff ects (RE) estimator which asssumes that the µi’s areiid(0,σ2µ), and independent of the remainder disturbances φit’s. Thisestimator accounts for random individual eff ects but does not take intoaccount the spatial autocorrelation.

5 The RE spatial MLE assuming a SAR specification (RE SAR) on the

Page 13: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 13/37

6. The RE-spatial MLE assuming a SMA specification (RE-SMA) on theremainder disturbances. In this case, the µi’s are iid(0,σ2µ) and areindependent of the φit’s which follow a SMA process, see Anselin, etal. (2008).

7. The pooled spatial MLE assuming a SAR specification (Pooled SAR)

on the remainder disturbances. This estimator ignores the individualheterogeneity but takes into account the spatial autocorrelation of theSAR type.

8. The pooled spatial MLE assuming a SMA specification (Pooled SMA)on the remainder disturbances. This estimator ignores the individualheterogeneity but takes into account the spatial autocorrelation of the

SMA type.

9. The average heterogeneous spatial MLE assuming a SAR specificationon the remainder disturbances. This estimates cross-sectional MLEwith SAR disturbances for each time period and averages the estimatesover time.

10. The average heterogeneous spatial GM estimator assuming a SAR

specification on the remainder disturbances proposed by Kelejian andPrucha (1999). This estimates cross-sectional GM estimator with SARdisturbances for each time period and averages the estimates over time.

11. The average heterogeneous spatial MLE assuming a SMA specificationon the remainder disturbances. This estimates cross-sectional MLEwith SMA disturbances for each time period and averages the estimatesover time.

12. The average heterogeneous spatial GM estimator assuming a SMAspecification on the remainder disturbances proposed by Fingleton (2007).This estimates cross-sectional GM estimator with SMA disturbances foreach time period and averages the estimates over time.

Page 14: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 14/37

15. The (SAR-RE) model following Kapoor, et al. (2007). This utilizesa panel data GM estimator where the disturbance term itself followsa SAR process and the remainder term follows an error componentstructure.

16. The (SMA-RE) model following Fingleton (2007). This utilizes a panel

data GM estimator where the disturbance term itself follows a SMAprocess and the remainder term follows an error component structure.

Next, we compute the following predictors for the ith individual at a fu-ture period T + τ for τ = 1, 2,..., 5:

OLS

byi,T +τ = X i,T +τ

bβ OLS

Average hetero. OLS byi,T +τ = X i,T +τ bβ av.OLS FE5

( byi,T +τ = X i,T +τ

bβ FE + bµi

with bµi = yi −X i bβ FE , yi =PT

t=1 yit/T

RE byi,T +τ = X i,T +τ

bβ RE + σ2µ

σ2

1

(ι0T ⊗ l0

i) bεRE RE-SAR

½byi,T +τ

= X i,T +τ

bβ MLE,RE −SAR + θ

¡ι0T ⊗ l0

iC −11

¢ bεMLE,RE −SAR

with C 1 =

£T θI N + (B0

N BN )−1

¤and θ = σ2µ/σ2v

RE-SMA ½ byi,T +τ = X i,T +τ bβ MLE,RE −SMA + θ ¡ι0T ⊗ l

0

iC −1

2 ¢ bεMLE,RE −SMA

with C 2 = [T θI N + (DN D0

N )] and θ = σ2µ/σ2vPooled SAR byi,T +τ

= X i,T +τ

bβ MLE,SAR

Pooled SMA byi,T +τ = X i,T +τ

bβ MLE,SMA

Average hetero. SAR

byi,T +τ =

(X i,T +τ

bβ av.MLE,SAR

X i,T +τ

bβ av.GM,SAR

Average hetero. SMA byi,T +τ = ( X i,T +τ bβ av.MLE,SMA

X i,T +τ

bβ av.GM,SMA

FE-SAR

( byi,T +τ = X i,T +τ

bβ MLE,FE −SAR + bµi

with bµi = yi −X i bβ MLE,FE −SAR , yi =PT

t=1 yit/T

FE SMA

(yi T+τ = X i T+τ

bβ MLE FE SMA + µi

Page 15: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 15/37

For all experiments, 1000 replications are performed and the RMSE forone step to five step ahead forecasts are reported.

5 Monte Carlo Results

5.1 The Spatial Dependence Specification Eff ect

Table 1 gives the RMSE for the one year, two year,..., and five year aheadforecasts along with the average RMSE for all 5 years. These are out of sam-ple forecasts when the true DGP is a RE panel model with SAR remainder

disturbances. The sample size is N = 50 and T = 10, the weight matrix isW(1,1), i.e., one neighbor behind and one neighbor ahead. In general, forρ = 0.4, 0.8 and σ2µ = 4, 16, the lowest RMSE is that of RE-SAR. This is fol-lowed closely by SAR-RE and SMA-RE. It confirms the findings of Kapoor,et al. (2007) that, on average, RMSE of MLE and their GM estimators arequite similar. It also seems like misspecifying the SAR by an SMA in an errorcomponent model does not aff ect the forecast performance as long as it is

taken into account. As the spatial autoregressive parameter ρ doubles from0.4 to 0.8, the RMSE also doubles. The RMSE improves as σ2µ gets large,i.e., 16 rather than 4, for estimators that take heterogeneity into account.Pooled OLS, average heterogeneous OLS, pooled SAR, pooled SMA, averageheterogeneous SAR (MLE and GM) and average heterogeneous SMA (MLEand GM) perform worse in terms of RMSE than spatial/panel homogeneousestimators. This forecast comparison is robust whether we are predicting

one period, two periods or 5 periods ahead and is also refl

ected in the av-erage over the five years. The gain in forecast performance is substantialonce we account for RE or FE and is only slightly improved by additionallyaccounting for spatial autocorrelation, i.e., FE-SAR or RE-SAR, FE-SMA,or RE-SMA.

Table 2 gives the RMSE results when the true DGP is a RE panel model

Page 16: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 16/37

Misspecifying the SMA by an SAR in an error component model does notseem to aff ect the forecast performance as long as it is taken into account.However, the magnitudes of the RMSE in Table 2 (where the true DGP isa RE-SMA process) are much lower than those in Table 1 (where the trueDGP is a RE-SAR process). Once again, the forecast RMSE of based on

MLE and their GM counterparts are quite similar, compare SAR-RE andSMA-RE with RE-SAR and RE-SMA. The RMSE improves as σ2µ gets large,i.e., 16 rather than 4, for estimators that take heterogeneity into account. Asthe spatial autoregressive parameter λ increases from 0.4 to 0.8, the RMSEalso increases but not as much as it did for the SAR process in Table 1.Pooled OLS, average heterogeneous OLS, pooled SAR, pooled SMA, averageheterogeneous SAR (MLE and GM) and average heterogeneous SMA (MLEand GM) perform worse in terms of RMSE than spatial/panel homogeneous

estimators. This forecast performance is robust whether we are predictingone period, two periods or 5 periods ahead and is also reflected in the averageover thefive years. Once again, the gain in forecast performance is substantialonce we account for RE or FE and is only slightly improved by additionallyaccounting for spatial autocorrelation, i.e., FE-SMA, or RE-SMA, FE-SARor RE-SAR.

5.2 Sensitivity Analysis

5.2.1 The Spatial Weight Matrix eff ect

Tables 3 and 4 report the RMSE results as Tables 1 and 2 except that theweight matrix is changed from a W (1, 1) to W (5, 5) , i.e., five neighborsbehind and five neighbors ahead. Except for the magnitudes of the RMSE,the same rankings in terms of RMSE performance are exhibited as before.

Tables 5 and 6 report the RMSE results as Tables 1 and 2 except that T isnow doubled from 10 to 20 holding N fixed at 50. Except for the magnitudesof the RMSE, the same rankings in terms of RMSE performance are exhibitedas before.

Table 7 reports the RMSE results when ρ = λ = 0.8, the weight matrix

Page 17: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 17/37

5.2.2 Sensitivity to Irregular Lattice Structures

The spatial weights matrices considered in the paper are regular lattice struc-tures. Using real irregular lattices structures, as in Anselin and Moreno(2003) and in Kelejian and Prucha (1999), does not change the conclusionsof the Monte Carlo study. We used real-world matrices by taking spatial

groupings of French administrative communes for dimension N = 50.7 Thosespatial matrices have been used by Baltagi, Bresson and Pirotte (2007). Spa-tial weight matrices may represent high-order contiguity relationships. Weuse a k-order contiguity matrix containing N − 1 potential neighborhoodsin French municipalities. We have patterns of 0 and 1 values in an (N − 1)by (N − 1) grid for the k-nearest neighborhoods and we use the 1-nearestneighborhood (k = 1) and the 5-nearest neighborhoods (k = 1)8. Results of

Tables 9 to 12 are very similar to those of Tables 1 to 4. Using irregularlattice structures do not change the main conclusions in terms of the RMSEforecast performance of the various estimators considered. These are similarto the rankings obtained when regular lattice structures are used, only themagnitudes of the RMSE diff er.

5.2.3 Robustness to Non-Normality

So far, we have been assuming that the error components have been generatedby the normal distribution. In this section, we check the sensitivity of ourresults to non-normal disturbances. In particular, we generate the µi’s froma χ2 distribution and we let the remainder disturbances follow the normaldistribution. Tables 13 and 14 give similar results as those of Tables 1 and2 (when the individual eff ects follow a normal distribution). So, the resultsseem to be robust to non-normality of the disturbances of the χ2 type.

RMSE forecast performance and are not shown here to save space. These are availableupon request from the authors.

7Other Tables for N = 100 are available upon request from the authors.8Note that a non-zero entry in row i, column j denotes that neighborhoods i and j have

borders that touch and are therefore considered “neighbors”. For N = 50 and for k = 5,and for the 2401 possible elements in the 49 by 49 matrix there are only 250 non zero

Page 18: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 18/37

6 Summary and Conclusion

Our Monte Carlo study finds that when the true DGP is RE with a SAR orSMA remainder disturbances, estimators that ignore heterogeneity/spatialcorrelation perform badly in RMSE forecasts. For our experiments, account-ing for heterogeneity improves the forecast performance by a big margin andaccounting for spatial correlation improves the forecast but by a smaller mar-gin. Ignoring both leads to the worst forecasting performance. Heterogeneousestimators based on averaging perform worse than homogeneous estimatorsin forecasting performance. This performance improves with a larger sam-ple size and seems robust to the type of spatial error structure imposed onthe remainder disturbances. These Monte Carlo experiments confirm earlierempirical studies that report similar findings.

7 Appendix

This appendix first derives the BLUP for the KKP model which we are callingthe (SAR-RE) model described in (13) and (14). The variance-covariancematrix Ω is given in (17). The inverse of Ω is given by:

Ω−1 = 1

σ2v

·µI T − T σ2µσ21

J T ¶⊗ (B0

N BN )¸where J T = J T /T and σ21 = T σ2µ + σ2v and BN = (I N − ρW N ). From (13)and (14), we have :

εT +τ = B−1N uT +τ = B−1

N (µ + vT +τ )

so that,

E hεT +τ ε

0

i= E

hB−1N (µ + vT +τ )

¡¡ιT ⊗B−1

N

¢µ +

¡I T ⊗B−1

N

¢v¢

0

i= σ2µB−1

N

³ι0

T ⊗B−10

N

´

Page 19: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 19/37

=σ2µ

σ2vbi

·³ι0

T ⊗BN −T σ2µσ21

³ι0

T ⊗BN

¸=

σ2µ

σ21bi

³ι0

T ⊗BN

But bi ¡ι0T ⊗BN ¢ = (1⊗ bi) ¡ι0T ⊗BN ¢ = ¡ι0T ⊗ l0

i¢, where l0

i is the ith row of

I N . This holds because B−1

N BN = I N and therefore biBN = l0

i. This meansthat the predictor of the KKP model from (28) is given by:

byi,T +τ = X i,T +τ

bβ GLS + σ2µ

σ21(ι0T ⊗ l0

i) bεGLS (36)

which is the same as that of the RE model with no spatial correlation. Whilethe predictor formula is the same, the MLEs for these specifications yield

diff erent estimates which in turn yield diff erent residuals and hence diff erentforecasts.

The proof is the similar for the Fingleton (2007) specification which weare calling the (SMA-RE) model described in (25) and (14). The variance-covariance matrix Ω is given in (27). The inverse of Ω is given by:

Ω−1 =

1

σ2v ·µI T −

T σ2µσ21

J T ¶⊗ (DN D

0

N )−1

¸where DN = (I N + λW N ). From (25) and (14), we have :

εT +τ = DN uT +τ = DN (µ + vT +τ )

so that,

E [εT +τ ε0] = E

£DN (µ + vT +τ ) ((ιT ⊗DN ) µ + (I T ⊗DN ) v)0

¤= σ2µDN ³ι

0

T ⊗D0

N ´ω

0

= E [εi,T +τ ε0] = σ2µdi

³ι0

T ⊗D0

N

where di is the ith row of the matrix DN . In this case,

ω0Ω−1 =

σ2µd³ι0

⊗D0

´·µI

T σ2µJ

¶⊗ (D D0 )

−1

¸

Page 20: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 20/37

But di¡ι0

T ⊗D−1N

¢= (1⊗di)

¡ι0

T ⊗D−1N

¢=¡ι0

T ⊗ l0

i

¢, where l0

i is the ith row of I N . This holds because DN D

−1N = I N and therefore diD

−1N = l0

i. This meansthat the predictor of the Fingleton (2007) model is again the same as thatof the RE model with no spatial correlation. While the predictor formula isthe same, the MLEs for these specifications yield diff erent estimates whichin turn yield diff erent residuals and hence diff erent forecasts.

Page 21: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 21/37

References

Anselin, L., 1988, Spatial Econometrics: Methods and Models, Kluwer Academic Pub-

lishers, Dordrecht.

Anselin, L. and A.K. Bera, 1998, Spatial dependence in linear regression models with anintroduction to spatial econometrics. In A. Ullah and D.E.A. Giles, eds., Handbook

of Applied Economic Statistics, Marcel Dekker, New York.

Anselin, L. and R. Moreno, 2003, Properties of tests for spatial error components, Re-

gional Science and Urban Economics 33, 595-618.

Anselin, L., J. Le Gallo and H. Jayet, 2008, Spatial panel econometrics. Ch. 19 in L.

Mátyás and P. Sevestre, eds., The Econometrics of Panel Data: Fundamentals and

Recent Developments in Theory and Practice, Springer-Verlag, Berlin, 625-660.

Baillie, R.T. and B.H. Baltagi, 1999, Prediction from the regression model with one-way

error components, Chapter 10 in C. Hsiao, K. Lahiri, L.F. Lee and H. Pesaran, eds.,

Analysis of Panels and Limited Dependent Variable Models, Cambridge University

Press, Cambridge, 255—267.

Baltagi, B.H., 2008, Forecasting with panel data, Journal of Forecasting 27, 153-173..

Baltagi, B.H. and J.M. Griffin, 1997, Pooled estimators vs. their heterogeneous counter-

parts in the context of dynamic demand for gasoline, Journal of Econometrics 77,

303—327.

Baltagi, B.H. and D. Li, 2004, Prediction in the panel data model with spatial correlation,

Chapter 13 in L. Anselin, R.J.G.M. Florax and S.J. Rey, eds., Advances in Spatial

Econometrics: Methodology, Tools and Applications, Springer, Berlin, 283—295.

Baltagi, B.H. and D. Li, 2006, Prediction in the panel data model with spatial correlation:

The case of liquor, Spatial Economic Analysis 1, 175-185.

B l i B H d Q Li 1992 P di i i h d l i h

Page 22: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 22/37

Baltagi, B.H., G. Bresson and A. Pirotte, 2004, Tobin q: forecast performance for hier-

archical Bayes, shrinkage, heterogeneous and homogeneous panel data estimators,

Empirical Economics 29, 107-113.

Baltagi, B.H., G. Bresson and A. Pirotte, 2007, Panel unit root tests and spatial depen-

dence, Journal of Applied Econometrics 22, 339-360.

Baltagi, B.H., J.M. Griffin and W. Xiong, 2000, To pool or not to pool: Homogeneous

versus heterogeneous estimators applied to cigarette demand, Review of Economics

and Statistics 82, 117—126.

Brucker, H. and B. Siliverstovs, 2006, On the estimation and forecasting of international

migration: how relevant is heterogeneity across countries, Empirical Economics 31,

735-754.

Fingleton, B., 2007a, A generalized method of moments estimator for a spatial model with

endogenous spatial lag and spatial moving average errors, paper presented at the

13th international conference on panel data, University of Cambridge, forthcoming

Spatial Economic Analysis.

Fingleton, B., 2007b, A generalized method of moments estimator for a spatial model

with moving average errors with application to real estate prices, forthcoming in

Empirical Economics.

Frees, E.W. and T.W. Miller, 2004, Sales forecasting using longitudinal data models.

International Journal of Forecasting 20, 99—114.

Goldberger, A.S., 1962, Best linear unbiased prediction in the generalized linear regres-

sion model, Journal of the American Statistical Association 57, 369—375.

Kapoor, M., H.H. Kelejian and I.R. Prucha, 2007, Panel data models with spatially

correlated error components, Journal of Econometrics 140, 97-130.

Kelejian, H.H. and I.R. Prucha, 1999, A generalized moments estimator for the autore-

gressive parameter in a spatial model, International Economic Review 40, 509-533.

Page 23: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 23/37

Hoogstrate, A.J., F.C. Palm and G.A. Pfann, 2000, Pooling in dynamic panel-data mod-

els: An application to forecasting GDP growth rates, Journal of Business and Eco-

nomic Statistics 18, 274-283.

Hsiao, C. and A.K. Tahmiscioglu, 1997, A panel analysis of liquidity constraints and firm

investment, Journal of the American Statistical Association 92, 455—465.

Nerlove, M., 1971, Futher evidence on the estimation of dynamic economic relations from

a time-series of cross-sections, Econometrica 39, 359-382.

Pesaran, M.H. and R. Smith, 1995, Estimating long-run relationships from dynamic

heterogenous panels, Journal of Econometrics 68, 79—113.

Rapach, D.E. and M.E. Wohar, 2004, Testing the monetary model of exchange rate

determination: a closer look at panels, Journal of International Money and Finance23, 867—895.

Schmalensee, R., T.M. Stoker and R.A. Judson, 1998, World carbon dioxide emissions:

1950-2050, Review of Economics and Statistics 80, 15—27.

Spanos, A., 2002, The ET interview: Professor Phoebus J. Dhrymes, Econometric Theory

18, 1221-1272.

Taub, A.J., 1979, Prediction in the context of the variance-components model, Journal

of Econometrics 10, 103—108.

Theil, H., 1961, Economic Forecasts and Policy, North-Holland, Amsterdam.

Wansbeek, T.J. and A. Kapteyn, 1978, The seperation of individual variation and sys-

tematic change in the analysis of panel data, Annales de l’INSEE 30-31, 659-680.

Page 24: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 24/37

Page 25: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 25/37

Page 26: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 26/37

Page 27: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 27/37

Page 28: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 28/37

Page 29: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 29/37

Page 30: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 30/37

Page 31: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 31/37

Page 32: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 32/37

Page 33: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 33/37

Page 34: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 34/37

Page 35: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 35/37

Page 36: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 36/37

Page 37: Forecasting With Spatial Data Panel

7/30/2019 Forecasting With Spatial Data Panel

http://slidepdf.com/reader/full/forecasting-with-spatial-data-panel 37/37


Recommended