+ All Categories
Home > Documents > Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 :...

Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 :...

Date post: 10-Aug-2020
Category:
Upload: others
View: 0 times
Download: 0 times
Share this document with a friend
32
1 Evaluating Density Forecasts with Applications to ESPF Kanemi Ban (Osaka University) Masaaki Kawagoe (ESRI) Hideaki Matsuoka (JCER) ESRI-JCER Conference Feb. 21, 2013
Transcript
Page 1: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

111

Evaluating Density Forecasts with Applications to ESPFKanemi Ban (Osaka University)Masaaki Kawagoe (ESRI)Hideaki Matsuoka (JCER)

ESRI-JCER Conference

Feb. 21, 2013

Page 2: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

222

1. Introduction

2. Methodology

3. Data

4. Results

5. Conclusion

Page 3: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

3

1. Introduction ESPF’s consensus forecasts are of great practical use,

… … and its quality is confirmed by annual performance

reviews.

3

Record of best 5

0 1 2 3 4 5 total

number of forecasters

30 10 4 3 2 1 50

Records of being selected as “best 5 forecasters”

CF is here!

Page 4: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

4

1-2 Introduction

“Risk” information is valuable to ESPF users. Introduced density forecasts survey in June

2008. ⇒ “Mean Probability Distribution” (MPD) A natural next questions to be addressed is:

“How good is it and in what sense?”

4

Page 5: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

555

1. Introduction

2. Methodology

3. Data

4. Results

5. Conclusion

Page 6: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

666

2-1 Methodology

Two approaches to density forecastsPIT (Probability Integral Transform)Scoring the densities

PIT: Diebold, Gunther and Tay (1998) the sequence of PIT of realized variable with

respect to the observed density forecast is i.i.d. U(0,1), if the observed density forecast coincides with the sequence of conditional density of a target variable.

Page 7: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

77

2-2 Diebold, Gunther and Tay (1998)

ty ,...,, 321 tttt yyy

ttt yf |

ttt yp |

Series of realizations

Information set available at t

Conditional density of a series

Corresponding sequence of forecast density

mtttt

mtttt yfyp 11 || to be tested

is not observed and may exhibit structural change.

ttt yf |

ty

Page 8: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

88

If , then is .

ty

tttt yPduupz

2-3 Probability Integral Transform

ttt

tttttt

t

tttt zPp

zPfzPfz

zPzq 1

11

1

Density of tz

ttt yfyp tt zq 1,0U

tttt

tttt zPyand

zzPzp 1

Page 9: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

99

coincide with .

are generated from .

2-4 i.i.d. uniform distribution

mtty 1

mttttt yf 1|

mttttt yf 1| mttttt yp 1|

1,0..

11 Uduupz diimy

tmtt

t

111111

11

|...|||,...,,

yfyfyfyyyf

mmmmmm

mm

1

111

111

11

1111

11111

1

111

|...||

|,...,,

zPpzPf

zPpzPf

zPpzPf

zzzq

mmm

mmmm

mmm

mmmm

mm

Page 10: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

10

2-5. Berkowitz (2001) How to test i.i.d. U(0,1) DGT (1998) advocated graphical or nonparametric

approaches. But those methods are data intensive. Unfortunately, ESPF has only 4 year data. Berkowitz (2001) proposed to use a simple

transformation to normality. Likelihood ratio test is available and more powerful in small samples.

10

ty

ttt duufzx 11 1,0... Ndii

Standard normal cdf.

Page 11: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

1111

tt x Standard normal distribution function

tt

tttttt

xf

xfxxh ^

tt

tt

tt

tt

xxh

xf

xf

loglog ^

tt xh Density of based on tx upt

tt

tt

tt

tt

xxh

xf

xf

loglog ^ tttttttt xxhxfxf ^

Testing i.i.d. normal

ttt xx 1

0:0 H

Page 12: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

2.7 Scoring rules RPS (Ranked Probability Score)Boero, Smith and Wallis (2011)Kenny, Kostka, and Masera (2012)

A density forecast of individual j: Bin: CDF of : CDF of realization : , which is binary.

Deviation values are used for aggregation12

)( itjt Bp

itB ),...1( Ki

)( itjt Bp )( itjt BP

tytY

2

1 1, )(1

T

t

K

itittjj YBP

TRPS

Page 13: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

131313

1. Introduction

2. Methodology

3. Data

4. Results

5. Conclusion

Page 14: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

1414

3-1-1 MPDs for real GDP growth rates

‐8-7-6-5-4-3-2-101234

806 809 812 903

2008 GDP Density Forecast by Periods

-8-7-6-5-4-3-2-101234

806 809 812 903 906 909 912

2009 GDP Density Forecast by Periods

-7-6-5-4-3-2-1012345

906 909 912 1003 1006 1009 1012 1103

2010 GDP Density Forecast by Periods

-8-7-6-5-4-3-2-101234

1006 1009 1012 1103 1106 1109 1112 1203

2011 GDP Density Forecast by Periods

Page 15: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

151515

3-1-2 MPDs for CPI Inflation Rates

-4-3-2-101234

806 809 812 903

2008 CPI Density Forecast by Periods

-4-3-2-101234

806 809 812 903 906 909 912 1003

2009 CPI Density Forecast by Periods

-4-3-2-101234

906 909 912 1003 1006 1009 1012 1103

2010 CPI Density Forecast by Periods

-4-3-2-101234

1006 1009 1012 1103 1106 1109 1112 1203

2011 CPI Density Forecast by Periods

Page 16: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

1616

3-2-1 Fan Charts for real GDP growth rates

Note: The prediction intervals covers 10%, 25%,50%,75% and 90% ranges.

Page 17: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

1717

3-2-2 Fan Charts for CPI inflation rates

Note: The prediction intervals covers 10%, 25%,50%,75% and 90% ranges.

-2-1.5

-1-0.5

00.5

11.5

2

2005 2006 2007 2008 2009

June 2008 CPI Forecast

-2-1.5

-1-0.5

00.5

11.5

2

2005 2006 2007 2008 2009 2010

June 2009 CPI Forecast

-2-1.5

-1-0.5

00.5

11.5

2

2005 2006 2007 2008 2009 2010 2011

June 2010 CPI Forecast

-2-1.5

-1-0.5

00.5

11.5

2

2005 2006 2007 2008 2009 2010 2011

June 2011 CPI Forecast

Page 18: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

18

How many bins are used?

0

5

10

15

20

25

30

35

1 2 3 4 5 6 7 8 9 10 11 12 13 14

GDP CPI

(%)<FY2008>

0

5

10

15

20

25

30

1 2 3 4 5 6 7 8 9 10 11 12 13 14

GDP CPI

(%)<FY2009>

0

5

10

15

20

25

30

1 2 3 4 5 6 7 8 9 10 11 12 13 14

GDP CPI

(%) <FY2010>

05

101520253035

1 2 3 4 5 6 7 8 9 10 11 12 13 14

GDP CPI

(%) <FY2011>

GDP CPIFY2008 FY2009 FY2010 FY2011 FY2008 FY2009 FY2010 FY2011

Average 3.88 4.66 4.35 4.59 Average 3.89 4.51 4.53 4.53STD 1.34 1.98 1.74 1.79 STD 1.32 1.64 1.78 1.66# 428 631 677 679 # 346 551 599 623

Page 19: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

19

Are realized values outliers? : probability of being contained in density forecasts

0102030405060708090

100

6 7 8 9 10 11 12 1 2 3 4 5

GDP CPI

<FY2008>

Year t Year t+1

(%)

0102030405060708090

100

1 2 3 4 5 6 7 8 9 10 11 12 1 2 3 4 5

GDP CPI

<FY2009>

Year t Year t+1

(%)

0102030405060708090

100

1 2 3 4 5 6 7 8 9 10 11 12 1 2 3 4 5

GDP CPI

<FY2010>

Year t Year t+1

(%)

0102030405060708090

100

1 2 3 4 5 6 7 8 9 10 11 12 1 2 3 4 5

GDP CPI

<FY2011>

Year t Year t+1

(%)

Page 20: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

202020

1. Introduction

2. Methodology

3. Data

4. Results

5. Conclusion

Page 21: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

2121

1. Probability integral transform with respect to the forecast densities

2. If , then those data are dropped. (“Failure”)

3. transform of

4. Test of in the autoregressive model

5. is rejected, the same procedure is undertaken by subgrouping individual forecasters.

4-1-1 Procedures to test independency

ty

titi duupz ,,

1

10 ,, titi zorz

tiz ,

titi zx ,1

,

0:0 H

tititi xx ,1,,

0:0 H

Page 22: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

4-1-2 Samples to test independency

period # of sample # of failure

Real GDP growth rate

June 2008 36 36June 2009 27 20June 2010 37 1June 2011 40 1

CPI inflation rateJune 2008 34 3June 2009 30 6June 2010 37 7June 2011 40 22

22

National account FY (April to March) figures are available in mid-May: the June forecast is the first with non-overlapping information.

Sample size: 34 for GDP in FY2010 and 2011; 29 for CPI in FY2008 to FY2011.

Page 23: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

4-1-3 Results of the independency test

23

Eq.1 Eq.2 Eq.3Dependent var. GDP growth rate CPI inflation rateρ estimate 0.055 -0.587 -0.605

standard error 0.255 0.236 0.565p-value 0.830 0.013 0.285

Sample period FY 2010 to 2011 FY 2008 to 2011 FY 2008 to 2011# of observations 34 29 14# of individuals 20 10# of instruments 4 4

tititi xx ,1,, Estimate equations: The null hypothesis:

Rejected in Eq.2, but not rejected in Eq.3 for 10 individuals with better forecast performance

0:0 H

Page 24: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

4-2-1 Procedures to calculate RPS Want to examine performance of MPD. Use of June samples is justified by

independency test results, but the sample is small.

4 cases: (June or 17 months) x (exclude FY2008 or not)Larger samples: 17 months for GDP and 16

months for CPI. Compare MPD with three benchmarks and

individuals’ density forecasts.24

Page 25: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

25

4-2-2 Benchmark density forecasts: Real GDP Growth Rate in June 2011

Page 26: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

26

4-2-3 Benchmark density forecasts: CPI Inflation in June 2011

Page 27: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

4-2-4 Results of RPS calculations: comparing with benchmarks

27

period evaluation MPD Uniform Normal NaïveJune samples (1)

GDP FY2009 to 2011 RPS 1.29 1.34 1.56 1.37 ADV of RPS 46.26 56.47 57.00 48.14

including FY2008 RPS 3.10 2.81 3.15 n.a. ADV of RPS 46.22 47.08 51.18 n.a.

CPI FY2009 to 2011 RPS 0.45 1.06 0.39 0.51ADV of RPS 46.35 57.86 45.06 47.68

including FY2008 RPS 0.42 1.01 0.33 n.a. ADV of RPS 46.19 57.58 44.35 n.a. 17 or 16 month samples for each FY (2)

GDP FY2009 to 2011 ADV of RPS 46.78 55.34 51.36 48.94 including FY2008 ADV of RPS 46.80 52.87 49.98 51.27

CPI FY2009 to 2011 ADV of RPS 46.06 61.08 44.36 48.48including FY2008 ADV of RPS 46.13 59.24 45.14 48.88

Note: (1) Calculated from individuals without missing responses. (2) Calculated from individuals with more than 70 per cent responses.

Page 28: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

4-2-5 Results of RPS calculations: comparing with individuals (performance ranking)

28

period MPD Uniform Normal Naïve # of individuals

June samples (1)

GDP FY2009 to 2011 7 31 31 16 33including FY2008 2 3 23 n.a. 29

CPI FY2009 to 2011 13 28 9 17 32including FY2008 12 30 2 n.a. 32

17 or 16 month samples for each FY (2)

GDP FY2009 to 2011 5 36 32 13 37including FY2008 3 34 19 31 36

CPI FY2009 to 2011 6 34 3 15 35including FY2008 4 34 3 17 35

Note: (1) Calculated from individuals without missing responses. (2) Calculated from individuals with more than 70 per cent responses.

Page 29: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

292929

1. Introduction

2. Methodology

3. Data

4. Results

5. Conclusion

Page 30: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

30

5-1 Conclusion Apply Berkowitz’s (2001) test to individuals’

density forecasts produced in June every year. Real GDP growth rates Fail to reject the independency in real in FY 2010

and 2011. CPI inflation rates Reject the independency in all the samples in FY

2008 and 2011.But fail to reject it in its subsample with better

forecast performance.

Page 31: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

5-2 Conclusion

Calculate RPSMPD is a “good” density forecast compared to

three benchmark as well as individual densities.The above is robust to changes in sample periods

and variables.Prudent CPI density?Overconfidence may get rewarded in GDP.

Page 32: Evaluating Density Forecasts with Applications to ESPF · Testing i.i.d. normal x t x t 1 t H 0 : 0. 2.7 Scoring rules RPS (Ranked Probability Score) Boero, Smith and Wallis (2011)

3232

ReferencesBerkowitz, J. (2001) “Testing Density Forecasts, With Applications to Risk

Management”, Journal of Business and Economic Statistics 19:465-474.Boero, Gianna, Jeremy Smith, and Kenneth F. Wallis (2011) “Scoring Rules

and Survey Density Forecasts,” International Journal of Forecasting 27: 379–393

Diebold, F.X., T.A. Gunther and A.S. Tay (1998) “Evaluating Density Forecasts with Applications to Financial Risk Management”, International Economic Reviews 39:863-883.

Kenny, Geoff, Thomas Kostka, and Federico Masera (2012) “How Informative Are The Subjective Density Forecasts of Macroeconomists,” ECB Working Paper Series, No.1446.


Recommended