Page 1: Time Series Analysis - Integrated and long-memory processes

Time Series Analysis

Integrated and long-memory processes

Andrés M. Alonso    Carolina García-Martos

Universidad Carlos III de Madrid

Universidad Politécnica de Madrid

June – July, 2012


Page 2: Time Series Analysis - Integrated and long-memory processes

5. Integrated and long-memory processes

Outline:

Introduction

Integrated processes

The random walk

The simple exponential smoothing process

Integrated process of order two

ARIMA processes

Integrated processes and trends

Long-memory processes

Recommended readings:

• Chapter 6 of Brockwell and Davis (1996).

• Chapter 15 of Hamilton (1994).

Page 3: Time Series Analysis - Integrated and long-memory processes

Introduction

• In this section we begin our study of non-stationary processes:

A process can be non-stationary in the mean, the variance, the autocorrelations, or in other characteristics of the distribution of its variables.

When the level of the series is not stable in time, in particular when it shows increasing or decreasing trends, we say that the series is not stationary in the mean.

When the variability or the autocorrelations change with time, we say that the series is not stationary in the variance or in the autocovariance.

Finally, if the distribution of the variable at each point in time varies over time, we say that the series is not stationary in distribution.

The most important non-stationary processes are the integrated processes, which have the basic property that by differencing them we obtain stationary processes.


Page 4: Time Series Analysis - Integrated and long-memory processes

Introduction

• An important property that distinguishes integrated processes from stationary ones is the way in which dependency disappears over time.

In stationary ARMA processes the autocorrelations diminish geometrically, and practically reach zero within a few lags.

In integrated processes, the autocorrelations diminish linearly over time, and it is possible to find autocorrelation coefficients different from zero even at very high lags.

• There is a class of stationary processes where the autocorrelations decay much more slowly over time than in ARMA processes or in integrated processes. These are known as long-memory processes.

• In addition to their theoretical interest, these processes can closely approximate behavior observed in long climatological or financial series.


Page 5: Time Series Analysis - Integrated and long-memory processes

Integrated processes

• Most real series are not stationary, and their average level varies over time.

Example 46

The figure shows the weekly market share for Colgate toothpaste. The series, which we will denote by zt, shows a clearly decreasing trend and thus is not stationary.

[Figure: Weekly market share of Colgate toothpaste, 1958–1962]

Page 6: Time Series Analysis - Integrated and long-memory processes

• The figure shows the first difference of this series, that is, the series of variations in market share from one week to the next. If we let wt = ∇zt denote this new series, we see that its values oscillate around a constant mean and seem to correspond to a stationary series.

[Figure: First difference of Colgate market share, 1958–1962]

Page 7: Time Series Analysis - Integrated and long-memory processes

Integrated processes

• We conclude that the series zt of Colgate market share seems to be an integrated series, which is transformed into a stationary one by means of differencing.

• We say then that it is integrated of order one, the order of integration being the number of differences needed to obtain a stationary process.

• Economic series are usually not stationary, but their relative differences, or the differences of the variable measured in logarithms, are stationary.

Example 47

The next figures show another non-stationary series and its difference: the logarithm of the Madrid Stock Exchange General Index.

The series is not stationary in the mean since its level varies over time.

If we take a difference of this series we obtain the series of Madrid Stock Exchange returns based on the General Index. The monthly return appears stable over time.


Page 8: Time Series Analysis - Integrated and long-memory processes

[Figures: Madrid Stock General Index in logs, 1988–2003 (top), and its first difference D(LSGI) (bottom)]

• Examples 46 and 47 show first order integrated processes, that is, processes that are non-stationary but whose growth, or first differences, are stationary.


Page 9: Time Series Analysis - Integrated and long-memory processes

• If zt is the series of the general stock index, then ∇ log zt is:

∇ log zt = log(zt/zt−1) = log(1 + (zt − zt−1)/zt−1) ≈ (zt − zt−1)/zt−1 = ∇zt/zt−1

which shows that the first difference of the logarithm of a variable is approximately equal to its relative growth.
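As a quick numerical check, here is a minimal sketch in Python (the price series is made up purely for illustration) comparing the log difference with the relative growth:

```python
import numpy as np

# Hypothetical price series, used only to illustrate the approximation.
z = np.array([100.0, 102.0, 101.0, 104.0, 103.5])

log_returns = np.diff(np.log(z))        # first difference of log z_t
relative_growth = np.diff(z) / z[:-1]   # (z_t - z_{t-1}) / z_{t-1}

print(np.round(log_returns, 4))      # [ 0.0198 -0.0099  0.0293 -0.0048]
print(np.round(relative_growth, 4))  # [ 0.02   -0.0098  0.0297 -0.0048]
```

The two series agree closely, as the approximation predicts for small relative changes.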

[Figure: RSGI and DLSGI series, 1988–2002]

• When the original variable, zt, is a series of prices, the series ∇ log zt is called the stock return.


Page 10: Time Series Analysis - Integrated and long-memory processes

Integrated processes

• It is sometimes necessary to difference more than once to obtain a stationary process.

• For example, let zt denote the original series, and suppose its variations (or growth), measured by wt = ∇zt, are not stationary, but the variations of the growth, measured by ∇²zt, are. That is, the series:

yt = ∇wt = wt − wt−1 = zt − 2zt−1 + zt−2 = ∇²zt (98)

which represents the growth of the series wt, is stationary.

• The series yt is called the second difference of the original series zt, and we say that zt is integrated of order 2, or has stationary second order increments.

• Generalizing, we say that a process is integrated of order h ≥ 0, denoted I(h), when differencing it h times yields a stationary process.

• A stationary process is, therefore, always I(0).
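A minimal sketch, using an arbitrary example series, confirms that differencing twice agrees with the expression zt − 2zt−1 + zt−2 in (98):

```python
import numpy as np

z = np.array([1.0, 2.0, 4.0, 7.0, 11.0])   # arbitrary example series

w = np.diff(z)                             # first difference w_t = z_t - z_{t-1}
y = np.diff(w)                             # second difference y_t = w_t - w_{t-1}
y_direct = z[2:] - 2 * z[1:-1] + z[:-2]    # z_t - 2 z_{t-1} + z_{t-2}, as in (98)

print(y)         # [1. 1. 1.]
print(y_direct)  # [1. 1. 1.]
```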


Page 11: Time Series Analysis - Integrated and long-memory processes

Example 48

The figures show a series and its first difference, both of which are non-stationary. Nevertheless, the difference of this last series, which corresponds to the second difference of the original, does have a stable level.

[Figures: Spanish population over 16 years of age, 1978–2000, with its first difference and second difference]


Page 12: Time Series Analysis - Integrated and long-memory processes

The random walk

• We have seen that finite MA processes are always stationary, and that AR processes are stationary only if the roots of φ(B) = 0 lie outside the unit circle.

• Let us consider the AR(1):

zt = c + φzt−1 + at. (99)

If |φ| < 1 the process is stationary.

If |φ| > 1 it is easy to see that the process is explosive and the values of the variable grow without limit.

• Since explosive processes are not frequent in practice, values of the AR parameter greater than one are not generally useful for representing real series.

• An interesting case is |φ| = 1. Then the process is not stationary, but neither is it explosive, and it belongs to the class of first order integrated processes.


Page 13: Time Series Analysis - Integrated and long-memory processes

The random walk

• Indeed, it is straightforward to show that the first difference of an AR(1) with |φ| = 1:

wt = ∇zt = c + at,

is in fact a stationary process.

• This process is called a random walk, and was used earlier as an example of a Markov process and a martingale.

• An important characteristic that distinguishes stationary from non-stationary processes is the role of the constants:

In a stationary process the constant is unimportant, and we have subtracted its mean from the observations and worked with zero mean processes. Both the form of the process and its basic properties are the same whether the mean is zero or different from zero.

Nevertheless, in a non-stationary process the constants, if they exist, are very important and represent some permanent property of the process.


Page 14: Time Series Analysis - Integrated and long-memory processes

Example 49

The figure shows two simulations of model (99). In the first, c = 0; the process is said to have no drift, and the level of the series oscillates over time. In the second, c = 1; the process is said to have a drift equal to one, and it shows a linear trend of slope c.

[Figure: simulated random walk with drift and random walk without drift, 200 observations]

• The level of the series is that of the previous period plus c, which produces deterministic linear growth.

Datafile 2randomwalk.xls
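A minimal simulation sketch of model (99) with φ = 1, along the lines of Example 49 (the sample size, noise variance and seed are assumptions, not taken from the datafile):

```python
import numpy as np

rng = np.random.default_rng(0)

def random_walk(n, c=0.0, sigma=1.0):
    """Simulate z_t = c + z_{t-1} + a_t with z_0 = 0 and Gaussian noise."""
    return np.cumsum(c + rng.normal(0.0, sigma, size=n))

rw_without_drift = random_walk(200, c=0.0)  # level oscillates around zero
rw_with_drift = random_walk(200, c=1.0)     # shows a linear trend of slope 1
```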


Page 15: Time Series Analysis - Integrated and long-memory processes

The random walk

• We see that the graph of the series is totally different in the two cases.

• To calculate the descriptive measures of this process we assume that it starts at t = 0. Then, successively substituting for zt−1 we have:

zt = ct + at + at−1 + at−2 + ... + a1

and taking expectations:

E(zt) = ct.

• If c ≠ 0, the process has a mean that increases linearly over time, but if the process has no drift, c = 0, the mean is constant and equal to zero.

• Its variance is:

Var(zt) = E(at + at−1 + at−2 + ... + a1)² = σ²t (100)

and we see that the variance increases over time and tends to infinity with t. This property indicates that uncertainty about the level of the process grows as time passes.


Page 16: Time Series Analysis - Integrated and long-memory processes

The random walk

• To calculate the autocovariances we use the expression of the process at t + k:

zt+k = c(t + k) + at+k + ... + at + ... + a1

and, using the notation Cov(t, t + k) = Cov(zt, zt+k), we have:

Cov(t, t + k) = E[(zt − ct)(zt+k − c(t + k))] = σ²t. (101)

• Note that the autocovariances also increase over time: they are not only a function of the lag, as in stationary processes, but depend on the times at which they are calculated.

• In particular:

Cov(t, t − k) = σ²(t − k) ≠ Cov(t, t + k).


Page 17: Time Series Analysis - Integrated and long-memory processes

The random walk

• The autocorrelation is obtained by dividing (101) by the standard deviations of the variables, which follow from expression (100), resulting in:

ρ(t, t + k) = t/√(t(t + k)) = (1 + k/t)^(−1/2). (102)

• This expression indicates that if t is large, the coefficients of the autocorrelation function will be close to one and will decay in an approximately linear form with k.

• Indeed, if we assume that the process started in the distant past, so that k/t is small, then the function (1 + k/t)^(−1/2) can be approximated by a first order Taylor expansion as:

ρ(t, t + k) ≈ 1 − k/(2t). (103)

• This equation indicates that, for fixed t, if we look at the autocorrelation as a function of k we obtain a straight line with slope −1/(2t).
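A short numerical sketch comparing the exact expression (102) with the Taylor approximation (103), for an assumed origin t = 200:

```python
import numpy as np

t = 200                        # assumed time elapsed since the process started
k = np.arange(1, 31)

exact = (1 + k / t) ** -0.5    # equation (102)
approx = 1 - k / (2 * t)       # Taylor approximation (103)

print(np.max(np.abs(exact - approx)))  # about 0.008: close while k << t
```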


Page 18: Time Series Analysis - Integrated and long-memory processes

The random walk

• Taking logarithms in (102) we have

log ρ(t, t + k) = −(1/2) log(1 + k/t) ≈ −k/(2t)

and we see that if we express the correlations in logarithms they also decay linearly with the lag k.

• This behavior of the autocorrelation function of an integrated process can be thought of as an extreme case of an AR(1) with φ close to one.

• Indeed, if φ = 1 − ε, with small ε, we have

ρk = (1 − ε)^k ≈ 1 − kε.


Page 19: Time Series Analysis - Integrated and long-memory processes

Example 50

The figure shows a graph of the autocorrelation function of a simulation of 200 observations that follow a random walk. Here, we can clearly see the linear decay of the ACF.

[Figure: EViews correlogram of RW1 (sample: 1 200, 200 included observations). The autocorrelations decay approximately linearly from 0.972 at lag 1 to 0.412 at lag 36; all Q-statistics have Prob 0.000.]


Page 20: Time Series Analysis - Integrated and long-memory processes

Example 51

The figure shows the autocorrelation function of the series in Example 47. Notice that the correlations decline linearly, indicating a non-stationary process.

[Figure: EViews correlogram of LSGI (sample: 1988M01 2003M05, 185 included observations). The autocorrelations decline almost linearly from 0.989 at lag 1 to 0.432 at lag 36; all Q-statistics have Prob 0.000.]

• This behavior is observed both in the autocorrelations of the series and in the logarithms of the autocorrelations.


Page 21: Time Series Analysis - Integrated and long-memory processes

The simple exponential smoothing process

• A non-stationary process that has been widely used for forecasting, and which can be considered a generalization of the random walk, is:

∇zt = c + (1 − θB)at. (104)

If we set θ = 0 in this process we obtain a random walk. It can also be seen as an extreme case of the ARMA(1,1) in which the autoregressive coefficient equals one.

• For simplicity we assume c = 0 and |θ| < 1; inverting the MA part, this process is written:

zt = (1 − θ)zt−1 + θ(1 − θ)zt−2 + θ²(1 − θ)zt−3 + ... + at. (105)

• In the usual case where 0 < θ < 1, all the coefficients of the above AR(∞) are positive. They form an infinite geometric progression with first term (1 − θ) and ratio θ, hence their sum is one.


Page 22: Time Series Analysis - Integrated and long-memory processes

The simple exponential smoothing process

• Equation (105) indicates that the observed value at each time is a weighted mean of the values of the series at earlier times, with coefficients that decay geometrically with the lag (see the sketch after this list):

If θ is close to one, the mean is calculated using many coefficients, but each of them with little weight.

For example, if θ = .9 the weights of the lags are .1, .09, .081, ... and the weights decay very slowly.

If θ is close to zero, the mean is calculated by weighting only the last values observed.

For example, if θ = .1 the weights of the lags are .9, .09, .009, ... and practically only the last lags are taken into account, with very different weights.

This is known as the simple exponential smoothing process and was introduced intuitively in the first session.
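A small sketch of the AR(∞) weights (1 − θ)θ^(i−1) implied by (105), reproducing the two numerical cases above:

```python
import numpy as np

def ar_weights(theta, n_lags):
    """Weights (1 - theta) * theta**(i - 1) of lag i in the AR(inf) form (105)."""
    i = np.arange(1, n_lags + 1)
    return (1 - theta) * theta ** (i - 1)

print(np.round(ar_weights(0.9, 5), 4))  # [0.1    0.09   0.081  0.0729 0.0656]
print(np.round(ar_weights(0.1, 5), 4))  # [0.9    0.09   0.009  0.0009 0.0001]
print(ar_weights(0.9, 1000).sum())      # close to one, as stated above
```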


Page 23: Time Series Analysis - Integrated and long-memory processes

The simple exponential smoothing process

• To calculate the descriptive measures of the process we assume that it started at t = 0, with z0 = a0 = 0 and c = 0. Thus:

z1 = a1

z2 = z1 + a2 − θa1 = a2 + (1 − θ)a1

...

zt = at + (1 − θ)at−1 + ... + (1 − θ)a1.

• The mean of the process is zero, as in the random walk with c = 0. Its variance is:

Var(zt) = σ²(1 + (t − 1)(1 − θ)²)

and the process is non-stationary in the variance, since the variance depends on time.


Page 24: Time Series Analysis - Integrated and long-memory processes

The simple exponential smoothing process

• The covariances are calculated as:

Cov(t, t + k) = E(zt zt+k) = E[(at + (1 − θ)Σ_{i=1}^{t−1} at−i)(at+k + (1 − θ)Σ_{i=1}^{t+k−1} at+k−i)]

and, taking into account that E(ai aj) = 0 if i ≠ j, we obtain:

Cov(t, t + k) = σ²(1 − θ)(1 + (t − 1)(1 − θ)).

• Using this expression and that of the variance we obtain the autocorrelation function:

ρ(t, t + k) = (1 − θ)(1 + (t − 1)(1 − θ)) / √((1 + (t − 1)(1 − θ)²)(1 + (t + k − 1)(1 − θ)²)).


Page 25: Time Series Analysis - Integrated and long-memory processes

The simple exponential smoothing process

• Assuming that t is large, such that t − 1 ≈ t, and that θ is not very close to one, such that 1 + t(1 − θ)² ≈ t(1 − θ)², then:

ρ(t, t + k) ≈ t/√(t(t + k)).

• That is the same autocorrelation function (102) obtained for the random walk, i.e., the coefficients show approximately linear decay as the lag increases.

• Nevertheless, if θ is close to one, the coefficients do not necessarily have to be large, although we should still observe a linear decay.


Page 26: Time Series Analysis - Integrated and long-memory processes

Example 52

The figure shows a series generated using θ = .9. Notice that although the autocorrelations are no longer close to one, they show the expected characteristic of linear decay.

[Figures: simulated series with θ = .9 (left) and its sample ACF (right)]
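A simulation sketch of this process (the sample size and seed are assumptions): generate the MA(1) increments with θ = .9, integrate them, and inspect the sample ACF of the level.

```python
import numpy as np

rng = np.random.default_rng(1)

n, theta = 300, 0.9
a = rng.normal(size=n + 1)
w = a[1:] - theta * a[:-1]    # MA(1) increments: nabla z_t = (1 - 0.9 B) a_t
z = np.cumsum(w)              # integrate the increments to obtain z_t

def acf(x, k):
    """Sample autocorrelation of x at lag k."""
    x = x - x.mean()
    return np.dot(x[:-k], x[k:]) / np.dot(x, x)

print([round(acf(z, k), 2) for k in (1, 5, 10, 20)])  # slow, roughly linear decay
```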


Page 27: Time Series Analysis - Integrated and long-memory processes

Integrated process of order two

• Many real series with trend are represented with a model with two differences, that is, integrated of order two. A simple model that appears in many applications is:

∇²zt = (1 − θB)at. (106)

• To justify this model, we assume a random walk whose drift changes with time, that is, the process:

∇zt = ct + ut.

• This equation indicates that ∇zt has a mean, ct, which is the growth of zt and evolves over time. Successively substituting in the above equation, and assuming that the process starts at t = 0 with z0 = u0 = 0, we obtain:

zt = (ct + ... + c1) + ut + ... + u1.


Page 28: Time Series Analysis - Integrated and long-memory processes

Integrated process of order two

• Let us assume that the evolution of the growth coefficient at each time, ct, is smooth, such that

ct = ct−1 + εt,

where εt is a white noise process independent of ut.

• Thus:

∇²zt = ∇zt − ∇zt−1 = ct + ut − (ct−1 + ut−1) = εt + ut − ut−1 = (1 − θB)at,

since the sum of a white noise and a non-invertible MA(1) is an invertible MA(1).

• We conclude, therefore, that process (106) is a generalization of the random walk that allows the drift to vary smoothly over time.

• In general, integrated processes of order two can be seen as a generalization of integrated processes of order one in which the slope of the growth line, instead of being fixed, varies over time.


Page 29: Time Series Analysis - Integrated and long-memory processes

ARIMA processes

• The two processes discussed, the random walk and the IMA, were obtained by letting the root of the AR part of the AR(1) and ARMA(1,1) processes equal one. This idea can be generalized to any ARMA process, allowing one or more roots of the AR operator to equal one:

(1 − φ1B − ... − φpB^p)(1 − B)^d zt = c + (1 − θ1B − ... − θqB^q)at

which we call ARIMA(p, d, q) processes.

p is the order of the stationary autoregressive part.

d is the number of unit roots (the order of integration of the process).

q is the order of the moving average part.

• Using the difference operator, ∇ = 1 − B, the above process is usually written as:

φp(B)∇^d zt = c + θq(B)at. (107)
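As an illustration, model (107) can be estimated with the ARIMA class of the Python statsmodels package (assuming statsmodels is available); here we fit an ARIMA(0,1,1) to a simulated random walk:

```python
import numpy as np
from statsmodels.tsa.arima.model import ARIMA  # assumes statsmodels is installed

rng = np.random.default_rng(2)
y = np.cumsum(rng.normal(size=500))  # simulated random walk, i.e. ARIMA(0,1,0)

# Fit an ARIMA(0,1,1): one difference (d = 1) plus an MA(1) term.
result = ARIMA(y, order=(0, 1, 1)).fit()
print(result.params)  # the estimated MA coefficient should be close to zero
```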


Page 30: Time Series Analysis - Integrated and long-memory processes

ARIMA processes

• ARIMA stands for AutoRegressive Integrated Moving Average, where "integrated" indicates that, letting ωt = ∇^d zt denote the stationary process, zt is obtained as a sum (integration) of ωt. Indeed, if

ωt = (1 − B)zt

then, defining

(1 − B)^(−1) = 1 + B + B² + B³ + ...

results in:

zt = (1 − B)^(−1)ωt = Σ_{j=−∞}^{t} ωj.

• We have already seen two examples of ARIMA processes: the random walk is an ARIMA(0,1,0) model and the simple exponential smoothing process is the ARIMA(0,1,1), or IMA(1,1).

• Both are characterized by the fact that the autocorrelation function has slowly decaying coefficients.

• All non-stationary ARIMA processes have this general property.

Page 31: Time Series Analysis - Integrated and long-memory processes

ARIMA processes

• To prove it, recall that the correlogram of an ARMA(p, q) satisfies, for k > q, the equation:

φp(B)ρk = 0, k > q,

whose solution is of the type:

ρk = A1 G1^k + ... + Ap Gp^k

where the Gi^(−1) are the roots of φp(B) = 0 and |Gi| < 1.

• If one of these roots Gi is very close to one, writing Gi = 1 − ε, with very small ε, for large k the terms Aj Gj^k of the remaining roots will be practically zero (since Gj^k → 0) and we have, approximately:

ρk = Ai(1 − ε)^k ≈ Ai(1 − kε) for large k.

• As a result, the ACF will have positive coefficients that fade out approximately linearly and may be different from zero even for high values of k.
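A quick numerical sketch of this approximation, for an assumed root Gi = 1 − ε with ε = .01 and Ai = 1:

```python
import numpy as np

eps, A_i = 0.01, 1.0           # assumed: root G_i = 1 - eps, constant A_i = 1
k = np.arange(1, 11)

exact = A_i * (1 - eps) ** k   # rho_k = A_i (1 - eps)**k
linear = A_i * (1 - k * eps)   # linear approximation A_i (1 - k eps)

print(np.round(exact - linear, 4))  # small while k * eps stays small
```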


Page 32: Time Series Analysis - Integrated and long-memory processes

• This property of persistence of positive values in the correlogram (even though they are small) and of linear decay characterizes non-stationary processes.

Example 53

The figure shows a realization of the series (1 − .4B)∇zt = (1 − .8B)at and its estimated autocorrelation function. The linear decay of the autocorrelation coefficients is again observed.

[Figures: realization of (1 − .4B)∇zt = (1 − .8B)at (left) and its estimated ACF (right)]


Page 33: Time Series Analysis - Integrated and long-memory processes

Integrated processes and trends

• Not all non-stationary processes are integrated, but integrated processes cover many interesting cases that we find in practice.

• It is easy to prove that any process which is the sum of a polynomial trend and a stationary process is integrated. For example, let us take the case of a deterministic linear growth process:

zt = b + ct + ut (108)

where ut is white noise.

• This process is integrated of order one, or I(1), since taking the first difference of the series zt:

ωt = zt − zt−1 = b + ct + ut − (b + c(t − 1) + ut−1) = c + ut − ut−1

and, since c is constant and ∇ut is a stationary process (being the difference of two stationary processes, see section 3.4), the process ωt is stationary as well.
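A simulation sketch of (108) with assumed parameter values: after differencing, the sample mean is close to the slope c and the lag-one autocorrelation is close to −0.5, the theoretical value for an MA(1) with θ = 1, in line with the non-invertibility discussed next.

```python
import numpy as np

rng = np.random.default_rng(3)

n, b, c = 10_000, 1.0, 0.5          # assumed sample size, intercept and slope
t = np.arange(1, n + 1)
z = b + c * t + rng.normal(size=n)  # z_t = b + c t + u_t, u_t white noise

w = np.diff(z)                      # w_t = c + u_t - u_{t-1}
print(round(w.mean(), 3))           # close to the slope c = 0.5

x = w - w.mean()
r1 = np.dot(x[:-1], x[1:]) / np.dot(x, x)
print(round(r1, 3))                 # close to -0.5, lag-1 ACF of MA(1) with theta = 1
```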


Page 34: Time Series Analysis - Integrated and long-memory processes

Integrated processes and trends

• Therefore, zt defined by (108) is integrated of order one, and its first difference follows model (104) with at = ut and parameter θ equal to one.

• The process is not invertible, and this property tells us that we have a deterministic linear component in the model.

• We observe that, after differencing the process, the remaining constant is the slope of the deterministic linear growth.

• This result is valid for processes generated as the sum of a polynomial trend of order h and any stationary process, ut:

zt = µt + ut (109)

where µt = a + bt + ct² + ... + dt^h.

• If we difference this process h times, a stationary process is obtained, with a non-invertible moving average part.


Page 35: Time Series Analysis - Integrated and long-memory processes

Integrated processes and trends

• For example, let us assume a quadratic trend. Then, taking two differences of the process, we have:

∇²zt = c + ∇²ut

and the process ηt = ∇²ut has a non-invertible moving average.

• For example, if ut is an AR(1), (1 − φB)ut = at, we have

ηt = (1 − φB)^(−1)∇²at

and ηt is a non-invertible ARMA(1,2) process.

• This example shows that if a series has a deterministic polynomial trend and is modelled by an ARIMA process, we obtain: (1) non-invertible moving average components; (2) a stationary series with a mean different from zero, whose constant is the highest order coefficient of the deterministic trend.

• Again, we see the importance of constants in integrated processes, since they represent deterministic effects.


Page 36: Time Series Analysis - Integrated and long-memory processes

Integrated processes and trends

• Processes with a deterministic linear trend are infrequent in practice, but we can generalize the above model somewhat by allowing the slope at each time to vary a little with respect to its previous value. We write the process with a linear trend but with variable components as:

zt = µt + vt. (110)

• With this model the value of the series at each point in time is the sum of its level, µt, and a white noise process, vt.

• We assume that the level evolves as:

µt = µt−1 + c + εt. (111)

• The process has a trend c, because the level at time t, µt, is obtained from that of the previous time by adding a constant c, which is the slope, plus a small random component, εt.


Page 37: Time Series Analysis - Integrated and long-memory processes

Integrated processes and trends

• For this process the first difference is:

∇zt = c + εt + ∇vt

and since the sum of an MA(1) process and white noise is an MA(1), we again obtain process (104), but now with a parameter θ smaller than one.

• The above model can be further generalized by letting the deterministic slope c change with time, with a certain inertia. To do that, we replace expression (111) with:

µt = µt−1 + βt + εt

and

βt = βt−1 + υt

where the processes εt and υt are independent white noises. Now the level grows linearly, but with a slope, βt, that changes over time.


Page 38: Time Series Analysis - Integrated and long-memory processes

Integrated processes and trends

• In this representation, if the variance of υt is very small the series has an almost constant trend, and we return to the above model. If we take the first difference of the original series, we have

∇zt = βt + εt + ∇vt

and taking a second difference

∇²zt = υt + ∇εt + ∇²vt = (1 − θ1B − θ2B²)at (112)

since the sum of MA processes is a new MA process.

• Depending on the variances of the three white noise processes we can have different situations, illustrated in the sketch below: (i) if V(vt) is much smaller than the variances of the other noise processes, the term ∇²vt can be dropped from the right-hand side of (112) and the sum will be an MA(1); (ii) if the term υt is dominant and we ignore the other two, the series approximately follows the model ∇²zt = υt; and (iii) if V(υt) is zero, then βt = βt−1 = β and we return to the model with a deterministic trend.
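A simulation sketch of this local linear trend model (the three noise standard deviations are assumptions): the second difference behaves like an MA(2), with sample autocorrelations essentially zero beyond lag 2.

```python
import numpy as np

rng = np.random.default_rng(4)

n = 5_000
eps = rng.normal(0, 0.5, n)   # level noise eps_t      (assumed std. dev.)
ups = rng.normal(0, 0.1, n)   # slope noise upsilon_t  (assumed std. dev.)
v = rng.normal(0, 1.0, n)     # observation noise v_t  (assumed std. dev.)

beta = np.cumsum(ups)         # beta_t = beta_{t-1} + upsilon_t
mu = np.cumsum(beta + eps)    # mu_t = mu_{t-1} + beta_t + eps_t
z = mu + v                    # z_t = mu_t + v_t

d2z = np.diff(z, n=2)         # second difference, an MA(2) as in (112)
x = d2z - d2z.mean()
acf = [round(np.dot(x[:-k], x[k:]) / np.dot(x, x), 2) for k in (1, 2, 3, 4)]
print(acf)                    # non-zero at lags 1 and 2, near zero afterwards
```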


Page 39: Time Series Analysis - Integrated and long-memory processes

Integrated processes and trends

• It is important to understand the difference between the two types of models presented here:

Model (108) is purely deterministic, since at each time t the expected value of the series is determined from the start, and is b + ct. Moreover, the expected value of zt given zt−1 is still the same, b + ct. In this model, knowing the previous values does not modify the predictions.

Model (110) is more general because, although at each point in time it also has linear growth equal to c, the level of the series is not determined from the beginning and can vary randomly. This means that the expected value of zt given zt−1 is zt−1 + c, which depends on the observed value.

• To summarize, ARIMA models, by including differences, incorporate stochastic trends and, in extreme cases, deterministic trends. In the latter case, the differenced series will have a mean different from zero and non-invertible moving average terms may appear.


Page 40: Time Series Analysis - Integrated and long-memory processes

Long-memory processes

• There is a class of stationary processes that are easily confused with non-stationary integrated processes. These are long-memory processes, which are characterized by many small autocorrelation coefficients that decay very slowly.

• This property has been observed in some stationary meteorological or environmental series (but also in financial series). The slow decay of the autocorrelations could lead to modelling these series as if they were integrated of order one, by taking a difference.

• Nevertheless, the decay is different from that of integrated processes: it is much faster for the first lags than in an integrated process, but slower for high lags.

• This structure means that in a long-memory process we observe many small autocorrelation coefficients that decay very slowly at high lags, as opposed to what usually happens with integrated processes, where the decay is linear.


Page 41: Time Series Analysis - Integrated and long-memory processes

Long-memory processes

• The simplest long-memory process is obtained by generalizing the random walk using:

(1 − B)^d zt = at (113)

where now the parameter d, instead of being an integer, is a real number.

• To define the operator (1 − B)^d when d is not an integer, we start from Newton's binomial theorem. When d is a positive integer, it is verified that:

(1 − B)^d = Σ_{i=0}^{d} C(d, i) 1^(d−i) (−B)^i = Σ_{i=0}^{d} αi B^i, (114)

where the coefficients of this sum are defined by

αi = C(d, i)(−1)^i = [d!/((d − i)! i!)] (−1)^i = [Γ(d + 1)/(Γ(i + 1)Γ(d − i + 1))] (−1)^i,

where Γ(a + 1) = aΓ(a) is the Gamma function, which for integer values coincides with the factorial function.


Page 42: Time Series Analysis - Integrated and long-memory processes

Long-memory processes

• To generalize this definition when d is not an integer, we can take advantage of the fact that the Gamma function is defined for any real number, and for real d write:

(1 − B)^d = Σ_{i=0}^{∞} αi B^i, (115)

where the coefficients of this infinite expansion are given by

αi = [Γ(d + 1)/(Γ(i + 1)Γ(d − i + 1))] (−1)^i.

• Moreover, it can be proved that when d is an integer this definition is consistent with (114), since all terms with i > d in this infinite summation are zero, so that for integer d (115) reduces to (114).


Page 43: Time Series Analysis - Integrated and long-memory processes

Long-memory processes

• We observe that for i = 0 the result is α0 = 1, and using the properties of the Gamma function it can be shown that:

αi = Π_{0<j≤i} (j − 1 − d)/j, for i = 1, 2, ...

• Using this expression we can write the fractional process (113) by means of its AR(∞) representation, expanding the operator (1 − B)^d, to obtain:

zt = −Σ_{i=1}^{∞} αi zt−i + at. (116)

It can be proved that this process also admits an MA(∞) representation of the form

zt = Σ_{i=0}^{∞} ψi at−i,

where the coefficients verify Wold's representation of a stationary MA(∞) series with finite variance, that is,

Σ ψi² < ∞.
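A sketch of the coefficients αi using the product expression above, written as a recursion (d = 0.4 is just an example value):

```python
import numpy as np

def frac_diff_coeffs(d, n):
    """Coefficients alpha_i of (1 - B)**d via alpha_i = alpha_{i-1} (i - 1 - d) / i."""
    alpha = np.empty(n + 1)
    alpha[0] = 1.0
    for i in range(1, n + 1):
        alpha[i] = alpha[i - 1] * (i - 1 - d) / i
    return alpha

print(np.round(frac_diff_coeffs(0.4, 5), 4))
# [ 1.     -0.4    -0.12   -0.064  -0.0416 -0.03  ]
print(frac_diff_coeffs(1, 5))  # integer d recovers (114): [ 1. -1.  0.  0.  0.  0.]
```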


Page 44: Time Series Analysis - Integrated and long-memory processes

Long-memory processes

• These coefficients are obtained using

ψi = Γ(d + i)/(Γ(i + 1)Γ(d)).

It can be proved that the autocorrelation function of the long-memory fractional process (113) is

ρ(k) = Γ(d + k)Γ(1 − d)/(Γ(k + 1 − d)Γ(d)).

• In particular, for k = 1 we obtain

ρ(1) = dΓ(d)Γ(1 − d)/((1 − d)Γ(1 − d)Γ(d)) = d/(1 − d),

for k = 2

ρ(2) = d(d + 1)/((2 − d)(1 − d)) = ρ(1)(d + 1)/(2 − d)

and, in general,

ρ(k) = ρ(k − 1)(d + k − 1)/(k − d).
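A sketch of this ACF recursion, with d = 0.35 as an example (the value used later for the Nile series):

```python
import numpy as np

def arfima_acf(d, n_lags):
    """ACF of (1 - B)**d z_t = a_t via rho(k) = rho(k-1) (d + k - 1) / (k - d)."""
    rho = np.empty(n_lags + 1)
    rho[0] = 1.0
    for k in range(1, n_lags + 1):
        rho[k] = rho[k - 1] * (d + k - 1) / (k - d)
    return rho

print(np.round(arfima_acf(0.35, 5), 3))
# [1.    0.538 0.441 0.391 0.359 0.335]: rho(1) = d/(1 - d), then slow decay
```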


Page 45: Time Series Analysis - Integrated and long-memory processes

Long-memory processes

• For large k, we can approximate the correlations by:

ρ(k) ≈ [Γ(1 − d)/Γ(d)] |k|^(2d−1)

and taking logarithms

log ρ(k) ≈ a + (2d − 1) log |k|.

• Therefore, if we plot log ρ(k) against log |k| for high values of k, we obtain a line with slope 2d − 1.

• This property differentiates a long-memory process from a non-stationary integrated process, where we have seen that log ρ(k) decays linearly with k and not with log k.


Page 46: Time Series Analysis - Integrated and long-memory processes

Example 54

The left figure shows the coefficients −αi of the AR representation (116) for several values of d, and the right figure shows the autocorrelation function for those values of d.

[Figures: AR coefficients −αi (left) and autocorrelation functions (right) for d = 0.4, 0.3, 0.1]

• Notice that the AR coefficients quickly become very small, and that the AC function has many non-null coefficients that decay slowly.

• These properties make it easy to confuse this process with a high order AR(p), and if d is close to .5 the high values and slow decay of the AC function can suggest an I(1) process.


Page 47: Time Series Analysis - Integrated and long-memory processes

Long-memory processes

• Generalizing the above, we can consider ARMA processes that also include long memory, defining

(1 − φ1B − ... − φpB^p)(1 − B)^d zt = c + (1 − θ1B − ... − θqB^q)at

where d is not necessarily an integer.

• If d < 0.5 the process is stationary, whereas if d ≥ 0.5 it is non-stationary.

• These models are called ARFIMA(p, d, q): autoregressive fractionally integrated moving average.

• If the process is stationary, then for high lags, once the ARMA structure has faded, the long-memory behavior appears. If it is not stationary, taking a difference converts it into a stationary process, but one with long memory.


Page 48: Time Series Analysis - Integrated and long-memory processes

Long-memory processes

• Long-memory processes may appear when short-memory series are aggregated under certain circumstances. Granger (1980) proved that if we add N independent AR(1) processes, in the limit we obtain a long-memory process.

• We know that the aggregation of independent AR processes leads to ARMA processes, and this is the expected result with few summands.

• Granger's results indicate that in the limit we will have a long-memory process, and suggest that these processes can be approximated by an ARMA with orders p and q that are high and similar.

• With series that are not very long it is difficult to distinguish a long-memory process from an ARMA. Nevertheless, if we have a series with a large number of observations, as is typical of meteorological or financial data, in some cases long-memory processes can provide a better fit than short-memory ARMA processes.

• However, as we will see in the next chapter, the differences in prediction are usually small.


Page 49: Time Series Analysis - Integrated and long-memory processes

Long-memory processes - Nile river example

Example 55

The figure shows a series of yearly data on the minimum levels of the Nile river between the years 622 and 1284, which can be found in the file nilomin.dat.

[Figure: Minimum yearly levels of the Nile river, 622–1284]


Page 50: Time Series Analysis - Integrated and long-memory processes

• The figure shows its autocorrelation function, where it can be observed that the correlations decay very slowly.

[Figure: autocorrelation function of the Nile series, lags 1–40]

• As a reference, the AC function of a fractional process with d = 0.35 is also included.


Page 51: Time Series Analysis - Integrated and long-memory processes

• The figure shows the logarithm of the AC plotted against the logarithm of the lag, and we see that the relationship is approximately linear.

• As a comparison, we have also included in this figure the Madrid Stock Exchange index data.

• For small values of k the AC decays more slowly in the integrated series than in the long-memory one, but for larger k the coefficients of this series, which follows an ARIMA model, decay faster than would correspond to a long-memory process.

[Figure: log ρ(k) versus log k for the Nile river and Madrid Stock Index series]


