+ All Categories
Home > Documents > A dynamic parameterization modeling for the age-period-cohort mortality

A dynamic parameterization modeling for the age-period-cohort mortality

Date post: 20-Jan-2016
Category:
Upload: flavio
View: 41 times
Download: 0 times
Share this document with a friend
Description:
A dynamic parameterization modeling for the age-period-cohort mortality. The case of E&W males mortality experience 1841-2006. Two important, general approaches have been developed for modeling and forecasting mortality, in discrete-time: The curve fitting approach and - PowerPoint PPT Presentation
Popular Tags:
49
06/11/22 1 A dynamic parameterization modeling for the age-period-cohort mortality The case of E&W males mortality experience 1841-2006
Transcript
Page 1: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 1

A dynamic parameterization modeling for the age-period-cohort

mortality

The case of E&W males mortality experience

1841-2006

Page 2: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 2

Two important, general approaches have been developed for modeling and forecasting mortality, in discrete-time:

The curve fitting approach

and

The principal component (PC) approach

Page 3: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 3

The curve fitting approach involves fitting parametric curves to the cross-sectional mortality rates in age effects, and extrapolating the parameter estimates in time effects

As mentioned by many authors, in the curve fitting approach, the deficiency of this approach comes from the high parameter time series interdependencies which cause serious difficulties for forecasting and for estimating the forecast prediction-intervals.

Building a multivariate time series for the parameters is theoretically possible but seriously complicates the approach and can lead to computational intractability.

Page 4: A dynamic parameterization  modeling for the age-period-cohort mortality

Recent developments in mortality modeling have tended to be extrapolative in nature, with the principal components (PC) approach being prominent. The PC approach involves computing PCs to the table of age-time-specific (log)-mortality rates, to obtain a linear transformation of the data in lower dimension and simplified structure, and extrapolating the PCs.

The main statistical tool used was least-squares estimation via singular value decomposition (SVD) of the matrix of the log age specific observed forces of mortality.

Page 5: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 5

The most well known PC approach is the Lee - Carter (LC) model (1992) :

where denote the singular value and respective left and right singular vectors of

, ,log( )-tx t x x x t t xz m k

( ) ( )x tk s u x v t , ( ), ( )s u x v x

txz

Page 6: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 6

Under the proposed approach, we build a GLM, modeling the actual number of deaths as independent realizations of Poisson random variables, for each calendar year independently, treating age as an explanatory variate, and using a log link function and (orthonormal) polynomial predictor structure in age effects:

or

, 1 , 11

ˆ expk

tx t j x jj

m L

ˆ expt tm L

Page 7: A dynamic parameterization  modeling for the age-period-cohort mortality

The optimum degree k for the design matrix (orthonormal polynomials) can be determined by maximizing the Bayesian Information Criterion (BIC) :

where, l is the (maximum) value of the total likelihood for all the n-calendar years (with n*a observations and n*k parameters).

04/21/23 7

2 ( ) ( ) ( )BIC log l n k log n a

Page 8: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 8

Next we can apply SVD to the random matrix B to extract the mortality dynamics in age-time effects. Then, the vector of the graduated log-central death rate, for all ages, in year t, inherits the structure of a dynamic mortality factor model:

where the random vector denotes the main age profile, is the matrix of the factor loadings, is the matrix of PC scores, and P is the matrix of eigenvectors.

rt tF P

rt tF P rt tb b b= -rt tb b b= -

ˆlog( )t tm A C F

A L C L P

Page 9: A dynamic parameterization  modeling for the age-period-cohort mortality

Sparse Principal Component Analysis

Although PC analysis is a classic tool for analyzing multivariate data, by reducing the dimensionality of the data, one of the key shortcomings of PC analysis however is that these factors or PCs are linear combinations of all variables, that is, all factor coefficients (or loadings) are non-zero.

Additionally, if the input data consists of non-stationary time series, a single linear combination of all the time series can solely explain the variability, because simultaneous drifting of the series may register as correlations of the columns.

04/21/23 9

Page 10: A dynamic parameterization  modeling for the age-period-cohort mortality

Further, in empirical studies with various mortality experiences, using PCA, results in factor loadings structures with significant values for both positive and negative loadings.

This undesirable feature gives high interdependence structure for different range of ages, explaining different mortality dynamics with the same factor, resulting in spurious interpretations and forecasting.

SPCA improves this problematic dependent structure by giving a better clustering of significant high factor loading values, seeking a trade-off between the two goals of expressive power (explaining most of the variance) and interpretability (making sure that the factors involve only a few variables).

Page 11: A dynamic parameterization  modeling for the age-period-cohort mortality

Luss and Aspremont (2006), solves a penalized formulation of the problem (given a covariance matrix

find sparse factors which explains the maximum amount of variance):

Maximize subject to

Where λ is a scalar which defines the sparsity (higher values of λ gives more sparsity).

04/21/23 11

( ) 1 | | 1Tr A X X

( ) 1Tr X 0X kX S

kA S

Page 12: A dynamic parameterization  modeling for the age-period-cohort mortality

Thus, applying SPCA to the random matrix B and keeping an optimum subset p(<k) of the SPCs, which explains the “majority” of the variance, leads to

Where denote the first p-columns of the matrix of sparse factor loadings and the SPC scores respectively, and is the error component which denotes the deviation of the model represented by the excluded SPCs.

04/21/23 12

ˆlog( )= p pt t tm A C F u

&p ptC F

tu

Page 13: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 13

, , ,1

uˆlog( )= +p

tx x t i x i t t xi

m A b d F

1 , 11

k

x j x jj

A L

After some simple algebraically manipulations we obtain

,1

p

t i i ti

b m F

, , , 12

k

i x j i x jj

d e L

,0 1,i x im L e

The model now can be viewed as a member of the class of log-linear models, with age and time being the two main random effects (similar to the so-called association model of Goodman, 1991).

Page 14: A dynamic parameterization  modeling for the age-period-cohort mortality

Optimum number of SPCs

The values are very informative for the number of components retained, since

1. The decomposition implies that the value can be viewed as mean indices of the factor loadings (or associations) for all the ages with the i-factor

2. The value is a measure of the importance (or a weight) for the i-factor in the construction of the time additive term

04/21/23 14

, ,i x i i xC m d

im

tb

im

Page 15: A dynamic parameterization  modeling for the age-period-cohort mortality

Confidence intervals for the values can be constructed by bootstrapping techniques. Alternatively, if we just count the number the values change sign in the bootstrap procedure, then the rate of times where the sign is changed (say a%) will define a confidence coefficient or a confidence level (CL=1-a%) for each SPC. If the values do not change sign in the bootstrap procedure (at any given significant level), it is an indication of robustness and the significance of the particular factor.

Next, we can choose the ‘optimum’ value λ (the scalar which defines the sparsity in the SPCA), to be the value that maximizes the number of significant SPC, as defined by the CL criterion (at a given significance level).

04/21/23 15

im

im

Page 16: A dynamic parameterization  modeling for the age-period-cohort mortality

Then, among these significant SPC, we can specify the key age groups by taking, for each age, the maximum association value (MAV), defined by:

It is desirable that the above method should produce distinct key age groups of neighbouring values, which belong to the same components. This desirable feature would lead to a clear picture for the mortality dynamics.

max,

{ }maxx i x

iC C

maxxC

Page 17: A dynamic parameterization  modeling for the age-period-cohort mortality

Cohort effects

04/21/23 17

Page 18: A dynamic parameterization  modeling for the age-period-cohort mortality

In the presence of cohort effects, the GLM estimates are marginal estimates of age and period effects. If we settle for the residual cohort effect conditional on the already estimated age and period effects we have:

In which the term is treated as

an offset in the rectangular DEBF.

04/21/23 18

2

, , , 1 , 11 1

log( ( )) log( )p k

tx tx x i x i t c j x ji j

E D R A C F b L- -= =

×= + + × +å å

, ,1

log( )p

tx x i x i ti

R A C F=

+ + ×å

Page 19: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 19

In the same way with the age period effects we apply the eigenvalue decomposition to the covariance matrix of these conditional (cohort) GLM estimates, leading to graduated central mortality rates in age – period – cohort effects:

defined in the rectangular DEBF.

Forecast estimates become (area DCGF)

, , , , ,1 1

ˆlog( )=p q

tx x i x i t i x i t x t x xi i

m A C F D H

, , , , ,1 1

ˆlog( )=p q

tx x i x i t i x i t x t x xi i

m A C F D H

, , , , ,1 1

ˆˆ ˆ ˆˆlog( )=p q

t x x i x i t i x i t xi i

m A C F D H

Page 20: A dynamic parameterization  modeling for the age-period-cohort mortality

Using the same arguments, the log-graduated central mortality rates, can be alternatively decomposed as a member of the class of log-linear models with three main random effects:

where denotes the independent main cohort effect:

for and

04/21/23 20

, , , , ,1 1

ˆˆlog( )=p q

ctx x t t x i x i t i x i t x t x x

i i

m A b c d F d H

t xc

,1

ˆq

ct x i i t x

i

c m H

,0 1,c c ci x im L e , , , 1

2

kc c ci x j i x j

j

d e L

Page 21: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 21

ForecastingFor forecast purposes, for each time SPC, we advocate a specific class of dynamic linear regression (dlr) models under state space models and Kalman filter techniques:

for each calendar year t (the so-called regressor) and for each time related SPC (i=1,…,p), with the slope as stochastic time variable parameter (TVP) that follow a random walk (RW) process.

The cohort related PCs are modelled as independent RW plus noise time series model, since conditional on age-time effects, we may assume that the cohort related PCs are mean reverted stochastic processes

, , ,i t i i t i tF b t e

Page 22: A dynamic parameterization  modeling for the age-period-cohort mortality

Confidence Intervals (CI) for the true mortality rates

There are several sources of randomness in the proposed model. The true value of the log central mortality rates, assuming the model specification is correct, is given by

Where e( ) denotes the errors in estimating the relevant quantity, is the residual error in fitting the model for age x and cohort t-x using a finite set of PC functions, and is the observation error which comes from the random variation of deaths in the Poisson distribution.

04/21/23 22

, , , , ,1

ˆ ˆ ˆlog( )p

t x x x i x i x i t i ti

m A e A C e C F e F

, , , , , ,1

ˆˆ ˆ ˆ eq

i x i x i t x i t x t x x t x xi

D e D H e H

,t x x

,et x x

Page 23: A dynamic parameterization  modeling for the age-period-cohort mortality

Assuming independence among all those different sources of error the total variance is given by

04/21/23 23

, , ,,

2 2 2 2 2 2 2, ,

1 1

ˆ ˆt x x x i x i xi t

p p

E i x i te A e C e Ce Fi i

C F

, ,, ,,

2 2 2 2 2 2 2, , eˆ

1 1

ˆˆ ˆt x x t x xi x i xi t x

q q

i x i t xe D e De Hi i

D H

Page 24: A dynamic parameterization  modeling for the age-period-cohort mortality

In order to generate an interval forecast, we can obtain the following 95% confidence interval estimate for :

Where

if

,t xm

, ,1.96 1.96

, , ,ˆ ˆˆ ˆ( ) ,E Et x x t x x

t x t x t xCI m m e m e

, , ,ˆ ˆˆ ˆ= expt x x x t x c xm a b x f t f f

,1

ˆ ˆp

px i x i

i

f b a

, , ,1

ˆ ˆp

pt x i x i t

i

f b b

,1

ˆ ˆp

px i x i

i

f b a

, , ,1

ˆ ˆp

pt x i x i t

i

f b b

, , ,1

ˆ ˆq

cc x i c i x

i

f b b

Page 25: A dynamic parameterization  modeling for the age-period-cohort mortality

The construction of complete lifetableFrom various mortality investigations, it has been observed that the number of people surviving up to older ages has increased considerably. Crude mortality rates at these ages are usually lack the required quality demanded for the construction of complete lifetables.

Empirical observations shows that the curve of mortality rates, in logarithmic scale, presents a concave shape for the oldest ages. One of the most well known approach to capture this empirical evidence is the Coale-Kisker method (1990), where assumes that the exponential rate of mortality increase at the oldest ages is not constant but declines linearly.

04/21/23 25

Page 26: A dynamic parameterization  modeling for the age-period-cohort mortality

The exponential rate of mortality increase is defined as

at age x, up to ultimate age ω.

We note that in Gompertz law, i.e.

A possible extension to Gompertz law is proposed if we assume for the slope term, b, follows an integrated random walk (IRW) time series model in order to capture the empirical patterns of mortality at oldest ages, such that the projected exponential rate of mortality increase is a linear function of the oldest ages. A more robust approach is if we apply this modelling structure to the age profile

04/21/23 26

1

ln xx

x

mk

m

xk b ln xm a b x

xA

Page 27: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 27

The case of E&W

The data

E&W male mortality experience, calendar years 1841-2006, and individual ages 0,1,…,89.

The data are freely provided by the “Human Mortality Database” (www.mortality.org).

, ,c x c ii xT L

³= å

Page 28: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 28

SDR, in age, time and cohort effects, with k=25 degree for the design matrix

Page 29: A dynamic parameterization  modeling for the age-period-cohort mortality

SDR, in age, time and cohort effects, with p=7 time SPC

04/21/23 29

Page 30: A dynamic parameterization  modeling for the age-period-cohort mortality

SDR, in age and cohort effects, with q=3 cohort PCs (final model)

04/21/23 30

Page 31: A dynamic parameterization  modeling for the age-period-cohort mortality

Sparse eigenvalues, percentage variance explained, mean associations, CL criterion, with

sparse parameter λ=36, in age-time effects

04/21/23 31

SPC index % var CL

1 6,3E+01 8,7E-01 7,08E-02 99,9%

2 8,2E+00 1,1E-01 7,33E-02 99,9%

3 1,0E+00 1,45E-02 -1,0E-02 100%

4 5,1E-02 7,2E-04 1,07E-03 100%

5 6,4E-04 3,6E-06 7,53E-04 99,9%

6 2,9E-04 1,1E-06 5,97E-04 98,5%

7 6,1E-8 2,4E-9 -7,59E-6 99,9%

8 2,7E-12 1,0E-14 -6,41E-7 71,0%

i im

iim

Page 32: A dynamic parameterization  modeling for the age-period-cohort mortality

Eigenvalues, percentage variance explained, mean associations, CL criterion, in age-cohort

effects

SPC index % var CL

1 1,6E-02 14% 4,2E-02 92%

2 2E-02 18% -3E-02 92%

3 2,9E-02 26% 6E-02 80%

4 1,6E-02 10% -5,2E-02 76%

5 9,1E-03 9% -1,8E-03 77%

04/21/23 32

imci

Page 33: A dynamic parameterization  modeling for the age-period-cohort mortality

Model Components for the age-time effects

04/21/23 33

xA

tb

Page 34: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 34

1( )v x

1( )t

1( )v x1( )v x1( )v x

1( )v x

1,xC

1,xC

1,tF

Page 35: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 35

2,xC

2,tF

Page 36: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 36

3,xC

3,tF

Page 37: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 37

4,xC

4,tF

Page 38: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 38

5,xC

5,tF

Page 39: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 39

6,xC

6,tF

Page 40: A dynamic parameterization  modeling for the age-period-cohort mortality

7,xC

7,tF

Page 41: A dynamic parameterization  modeling for the age-period-cohort mortality

Model Components for the age-cohort effects

04/21/23 41

cxA

t xc

Page 42: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 42

1,xD

1,ˆ

t xH

Page 43: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 43

2,xD

2,ˆ

t xH

Page 44: A dynamic parameterization  modeling for the age-period-cohort mortality

3,xD

3,ˆ

t xH

Page 45: A dynamic parameterization  modeling for the age-period-cohort mortality

Observed-projected cohort life expectancies at birth (left curves) and observed-projected cohort expected age at death

given survival at age 65 (right curves), with associated bootstrap 95% percentile CIs,

04/21/23 45

Page 46: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 46

Log central mortality rates, smoothed-predicted log central mortality rates and

associated CI, for various ages

Page 47: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 47

x=20x=10

x=30

x=40

Page 48: A dynamic parameterization  modeling for the age-period-cohort mortality

04/21/23 48

x=50 x=60

x=70 x=80

Page 49: A dynamic parameterization  modeling for the age-period-cohort mortality

Thank you


Recommended