Growth Mixture Modeling of Longitudinal Data David Huang, Dr.P.H., M.P.H. UCLA, Integrated Substance...

transcript

Growth Mixture Modeling of Longitudinal Data

David Huang, Dr.P.H., M.P.H.

UCLA, Integrated Substance Abuse Program

Longitudinal Data

• Subjects have repeated measures on some characteristics over time, which could be

• Medical history (ex blood pressure)

• Children’s learning curve (ex. math score)

• Baby’s growth curve (ex. weight)

• Drug use history (ex. heroin use)

i d 50 216 257 1008

days use

Growth Curve Modeling

• Level 1 represents intra-individual difference in repeated measures over time. (individual growth curve).

• Level 2 represents variation in individual growth curves.

Growth Curve Model with One Class (N = 436)

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

Class 1

Years Since The First Use

Days use per month

Limitation of Growth Curve Model

• Assume that growth curves are a sample from a single finite population. The growth model only represents a single average growth rate.

Growth Mixture Modeling

• Including latent classes into growth curve modeling.

• Modeling individual variation in growth rates.

• Classifying trajectories by latent class analysis.

Growth Mixture Model in Mplus

Source: Terry Duncan (2002). Growth Mixture Modeling of Adolescent Alcohol Use Data. www.ori.org/methodology

• This study is based on 436 male heroin addicts who were admitted to the California Civil Addict Program at 1964-1965 and were followed in the three follow-up studies conducted every ten years over 33 years.

Growth Curve Model with Two Classes (N = 436)

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

Class 1 (N=63)Class 2 (N=373)

Days use per month

Growth Curve Model with Three Classes (N = 436)

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

Class 1 (N=56)Class 2 (N=78)Class 3 (N=302)

Days of use per month

Growth Curve Model with Four Classes (N = 436)

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

Class 1 (N=52)Class 2 (N=74)Class 3 (N=277)Class 4 (N=33)

Growth Curve Model with Five Classes (N = 436)

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

Class 1 (N=70)Class 2 (N=66)Class 3 (N=249)Class 4 (N=34)Class 5 (N=17)

Goodness of fit

• Loglikelihood

• Akaike Information Criterion (AIC)

• Bayesian Information Criterion (BIC)

• Sample-size Adjusted BIC

• Entropy

Adjusted BIC Index by Latent Classes

0 1 2 3 4 5

Adjusted BIC

Latent Classes

Adjusted BIC

Difficulties in Model fitting

• EM algorithm reaches a local maxima, rather than a global maxima.

• Repeat EM algorithm with different sets of initial values.

• Use BIC to compare the goodness-of-fit of models

Example of Wrong Starting ValuesThree Classes (WRONG STRATING

VALUES)Three Classes

Member 1 Member 2 Member 3 Member 1 Member 2 Member 3

Intercept 21.44 3.43 14.11* 6.40 10.08** 26.06**

Slope -1.16 -1.25 0.67 -0.02 1.26** -0.58

Treatment on I -0.13 -0.01 0.16 0.16 -0.08 -0.04

Treatment on S 0.005 0.06 -0.01 -0.02 0.002 0.01

Class mean -2.43** -1.15 -- -0.71 -- 0.41

Treatment on class 0.05** -0.05 -- 0.03* -- 0.004

% of individual in each class (estimated)

161.8(0.32)

73.7 (0.14) 275.5 (0.54) 178.9(0.35)

121.8 (0.24)

210.3 (0.41)

% of individual in each class (observed)

162(0.32)

70 (0.14) 279 (0.54) 176(0.34)

123 (0.24) 212 (0.41)

Log Ho -31234.5 -31132.3

Akaike (AIC) 62533.0 62328.7

Bayesian (BIC) 62668.6 62464.3

Adjusted BIC 62567.0 62362.7

Entropy 0.897 0.890

1012141618202224262830

Year s si nce t he fi r st her oi n use

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

11 1 1 1

2 2 22 2 2 2

3 3 3 3 3 3 33

• EM algorithm would NOT converge.

• Start with a simple model. Set variance of intercept and slope at zero. Assume residuals are constant across the classes.

• Individual classification is model dependent and initial value dependent. Individual classification could vary in different models.

1012141618202224262830

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

11 1 1 1 1 1 1 1 1

2 2 22 2

3 33 3

3 33 3 3 3 3 3

1012141618202224262830

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

11 1 1 1

1 11 1

1 1 1 1

2 22 2 2 2 2 2 2 2 2 2

3 3 3 3 3 33 3

References

• Terry Duncan (2002). Growth Mixture Modeling of Adolescent Alcohol Use Data. www.ori.org/methodology

• Muthén, B. (2004). Latent variable analysis: Growth mixture modeling and related techniques for longitudinal data. In D. Kaplan (ed.), Handbook of quantitative methodology for the social sciences (pp. 345-368). Newbury Park, CA: Sage Publications.

Growth Mixture Modeling of Longitudinal Data David Huang, Dr.P.H., M.P.H. UCLA, Integrated Substance...

Documents