Download - CHARACTERIZATION OF ACTIVE JOINT COUNT TRAJECTORIES … · Roberta Berard, Master of Science (Clinical Epidemiology and Health Care Research) Graduate Department of Health Policy,

CHARACTERIZATION OF ACTIVE JOINT COUNT TRAJECTORIES IN

JUVENILE IDIOPATHIC ARTHRITIS

By

Roberta Berard MD FAAP FRCPC

A thesis submitted in conformity with the requirements for the degree of

Master of Science (Clinical Epidemiology and Health Care Research)

Graduate Department of Health Policy, Management and Evaluation

University of Toronto

© Copyright by Roberta Berard 2011

ii

Characterization of active joint count trajectories in juvenile idiopathic arthritis

Roberta Berard, Master of Science (Clinical Epidemiology and Health Care Research)

Graduate Department of Health Policy, Management and Evaluation

University of Toronto

2011

Abstract

Aim: To describe the longitudinal active joint count (AJC) trajectories in juvenile idiopathic

arthritis (JIA) and to examine the association of baseline characteristics with these trajectories.

Methods: A retrospective cohort study at two Canadian centres was performed. The longitudinal

trajectories of AJC were described using latent growth curve modeling (GCM). Latent GCM is

a novel technique that aims to classify individuals into statistically distinct groups based on

individual response patterns so that individuals within a group are more similar than individuals

between groups. The trajectory classes are each defined by a longitudinal growth curve. The

association of baseline characteristics stratified by trajectory group was examined by univariate

methods.

Results: Data were analyzed for 659 children diagnosed with JIA between 1990/03-2009/09. A

maximum of 10 years of follow-up data were included in the analysis. Participants were

classified into 5 statistically and clinically distinct AJC trajectories by latent GCM.

Conclusions: Using a novel longitudinal statistical method we were able to classify patients with

JIA based on their pattern of AJC over time. These results should be interpreted in light of

clinical significance. The trajectory classes will need to be examined for their relationship to

important genetic and biological predictors. Identification of patterns of disease course is

important in working towards the development of a clinically relevant outcome-based

classification system in JIA.

iii

Acknowledgments

I would like to thank my thesis advisory committee, Drs. Claire Bombardier, Rae Yeung,

Brian Feldman and George Tomlinson for their invaluable guidance, encouragement and

mentorship. I am so grateful for the opportunity to have worked on this exciting project under

their tutelage. I would like to thank Ms. Xiuying Li for the hours she spent with me merging and

cleaning my dataset. I am indebted to Drs. Kiem Oen and Alan Rosenberg and their research

teams for the countless hours of data collection and cleaning without which this thesis would not

have been possible. Thank you to the reviewers of my thesis, Drs. Taunton Southwood and

Rahim Moineddin, for taking the time and interest to critically evaluate my work.

Thank you to my friends who have always stood by me through my years of training.

Special thank you to my dear friend Bindee for proof-reading my thesis. To my family, who

have always supported me in my academic pursuits and have shown patience beyond words.

And to my boys, Pete and Dylan, for their unconditional love and support always.

iv

Table of Contents

Page

ABSTRACT ii

ACKNOWLEDGEMENTS iii

TABLE OF CONTENTS iv

LIST OF TABLES vi

LIST OF FIGURES vii

LIST OF APPENDICES viii

LIST OF ABBREVIATIONS ix

1. INTRODUCTION AND THESIS OVERVIEW 1

2. BACKGROUND 3

2.1 Juvenile idiopathic Arthritis 2.1.1 Epidemiology and Burden of Illness 3

2.1.2 Classification Criteria in JIA 4

2.1.3 Challenges with the clinical application of the International League 4

of Associations for Rheumatology criteria

2.1.4 Difficulties in defining disease course and outcomes in JIA 5

2.1.5 Predictors of Outcomes in JIA 7

2.1.6 Emerging biological evidence for etiologic heterogeneity 8

2.1.7 Summary 8

2.2 Growth Curve modeling 10

2.2.1 Conventional growth modeling 10

2.2.2 Latent variable growth curve modeling 12

2.2.3 Latent class growth analysis versus growth mixture modeling 15

2.2.4 Latent growth variable modeling – model selection 16

2.2.5 Latent growth variable modeling – clinical sensibility 17

2.2.6 Summary 18

3. RATIONALE AND RELEVANCE 20

4. OBJECTIVES AND RESEARCH HYPOTHESIS 22

5. METHODS 23

5.1 Study design and overview 23

5.2 Research Ethics Approval 23

5.3 Study Population 23

5.4 Data Management 24

5.5 Trajectory descriptive variable 25

5.6 Cohort descriptive variables 26

5.7 Analysis 27

v

6. RESULTS 34

6.1. Study population 34

6.1.1. Follow-up times 34

6.1.2. Missing Visits 38

6.1.3. Cohort baseline characteristics 38

6.2. Univariate description of the AJC 40

6.3. Characteristics of patients with no active joint disease 40

6.4. Model Building 42

6.4.1. Selection of the distribution 42

6.4.2. Selection of the best fitting model 43

6.4.2.1. Fit indices 43

6.4.2.2. Classification quality 51

6.4.2.3. Clinical sensibility 53

6.4.2.4. Association of baseline characteristics with trajectories 57

7. DISCUSSION 61

7.1. Overview and strengths of study 61

7.2. Key findings 61

7.2.1. Successful application of a novel longitudinal data modeling technique 61

7.2.2. Clinical interpretation of the trajectory groups and characteristics of the 62

trajectory group members

7.3. Limitations 64

7.3.1. Limitations of the Mplus software 64

7.3.2. Limitations of the study design 65

7.3.3. Limitations of the use of the AJC as outcome measure 67

7.4. Future directions 67

8. REFERENCES 69

9. APPENDICES 73

vi

List of Tables

Page

Table 1. Interpretation of the 2(∆ Bayesian Information Criterion) in model selection 31 Table 2. Comparisons of average active joint count in patients who continue to 37

the next visit to those whom drop out. Table 3. Cohort demographics 39 Table 4. Univariate description of the active joint count at each visit 40 Table 5. Comparison of the baseline demographics of the subgroup of patients 41

with no active joint disease with those with active joint disease

Table 6. Fit statistics for latent curve growth analysis for the normal distribution 42

Table 7. Fit statistic for latent curve growth analysis for the Poisson distribution 42

Table 8. Fit statistics for latent curve growth analysis for the negative binomial 43

distribution Table 9. Fit statistics for latent curve growth analysis for the zero-inflated negative 43

binomial distribution Table 10. Fit statistics for the latent curve growth analysis and growth mixture models 46

under the zero-inflated negative binomial distribution Table 11. Average posterior probabilities for five-class solution 52

Table 12. Average posterior probabilities for four-class solutions 53

Table 13. Summary of model qualities 59

Table 14.Characteristics of study participants, stratified by trajectory 60

vii

List of Figures

Page

Figure 1. Growth curve model 11

Figure 2. Latent curve growth analysis 14

Figure 3. Growth mixture model 14

Figure 4. Kaplan-Meier curve of the proportion of completed visits over time 35

Figure 5. Mean active joint count at visit N by follow-up status at visit N+1 37

Figure 6. Four-class latent curve growth analysis 47

Figure 7. Four-class growth mixutre model(variance of the non-inflated 47

intercept estimated) Figure 8. Four-class growth mixture model(variance of the non-inflated intercept 48

and slope estimated) Figure 9. Five-class latent curve growth analysis 48

Figure 10. Five-class growth mixture model (variance of the non-inflated 49

intercept estimated)

Figure11. Five-class growth mixture model (variance of the non-inflated intercept 49

and slope estimated) Figure 12. Five-class growth mixture model (non-inflated intercept and slope 50

estimated for groups 2 and 5) Figure 13. Six-class latent curve growth analysis 50

Figure 14. Five-class cubic latent curve growth analysis 51

Figure 15. Five-class latent growth curve analysis Persistent high class (9.8%) 54

Figure 16. Five-class latent growth curve analysis. Moderate increasing class (10%) 55

Figure 17. Five-class latent growth curve analysis. Persistent moderate class (18.5%) 55

Figure 18. Five-class latent growth curve analysis. Persistent low class (43.6%) 56

Figure 19. Five-class latent growth curve analysis. No joint activity class (18.2%) 56

Figure 20. Five-class latent growth curve analysis. Growth curves for all five classes 57

based on the estimated means from the model.

viii

List of Appendices

Page

Appendix A: Comparison of the ILAR, JRA and JCA criteria 73

Appendix B: ILAR Classification of JIA: Second Revision, Edmonton, 2001 74

Appendix C: Data abstraction form 76

Appendix D. Flow chart of patients and exclusions 80 Appendix E. Frequency of AJC at each visit. 81

Each unit of timeframe is 6 months of follow-up.

ix

List of abbreviations - clinical

ACR American college of rheumatology

ACRPedi30 Pediatric American college of rheumatology 30 response

AJC active joint count

CHAQ Child health assessment questionnaire

DMARD disease-modifying antirheumatic drug

ESR erythrocyte sedimentation rate

EULAR European league against rheumatism

HLA human leukocyte antigen

ILAR International league of associations for rheumatology

JIA juvenile idiopathic arthritis

JRA juvenile rheumatoid arthritis

RF Rheumatoid factor

RNA ribonucleic acid

ROM range of motion

List of Abbreviations – statistical

AIC Akaïke’s information criterion

BIC Bayesian information criterion

GCM growth curve modeling

GMM growth mixture modeling

LCGA Latent curve growth analysis

LMR LRT Lo-Mendell-Rubin likelihood ratio test

LRT likelihood ratio test

ssABIC sample size adjusted Bayesian information criterion

ZINB zero-inflated negative binomial

1

1. INTRODUCTION AND THESIS OVERVIEW

Juvenile idiopathic arthritis (JIA) is the most common rheumatic disease of childhood. It

is a chronic illness that fluctuates symptomatically over years. In the literature, there is a paucity

of reliable indicators of prognosis and outcome in this disease. This is due, in part, to the

heterogeneity in clinical phenotype, and consequently in the classification system nomenclature

as well as to the lack of a universal definition of remission and outcome. Most importantly, all

of the outcome studies to date have defined an outcome at a fixed point in time. We argue that

this is not the correct approach to a definition of outcome in a chronic, relapsing and remitting

disease. A more appropriate outcome may be the disease course itself. An understanding of the

disease course and its relationship to distal outcomes and genetic and biologic predictors is

crucial in order to correctly define outcome states in JIA.

The aim of this study was to use a novel longitudinal data analysis technique (latent

growth curve modeling) applied to active joint count (surrogate for disease activity) to

characterize the disease course in JIA. We recognize that JIA is a multidimensional disease and

there are other important patient and disease-related factors that are important determinants of

disease activity. If these methods are successful, then other components may be tested.

This thesis is organized as follows: chapter 2 lays out the background relating to

JIA and growth curve modeling. With regard to JIA, challenges with the classification criteria,

difficulties in defining predictors and outcomes and emerging biological evidence for etiologic

heterogeneity will be discussed. Conventional growth curve modeling will be presented in

contrast to latent growth curve modeling. Two types of latent growth curve modeling will be

reviewed: latent class growth analysis and growth mixture modeling. Finally, an approach to

selection of a latent growth curve model based on statistical and clinical sensibility will be

2

discussed. Given the framework and background developed in chapter 2, chapters 3 and 4

present the rationale, relevance and study objectives. Chapter 5 will expand in detail on the

methods used and chapter 6 will present the results. Finally, a discussion of the results, study

limitations, implications and future directions is presented in chapter 7.

3

2. BACKGROUND

2.1 Juvenile idiopathic Arthritis

2.1.1 Epidemiology and Burden of Illness

Juvenile idiopathic arthritis (JIA) is the most common rheumatic disease of childhood. It

is estimated to affect at least 1 in 1000 Canadians under the age of 16, making JIA one of the

most prevalent chronic diseases among children in our country. A recent Canadian pediatric

surveillance program determined an annual incidence rate of 7/100, 000 [1]. A community-

based prevalence study in Australia demonstrated a prevalence of 400 per 100, 000 children [2].

Point prevalence estimates of 52 per 100,000 in Saskatchewan and 32 per 100,000 in Manitoba

have been calculated however there was marked imprecision in the estimates and a reporting bias

(subspecialty reporting only) which likely has resulted in an underestimation of the true burden

of illness in Canada [3].

Although JIA is rarely fatal, it is a chronic illness that can be associated with serious

physical disability for many affected children due to joint damage. Additional morbidity

associated with JIA can be due to its treatment and effect on growth and development. Children

with inadequately treated or recalcitrant JIA may have chronic pain, mood disturbances, and

difficulty with peer relationships, school performance and attainment of educational and

vocational goals [4-9].

In addition to the significant personal cost, an increase in utilization of health care

services has been demonstrated. In a 2007 Canadian study of 155 consecutive clinic attendees,

the total difference in annualized average direct medical costs for children with JIA versus

controls was $ 1,686 (95%CI $875 to $2,500) [10].

4

2.1.2 Classification Criteria in JIA

JIA is not a single disease but rather a diagnosis that applies to arthritis of unknown

origin, persisting for more than 6 weeks and with onset before the age of 16 years[11]. The

most recent proposed classification criteria (International league of associations for

rheumatology (ILAR): second revision, 2001) were developed to delineate relatively

homogeneous, mutually exclusive categories of idiopathic childhood arthritis to aid in the

conduct of research [11]. These criteria addressed the heterogeneity in nomenclature and criteria

between European (juvenile chronic arthritis (JCA)) [12] and North American (juvenile

rheumatoid arthritis (JRA)) classification systems (Appendix A) [13].

This expert, consensus-based classification system is based on clinical characteristics

during the first six months of disease. It is purported to define categories of clinically

homogeneous groups of patients that may demonstrate, to some extent, etiologic and pathogenic

homogeneity and predictability of response to therapies [14-15]. The seven mutually exclusive

categories are systemic arthritis, oligoarthritis, rheumatoid factor (RF) positive polyarthritis, RF

negative polyarthritis, psoriatic arthritis, enthesitis related arthritis and undifferentiated arthritis

(Appendix B)[11]. The ILAR criteria have been recognized as a “work in progress” since their

inception - a framework to which novel biologic predictors of outcome or disease course may be

added.

2.1.3 Challenges with the clinical application of the ILAR criteria

Strictly speaking, the purpose of the ILAR classification criteria in juvenile arthritis is

descriptive [16]. Although this tool was designed to discriminate between individuals, there are

difficulties in applying the criteria inherent in its design. The category of “undifferentiated

5

arthritis” encompasses patients that fulfill criteria in no category or in two or more categories

[11]. This subtype represents between 11 and 21% of patients in studies reporting application of

the criteria [17-18]. There is particularly little known about this subtype as it is poorly

characterized in studies to date. This subgroup also poses unique difficulties for treatment trials

and may bias the results towards finding no effect of treatment when in fact there is one. A

heterogeneous group of patients in a treatment trial is undesirable as any potential efficacy may

be negated based on the inclusion of subjects that have a biological or genetic justification for a

response/lack of response.

Although the subtypes are descriptive terms, certain subtypes confer prognostic

information (i.e. RF positive polyarthritis patients are known to have a more severe disease

course) [19-21]. However information regarding expected disease course within subtypes is

lacking. While some of the subsets clearly identify distinct disease entities that are more

adequately characterized (RF positive polyarthritis, systemic arthritis), others still show marked

clinical variation within a subtype. Therefore, for individual patients, there is a limited ability to

advise or predict the expected disease course[22].

In summary, classification of patients at baseline or after the first six months is not

sufficient to determine outcome and does not adequately inform regarding the course.

2.1.4 Difficulties in defining disease course and outcomes in JIA

Traditional outcomes in JIA include joint damage on radiographs, persistent disease

activity, loss of function and effects on quality of life [9, 19, 23-27]. Outcomes have generally

been evaluated at a fixed-time point and have been modeled as a continuous or dichotomous

variable or are analyzed as a time to a defined event (i.e. remission, disability).

6

A few selected outcome studies highlight the historical challenges in reporting outcomes

in JIA. Data from a Canadian multicentre cohort study demonstrated that 10 years following a

diagnosis of JRA, [13] the probability of remission (defined as two years off medication) was

37% in systemic, 47% in oligoarthritis, 23% in polyarthritis RF negative and 6% in polyarthritis

RF [24]. This is in contrast to a multicentre cohort study in Italy and the USA in which after a

median of 7.7 years following the diagnosis of JIA (only 5 categories considered) [11], 40% of

patients were in remission (defined as six months off medication). However at two years off

medications (the more rigid criteria defined by Oen et al.[24]) only 28% of patients were in

remission. When subtypes were considered, remission was achieved in 60% with systemic

arthritis, in 53% with persistent oligoarthritis, 33% with extended oligoarthritis, 31% with RF

negative polyarthritis and 0% with RF positive polyarthritis. A German population-based cohort

study reporting on long-term outcomes of 260 patients with JIA, used the American college of

rheumatology (ACR) definition of remission [28] and a cutoff of “no medications” for 2 months.

This study found 47% with systemic arthritis, 73% with persistent oligoarthritis, 12% with

extended oligoarthritis, 30% with RF negative polyarthritis and 0% with RF polyarthritis were in

remission.

These three papers highlight a few key issues. First, there has been wide variability in the

reported definitions of remission, classification of patients and outcome intervals. This has

likely been a barrier to the interpretability of this information for clinicians and dissemination to

patients. Second, even within subtypes, variations in disease duration before remission,

disability and radiographic joint damage are evident [29-31] which provides further evidence for

heterogeneity. The historical classification of childhood arthritis does not effectively capture the

clinical and outcome variability present in this population. Third, the current statistical methods

7

for reporting outcome studies measure an outcome at only one time for each patient. Patients'

courses vary so a single measurement may not represent the long-term experience with the

disease given our knowledge of the chronic and fluctuating nature of this illness.

In summary, the difficulties related to defining disease course and outcome are related in

part to heterogeneity in definitions but also to incorrectly identifying an outcome at one point in

time instead of capturing the effect of the disease over time.

2.1.5 Predictors of Outcomes in JIA

The identification of reproducible indicators of outcome (remission, disability, persistent

disease activity and radiologic damage) has been a challenge [19, 23, 27]. Studies to date have

shown inconsistent results likely related to differences in definitions (in subtypes of arthritis and

remission), small sample sizes, varied length of follow-up and lack of consideration for the effect

of medication on outcome. The inconsistency is also in part due to the use of the consensus-

based ILAR classification system which has defined the subtypes based on subjective

combinations of clinical and laboratory features. We proposes that future studies of predictors

should focus on predicting the disease course.

The traditional patient characteristics evaluated in prognostic studies include: age at

onset, sex, race, onset type of JIA, total number and distribution of affected joints, persistence of

systemic features at ≥ 3 months, and uveitis [19-20, 32-34]. More recently, evidence from two

longitudinal cohort studies demonstrated that a delay in diagnosis of JIA may be an important

predictor of outcome [35-36]. Further investigation of the association of a delay in diagnosis to

prognosis is warranted and may prove to be an important predictor of response to treatment and

outcome. Traditional laboratory investigation evaluated in prognostic studies have included

8

rheumatoid factor, anti-nuclear antibodies, HLA-B27, ESR, and in some cases white blood cell

count, hemoglobin and platelets [19-20, 32, 37-40]. Until now, classification systems have not

integrated these traditional prognostic factors. In addition, in the last 10 years, there is growing

evidence for both HLA and non-HLA genetic associations with subtypes of arthritis and outcome

in JIA, which have also not been addressed with the current classification systems [17, 27, 41-

42].

2.1.6 Emerging biological evidence for etiologic heterogeneity

In addition to the evidence for genetic association with subtypes of arthritis, there is a

growing body of literature to support biological variability among the subtypes of JIA. In 2009,

a stimulating editorial entitled “Can molecular profiling predict the future in JIA” highlighted the

significant findings in this field [43]. In the last two years, distinct protein expression profiles in

synovial fluid mononuclear cells have been found to be predictive of a more severe course in

oligoarthritis [44]; also specific RNA gene expression patterns in peripheral blood

polymorphonuclear cells can distinguish RF positive and RF negative polyarthritis [45]. Thus,

biologic evidence supports the notion that JIA is a heterogeneous disease and phenotypic

variability may ultimately be explained, at least in part, on the basis of molecular differences.

2.1.7 Summary

In summary, childhood arthritis is a chronic illness with relapses and remissions that

often continues into adulthood and is a source of morbidity for both the child and family. JIA is

an umbrella term that encompasses several etiologically distinct diseases. There are deficiencies

in our approach to the classification of childhood arthritis that has led to difficulties in defining

9

and reporting of outcomes and predictors of disease course. These difficulties result at least in

part from the multiple revisions and lack of unified criteria as well as the fact that the criteria

remain consensus-based and incorporate only clinical and basic laboratory factors. There is also

plausible biologic evidence for phenotypic heterogeneity that will need to be incorporated into

future classification schemes in an attempt to identify and characterize truly homogeneous

groups of patients.

There is clearly a need for a new multifaceted approach to classification of childhood

arthritis that incorporates genetic, immunologic and biochemical information. The existing

literature about disease course and outcome has several limitations as outlined above. The use of

statistical techniques that quantify outcome based on a patients’ status at a fixed single point in

time contribute at least in part to these limitations. These are not the optimal methods to evaluate

disease outcome when the disease is chronic and fluctuating as in JIA. In order to adequately

characterize the disease course, novel statistical methods to examine longitudinal disease

activity, which properly characterizes the course of this illness, need to be explored.

10

2.2 Growth Curve modeling

In this study we aim to characterize disease course so we may work towards the

development of predictive models that identify disease course as the outcome. This is in contrast

to traditional prognostic models that generally predict an outcome at a fixed time point. To

characterize the disease course, we propose the use of longitudinal growth curve modeling

techniques. There are three longitudinal methods that will be described in sections 2.2.1, 2.2.2

and 2.2.3 these are conventional growth curve modeling (GCM), latent curve growth analysis

(LCGA) and growth mixture modeling (GMM).

2.2.1 Conventional growth modeling

In conventional GCM, it is assumed that a sample is drawn from a single

population characterized by an underlying set of growth parameters (i.e. intercept (mean starting

value), slope and quadratic terms). These parameters describe the average course of an outcome

over age or time (i.e. a growth curve). Individual variation in developmental trajectories is

captured by random effects [46]. For example, if all subjects have a growth curve that has the

same shape, but some have higher than average levels and some have lower than average levels,

we would add a random intercept to the model. If the shape was also different across subjects,

we might consider allowing a random slope. A theoretical conventional growth curve model is

presented in Figure 1.

11

Figure 1. Growth Curve model. The bold line represents the population mean of the outcome

over time. The faint lines represent the individual variation.

We are however, often interested in and deal with samples from multiple populations (i.e.

males and females with a disease, patients without the disease). One can simultaneously model

the course of disease in multiple observed populations using a different growth model for each

population or by including covariates that reflect how the growth curves differ across

populations. However these approaches require a priori knowledge of individuals’ group

membership.

When group membership is not known or if one’s hypothesis is to determine whether

there are distinct sub-populations within a single population, conventional GCM is not sufficient.

GCM in a latent variable framework allows for post-hoc identification and description of

longitudinal change within unobserved sub-populations[47].

12

2.2.2 Latent variable growth curve modeling

The objective of latent variable growth curve modeling is to measure and explain

differences across population members in their developmental trajectories. While the growth

curve modeling approach attributes all heterogeneity between subjects to random variation

around a common mean growth curve, the defining characteristic of the latent variable growth

curve model is that it identifies groups of subjects with different mean growth curves. In the last

10 years there has been a marked increase in the use of the latent growth modeling techniques.

A recent review found that in the PSYC INFO database, between the years of 2000 and 2008, the

number of publications per year increased from 8 to 80 [48]. Historically this methodology has

been used in the fields of social and psychological sciences particularly to examine how social

behaviors unfold over time and to study personality development. There is growing interest in

allopathic medicine as this methodology may be extended to capture heterogeneity in treatment

responses and to further our understanding of the growth and development of medical illnesses

[49-50].

Latent variable growth curve modeling is an elaboration based on a class of statistical

models call finite mixture models [51]. One application of finite mixture models is the post-hoc

identification of subpopulations based on measured characteristics. Classification of individuals

into distinct groups based on individual response patterns so that individuals within a group are

more similar than individuals between groups is the desired output.

In contrast to conventional GCM, the latent growth curve modeling approach allows for

differences in the average values of growth parameters across unobserved subpopulations. This

is accomplished using latent trajectory classes (i.e. categorical latent variables), which allow for

13

different groups of individual growth trajectories to vary around different means [52]. The

results are growth models for each latent class, each with its unique estimates of variances.

Another way of relating conventional GCM to latent variable modeling is in terms of

person and variable-centered approach to data analysis [53]. The focus of GCM is on

relationships among variables (identify significant predictors of outcome and describe how

dependent and independent variables are related). Latent variable modeling uses a person-

centered approach focusing on the relationship between individuals. This type of modeling can

be used for data at one point in time (cross-sectional). An example of a finite mixture model as

applied to cross-sectional data is a study performed by Thomas et al. whereby a novel

classification of JIA was proposed using a latent class analysis (see section 3). This same

approach can be applied to longitudinal data. When finite mixture models are applied to

longitudinal data, the purpose is to identify groups with similar growth curves. Within this class

of models, there are two distinct approaches. The first assumes that everyone in the same group

has exactly the same true growth curve. This is called the LCGA. The second approach allows

that subjects within a group have true growth curves that vary around the mean for the group.

This is called GMM. A theoretical LCGA and GMM are presented in Figures 2 and 3. The

sections below provide more details on LCGA and GMM.

14

Figure 2. Latent curve growth analysis. Each line represents the growth curve for that class. No

individual variability is accounted for with this type of model.

Figure 3. Growth mixture model. Each bold line represents the growth curve for that class.

Individual variability is represented by the faint lines around each bold line.

15

2.2.3 Latent class growth analysis versus growth mixture modeling

LCGA is a special type of GMM, whereby the variance estimates for the growth factors

within each class are assumed to be fixed at zero. By this assumption, all individual growth

trajectories within a class are the same. The LCGA framework has been developed by Nagin

and colleagues [54]. A SAS procedure (PROC TRAJ) has been developed to implement this

method [55-56]. GMM on the other hand uses both continuous latent variables (random effects)

and categorical latent variables (trajectory classes) in the model. Nagin noted that a LCGA may

find a k + m-class solution where a GMM may find a k-class solution given that in GMM, the

intra-class variability is captured by random effects which may produce a more parsimonious

model [57].

In terms of deciding which technique to choose, there is no clear choice. The decision

should be based on the underlying hypothesis regarding the heterogeneity of the study

population. Computationally, the LCGA models sometimes converge more easily and produce a

simpler model than the GMM. Muthen has suggested that both approaches should be used, with

LCGA as a first step to qualitatively asses the number of classes and location of curves. If large

variability is observed for the plots of the individual values around the estimated, class specific

mean curves, it is then proposed to relax the variances in a GMM and reevaluate model fit

indices [58]. Estimating additional growth factors (in addition to the intercept and slope), for

example a quadratic term, will add computational burden, so it is not unusual to see the variance

of the quadratic term fixed to zero to aid in convergence during GMM [52]. Mplus is the

software commonly used for GMM and since LCGA models are a special case of GMM, the

specification of LCGA models is easy with Mplus software[59].

16

2.2.4 Latent growth variable modeling – model selection

The aim of using latent growth curve modeling in our study was to statistically derive

homogeneous subsets (classes) based on the pattern of longitudinal disease activity. Determining

the number of homogenous classes is important to further our understanding of disease course in

JIA. An ideal number of homogenous classes is that which adequately describes clinically

meaningful groups that differ (based on disease progression, features at presentation or response

to therapy). Identification of a large number of small subsets of patients would not be helpful to

clinicians in guiding practice nor would too few.

In the latent growth curve modeling field, statistical determination of the number of latent

classes is an active area of research and debate. There is not one commonly accepted statistical

indicator for deciding on the number of classes in a study population. The commonly used log-

likelihood ratio test cannot be used to test nested latent class models as this test assumes a chi-

square distribution which is not the case in mixture models [60].

Currently, Mplus provides three criteria for assessment of the optimal number of classes

in a GMM: (1) the model with the smallest Bayesian information criterion (BIC) defined as -2

ln(L) + p ln(n) (where ln(L) is the log-likelihood, p is the number of parameters and n is the

sample size) [61]; (2) the model with the smallest sample size adjusted BIC (BIC where

n=(n+2)/24)[62-63]; (3) a significant (p<0.05) Lo-Mendell-Rubin (LMR) adjusted LRT statistic

comparing a model with k classes to a model with k-1 classes[64] . A lower BIC and ssABIC

and a significant LMR-LRT indicate the k-class model is preferred over the k-1 class model.

More recently, further simulations have demonstrated that while the BIC performed best among

the information criteria-based indices, the bootstrapped LRT proved to be a better indicator in

determining the number of classes across all of the models considered for the likelihood-based

17

tests [52]. Nyland et al. have suggested the model with a low BIC value and a significant

LMR p-value comparing the k and k-1 class model should initially guide analysis. As the

computation burden greatly increases in requesting the BLRT, it has been suggested that the

initial steps of model exploration be done with the BIC and LMR p-values to arrive at possible

solutions and then once a few plausible models have been identified, that these models be

reanalyzed with the BLRT[60].

Other considerations include successful model convergence, a high entropy value (near

1.0), no less than 1% of the total count of the sample in a class, and high posterior probabilities

of classification into the latent classes[61]. The entropy value quantifies the uncertainty of

classification of subjects into latent classes. Entropy values range from 0 to 1, with 0

corresponding to randomness and 1 to a perfect classification[62]. In terms of posterior

probabilities, a value of <0.7 is considered a poor fit and >0.9 considered an excellent

classification [57]. As it is not possible to know with certainty to which group a subject belongs,

the posterior probability is used. This probability is computed based on post-model estimations

using parameters estimated from the model and is the probability of class membership in the

groups that make up the model (see section 5.7.3.3).

Importantly, criteria for statistical selection of the number of latent classes and clinical

relevance and utility of the classes should be considered when deciding on the number of latent

trajectory classes.

2.2.5 Latent growth variable modeling – clinical sensibility and danger of overextraction

As noted above, the number of classes should be determined by a combination of factors

in addition to fit indices; including one’s research question, parsimony, theoretical justification

18

and interpretability [63-64]. Bauer and Curran demonstrated in a simulation analysis that a

seemingly modest specification error (i.e. incorrect distribution) may results in the overextraction

of groups in GMM analysis [64]. It is important to be conscious that the trajectory groups are

not literally distinct entities but approximations. Once trajectory groups are identified, it is

important to determine if they differ based on preexisting characteristics, subsequent outcomes,

their response to treatment, or their relationship to trajectories for other outcomes or behaviors

[48]. Essentially, if they differ in none of these characteristics, then the grouping does not serve

a useful purpose.

In particular, when selecting the number of classes to describe patterns of disease course;

the trajectories must be evaluated to ensure they fit with the physicians’ clinical judgment about

disease course. Clinical utility is also important in terms of the classes’ ability to predict distal

outcomes or their relationship to important genetic or biological markers. A model may be

selected based on additional knowledge of its relationship to etiology or outcome even if it does

not meet all statistical criteria (i.e. small class size).

2.2.6 Summary

When the hypothesis pertains to identifying heterogeneity within a study population,

traditional GCM is inadequate. Conventional GCM assume that the growth trajectories of all

individuals can be described using a single set of growth parameters. Novel longitudinal

modeling techniques (GMM and LCGA) allow for differences in growth parameters across

unobserved subpopulations using latent trajectory classes. LCGA estimates a mean growth curve

for each class, but no individual variation around the mean growth curve is allowed. GMM, on

the other hand, combines the features of a random effects model and LCGA by estimating both

19

mean growth curves for each class and individual variation around these curves by estimating

growth factor variances for each class. Selection of the best fitting model is a combination of

both clinical judgment and statistical fit indices. These new techniques are relatively in their

infancy however, showing promise for the examination of study populations comprised of

apparently (but undefined) heterogeneous subjects such as in JIA.

20

3. RATIONALE AND RELEVANCE

In order to adequately characterize childhood arthritis, novel statistical techniques are

required. A study performed 10 years ago by Thomas et al. further demonstrated that there are

unanswered questions regarding the classification of patients with childhood arthritis[65]. Using

cross-sectional data, they reported their use of a latent class analysis to explain the statistically

observed relationship of clinical and laboratory variables to underlying subtypes of JIA. A 7

class model was identified. There was some overlap with ILAR criteria but not complete

agreement. Latent class analysis provided a means for objective analysis of the pattern of

symptom profiles. It was determined that the patterns of joint involvement and ANA were

important in determining the latent classes; these two predictors are not present in the current

classification. In addition, 3 HLA associations were differentially expressed amongst the groups

[66]. This study presented a novel approach to objectively classifying patients into

homogeneous groups based on a cross-sectional symptom profile that identified classes and

predictors distinct from ILAR.

As childhood arthritis is a chronic illness with a relapsing and remitting course, we

believe an objective statistical technique applied to longitudinal data is required to further our

understanding of disease progression patterns. No attempt has been made to classify JIA patient

subgroups based on longitudinal disease course using an objective statistical approach. Growth

mixture modeling is an ideal methodology to characterize the subpopulations in this

heterogeneous disease.

Improved characterization of the disease course and identification of homogenous groups

of patients based on progression patterns is needed. Identification of homogeneous groups of

patients based on disease course will further our understanding of the observed differences in

21

etiology and response to treatment between groups. In addition, a well-characterized disease

course provides a platform upon which novel genetic and biologic information can be evaluated.

Identification of predictive factors for a more severe disease course would allow tailoring of

therapy to those patients at risk of adverse outcomes to prevent joint damage and mitigate

exposure of potentially toxic therapies to those with a mild course. Knowledge of disease course

(and its predictors) will also allow clinicians to better counsel patients regarding expected course

and outcome. Our exploratory analyses will be an important first step in the development of an

outcome-based novel classification system for patients with childhood arthritis.

22

4. OBJECTIVES AND RESEARCH HYPOTHESIS

4.1 Primary objective

The primary objective of this study was to identify statistically and clinically distinct

trajectories of disease activity in children with juvenile arthritis.

4.2 Secondary objectives

The secondary objective of this study was to identify baseline clinical and laboratory

characteristics associated with the identified trajectories of disease activity.

4.3 Research hypothesis

Through the use of a novel statistical approach to longitudinal data modeling, we will

identify distinct patterns of longitudinal disease activity (using active joint count as a surrogate)

in juvenile arthritis in additional to clinical and laboratory characteristics associated with these

trajectories.

23

5. METHODS

5.1 Study design and overview

This was a retrospective cohort study of children with JIA. Data were obtained from 2

Canadian pediatric rheumatology centres (Royal University Hospital in Saskatoon and Health

Sciences Centre, Arthritis Centre, Winnipeg). Clinical data were evaluated to identify distinct

longitudinal active joint count trajectories in this population.

5.2 Research Ethics Approval

Research Ethics Board approval for this study was obtained from The Hospital for Sick

Children, Royal University Hospital, Saskatoon, Saskatchewan, Health Sciences Centre, Arthritis

Centre, Winnipeg Manitoba, and the University of Toronto, Toronto.

5.3 Study Population

5.3.1 Inclusion and Exclusion Criteria

Eligible patients for the study had a confirmed diagnosis (by a pediatric rheumatologist)

of JRA or JIA. Patients were excluded from the study for any of the following reasons: fewer

than three visits with the rheumatologist, no first visit documented in the medical record,

incorrect original diagnosis or the diagnosis of JRA/JIA was made more than 90 days before the

first visit with the rheumatologist. A cutoff of 90 days was chosen to limit the time of potential

medication exposure prior to the first visit with the rheumatologist (study entry).

24

5.3.2 Data collection

A data extraction sheet was developed, version September 17, 2008. (Appendix C). An

ACCESS database was created using the data extraction sheet as a template. All data were

entered directly into the database. Study visit data were collected every six months (± 2 months)

from the time of the first visit until the last visit with the rheumatologist or the end of the data

collection (April 16, 2009 for Saskatoon and July 31, 2009 for Winnipeg).

5.4 Data management

The dataset was examined for extreme observations. In the event of inconsistencies or

outliers, the records were sent back to the site of origin for verification. In addition, the

following variables were created/verified from the dataset:

1) Age at diagnosis (date of diagnosis – date of birth): When age was >16 or <1 at onset of

symptoms, records were verified.

2) Diagnostic delay (date of diagnosis – date of onset of symptoms: For values of >1096

days (3years) or <0 (indicating a data error) from onset of symptoms to diagnosis, records

were verified.

3) Family history of HLA-B27 associated diseases: If the first-degree family history was

positive for one of: ankylosing spondylitis, acute anterior uveitis, reactive arthritis

(Reiter’s syndrome) or inflammatory bowel disease, this variable was coded as “yes.”

4) Diagnosis by ILAR criteria: The ILAR criteria were assigned by the site for the

Saskatoon subjects. The Winnipeg subjects were classified using the ACR JRA criteria.

The diagnoses were reassigned to ILAR by the author using the clinical data in the

25

database. Where there was not sufficient information or a discrepancy was noted, the

records were sent to the site for verification.

5) Active joint counts: In instances where the AJC was 0 at the first visit, the records were

sent for verification. AJC >50 at any visit were also verified. All joint counts were

generated by Dr. K. Oen (Winnipeg) or Dr. A. Rosenberg (Saskatoon) only.

6) Medications: 5 categories of medications were retained for analysis. DMARD

(methotrexate, leflunomide, sulfasalazine, cyclosporine), non-steroidal anti-inflammatory

drugs, corticosteroids (oral and intravenous), intra-articular steroids and biologics

(etanercept, infliximab, anakinra, adalimumab).

5.5 Trajectory descriptor variable - “outcome variable”

The active joint count (AJC) is the main outcome variable. An active joint is defined as

either an effused joint or a joint with loss of range of motion (ROM) with stress pain. The AJC

was chosen as the trajectory descriptor variable because it is a well-recognized marker of disease

activity and is related to damage. It is one of the core set variables used to define improvement

used in assessment of outcomes in clinical trials in juvenile arthritis [67]. A recent Italian study,

demonstrated that in later disease (≥ 10 years), the AJC is moderately correlated with both the

number of joints with restricted range of motion (r = 0.63) and the Childhood Health Assessment

Questionnaire (functional ability assessment) (r = 0.54)[68]. Additionally, an Italian study

examining prognostic factors for radiographic progression, radiographic damage and disability in

JIA reported that at the last follow-up (median 4.5 years, range 2-13.5) in 94 patients with JIA,

the yearly radiographic progression was correlated with the number of active joints (r = -0.41,

26

p<0.0001) [69]. Increased risk of radiographic progression has clearly been related to

cumulative joint inflammation in adult rheumatoid arthritis [70-71].

The inter-examiner reliability for determination of an active joint is generally suboptimal;

the reported kappa values vary depending on the joint but the range is from 0.35-0.65 [72-73].

There is no reported kappa value for the total AJC. Despite this limitation, both in clinical

practice and research, the joint count is the main measure of disease activity. In the context of

this retrospective cohort study, the outcome measure was assessed by the same staff

rheumatologist at each time so the intra-examiner reliability is presumed to be excellent.

Although there may have been variability between examiners, it is unlikely that this will result

in significant misclassification of patients into different ILAR subtypes based on the number of

active joints detected. In addition, there has been the same single rheumatologist in Winnipeg

and Saskatoon for the last 30 years, which spans the entire data collection period.

Furthermore, the AJC is a reliably recorded and easily accessible marker of disease

activity present in the medical charts that could be abstracted for a retrospective study. We

recognize that JIA is a multidimensional disease and there are other important patient and

disease-related factors that are important determinants of disease activity. If these methods are

successful, other components may be tested.

5.6 Descriptive variables

Important baseline demographic and clinical variables were extracted from the charts:

age, sex, race according to Statistics Canada definitions, individual components of the ILAR

subtype criteria, ILAR diagnosis, and autoantibody status (ANA, RF, HLA-B27). All

information was coded as yes/no or by number (categorical variables). The individual

27

components comprising the subtypes of the ILAR criteria were examined, as the ILAR subtypes

by themselves are not sufficient to explain the variability in course and outcome witnessed

within subtypes. In addition, the work by Thomas et al. using LCA demonstrated factors other

than the ILAR criteria (ANA, pattern of joint involvement) were important in determining the

classes [65]. Medications were re-coded into 5 categories as noted above. Medications were

coded in the database as “yes” or “no” at the time of the visit. If an intra-articular steroid

injection occurred between visits, this was indicated as “yes” at the visit following the injection.

5.7 Analysis

5.7.1 Statistical Software

Univariate analyses were conducted with SAS version 9.2 for Windows (SAS institute,

Inc., Carey, NC). Latent curve growth analysis and growth mixture models were conducted with

Mplus version 6.0 (Muthen and Muthen, 1998-2010) [59, 65]. Statistical significance was

defined as a p-value < 0.05.

5.7.2 Descriptive Statistics

To describe the study population characteristics mean (standard deviation) for continuous

variables and proportions for categorical variable were used. When the distribution of the

variable was skewed, median values (interquartile range) were used. A Kaplan-Meier plot was

used to summarize the number of completed visits per patient.

28

5.7.3 Model building

5.7.3.1 Estimation procedure

In Mplus, parameters are estimated by maximum-likelihood estimation using the

expectation maximization (EM) algorithm [74]. EM is an iterative method that alternates

between performing an expectation (E) step, which computes the expectation of the log-

likelihood, evaluated using the current estimate for the latent variables, and maximization (M)

step, which finds parameters maximizing the expected log-likelihood found on the E step. These

parameter estimates are then used to determine the distribution of the latent variables in the next

E step. In other words, the EM algorithm proceeds in an iterative fashion with parameter

estimates from the current step being compared to those from the previous step until the

difference between the estimates becomes smaller than a specified criterion, suggesting that the

program has converged on a maximum of the likelihood function. For multimodal log-

likelihoods (log-likelihoods with more than one mode) it is possible that the EM algorithm will

converge on a local maximum of the observed data likelihood function and that the global

maximum lies elsewhere. Thus, to ensure that the algorithm has converged on a global

maximum, it is suggested that mixture models be fitted using many random starts. Successful

convergence of the EM algorithm is indicated when the same maximum likelihood value is

reached for different sets of starting values [78]. For our analysis, a set of 100 random starting

values in the initial stage and 25 optimizations in the final stage was used. This resulted (for the

majority of models) in replication of the optimal maximum-likelihood value and normal

termination of the models.

29

In determining the number of classes, three model characteristics were evaluated. The

first was the statistic fit indices, the second the classification quality and third the clinical

usefulness [53].

5.7.3.2 Outcome distribution and fit indices

In order to find the model that was statistically robust, a series of quadratic models were

fit sequentially. LCGA was performed as the first step to determine major types of trajectories.

To determine the appropriate distribution of the outcome variable to be used for GMM,

both continuous and count distributions were explored. The normal distribution is defined by a

mean and variance. The Poisson distribution is a count distribution defined by a single

parameter which is both the mean and variance, that is mean = variance. The negative binomial

distribution is an overdispersed Poisson distribution. Overdispersion implies that there is more

variability around the model’s fitted values than is consistent with a Poisson distribution. The

negative binomial regression addresses the issue of overdispersion by including a dispersion

parameter in the estimation procedures [75]. The negative binomial model can be thought of as a

Poisson model with an additional parameter that allows the variance to exceed the mean.

Overdispersion and an excess of zeros are frequently seen with real-life count data. Zero-

inflated count models provide a parsimonious and powerful way to model this type of situation.

Such models assume that the data are a mixture of two separate data generation processes: one

generates only zeros and the other is either a Poisson or a negative binomial data-generating

process. The mean in a zero-inflated count model depends on 2 factors: the probability that the

mean is zero and the mean when it is not zero [75]. The zero-inflated portion provides a way of

modeling the excess zeros and the negative binomial addresses the overdispersion that would be

30

seen if using a Poisson model alone. Both a zero-inflated Poisson and zero-inflated negative

binomial distribution were explored.

The BIC was used to compare the models to determine the number of classes and the

distribution that best described the data. As outlined in section 1.2.4, of the information criterion

based fit indices, the BIC performed the best. The penalty for the BIC is the logarithm of the

number of subjects multiplied by the number of parameters estimated in the model. The idea of

the penalty is to compensate for the increase in the number of parameters in the model (i.e.

model complexity). A good model, according to BIC, has a high likelihood value without using

many parameters (resulting in a low BIC). The distribution with the optimal fit indices and

which was consistent with the univariate distribution of the outcome variable was selected to

continue with the analysis. All models were run with intercept, linear and quadratic terms to

allow for more flexibility in estimating the shape of the trajectories. In instances where the

linear model is sufficient to define the trajectory, the quadratic term would be estimated to be

near 0 (by Mplus).

In order to determine the number of latent classes, models were run sequentially starting

with two-classes. The k+1 model was compared with k model using the LMR LRT in addition

to examining the BIC values. A 2(∆BIC) >2 was considered evidence that the k+1 model was

better than the k model [76]. The interpretation of the 2(∆BIC) as the degree of evidence

favoring the k+1 to the k model is presented in Table 1 [76]. A p-value for the LMR LRT <0.05

was evidence that the k+1 model was a superior fit than the simpler model. Once the k and k+1

models were identified, the BLRT was used to compare the fit.

31

Table 1. Interpretation of the 2(∆ Bayesian Information Criterion) in model selection

2(∆BIC*) Evidence against H0 (k model)

0 to 2 Not worth mentioning

2 to 6 Positive

6-10 Strong

>10 Very strong

*BIC = Bayesian information criterion

After determining the best fitting model using LCGA, GMM was explored. It has been

suggested that the GMM method may allow identification of a more parsimonious model with

one fewer class than a LCGA model with the same order of polynomial[57]. Theoretically,

given our knowledge of the biologic heterogeneity of JIA, allowing for intra-class variability

using a GMM would seem appropriate. The GMM was performed first allowing variability only

in the intercept growth parameter and then with variability in both the intercept and slope

parameters for the non-inflated portion (mean function amongst those who do not have 0

counts) of the model only. The model fit was again assessed using the BIC to compare to the

LCGA analysis for the same number of classes.

5.7.3.3 Classification quality

The classification quality can be determined by examining the posterior probabilities.

Using LCGM, it is not possible to determine definitively an individual’s group membership.

However, it is possible to calculate the probability of his or her membership in the groups that

make up the model. This probability is the called the posterior group membership probability

32

because it is computed based on post-model estimations using parameters estimated from the

model. This probability is computed as follows [77]:

Where (a) p(j|Yi) is the estimated probability that subject i is in latent class j, given the observed

trajectory Yi ; p(Yi|j) is the estimated probability of observing individual i’s actual behavioral

trajectory, Yi, given membership in class j ; (c) πj is the estimated proportion of the population in

class j; and (d) J is the number of latent classes. Each subject has some probability of being in

groups 1 to J and individuals are assigned to groups on the basis of the maximum posterior

probability assignment rule. That is, they are assigned to the group for which their posterior

probability membership is the largest. These calculations are done internally by Mplus.

Entropy was also used as a classification index. This index is used to quantify the

uncertainty of classification of subjects into latent classes. Entropy values range from 0 to 1, with

0 corresponding to randomness and 1 to perfect classification [62].

5.7.3.4 Clinical usefulness

In addition to the formalized criteria above (5.7.3.2 and 5.7.3.3), the aim was to achieve a

model that was sensible given our knowledge of the clinical course of juvenile arthritis. The

usefulness of a model in clinical practice was determined by examining the shape of the

trajectories for similarity and the number of individuals in each class. The shapes and clinical

usefulness of the curves were evaluated by experienced pediatric rheumatologists on the

investigative committee (Drs. Brian Feldman and Rae Yeung). A final model was selected based

33

on the statistical and clinical criteria to use in further analysis. The final model needs to be

evaluated for its relationship to known predictors and distal outcomes to assess its clinical

usefulness. The final model in this study does not represent the “final” answer but rather a

possible solution.

5.7.4 Model Verification

To ensure that the best solution for the final selected model corresponded to the global

optimum rather than a local maximum likelihood solution, the number of random starting values

sets was increased to 300 and 50 final optimizations. Replication of the maximum likelihood

value was considered evidence that the global optimum was obtained.

5.7.5 Association of baseline characteristics with class membership

Class membership for each participant was determined based on the most likely latent

class for each individual given the fitted model (output from Mplus).

To examine the characteristics of the subjects within each trajectory, a chi-square test of

proportions was used for categorical variables and an analysis of variance was used for

continuous variables. A variable with >20% missing values was not included in this analysis.

34

6. RESULTS

6.1 Study population

The initial cohort and flow chart of exclusions is included in Appendix D (Figure I). There

were 1074 subjects eligible for the study. Three hundred and forty-four subjects were excluded

as they had <3 visits with the rheumatologist (55 from Winnipeg, 289 from Saskatoon). No

demographic information was available for these subjects for comparison to study subjects. Data

was collected on 730 subjects; two were excluded as no first visit was documented, 15 for age at

onset of symptoms >16 years, 54 for diagnosis made >90 days before the first visit with the

rheumatologist and one patient was excluded for an incorrect diagnosis. Data from the

remaining 659 subjects (361 from Saskatoon, 298 from Winnipeg) were used for analysis.

6.1.1 Follow-up times

The study participants had variable lengths of follow-up. The maximum length of follow-

up was 23.5 years for 1 patient and the minimum was 18 months (as specified by the exclusion

criteria). There was a steady decline in the total number of subjects still being seen at each

subsequent visit. An arbitrary cut off of 10 years (visit number 20) was chosen as the time period

over which outcomes would be used in the growth curve modeling. At the 10-year time point,

there were 81 visits recorded. The Kaplan-Meier curve in Figure 4 depicts the distribution of the

number of completed visits (i.e. last visit recorded per patient). The median number of

completed visits was nine (4.5 years).

35

Figure 4. Kaplan-Meier plot of the number of completed visits.

As the follow-up time varied among subjects, we sought to examine if longer follow-up

times were related to a lower (or higher) AJC. To examine this, the AJC of study completers

was compared to those patients who dropped out (non-completers). For each patient an average

joint count over time was computed. A completer was defined as a participant who was >18 at

the last visit or a visit occurred <240 days (6 months + 2 months) before the close of the study

(end of data abstraction). A non-completer was defined as a participant who was <18 at the last

visit and did not have a visit within 240 days of the end of the study. There was a total of 38.9%

(256/659) of participants who completed the study. A small but statistically significant

(1=6months)

36

(p=0.002) difference in the mean joint count between the completers (2.5± 4.1) and non-

completers (1.6 ± 2.8) was found.

Secondly, we examined if those who dropped out at a given visit tended to have a lower

(or higher) AJC that those who did not drop out. The purpose was to see if being an early or late

non-completer had any effect on the AJC. We compared the AJC of subjects who dropped out by

the next visit with the AJC of subjects who continued through to the next visit. The difference in

AJC of those who dropped out versus those who continued is shown in Table 2. Overall, there

was a statistically significant difference in the mean AJC at about one-third of the visits; with

those who dropped out being lower than those who continued to the next visit. There is marked

random variation (large standard deviation) and it is unlikely the differences are clinically

significant. Figure 5 depicts the mean AJC at each visit for dropped out or continued to the next

visit. From this figure, there is no obvious trend apart from the later visits (16-20) where it

appears the mean AJC is higher among those who continued, however the number of subjects is

small.

37

Table 2. Comparisons of average active joint count in patients who continue to the next visit to

those whom drop out.

Visit Mean AJC (SD) Mean AJC (SD) p-value

Continue to next visit Drop out

1 2.7 (6.0) No drop outs

2 2.4 (5.8) No drop outs

3 3.6 (7.6) 1.5(2.6) 0.0001

4 1.9 (5.0) 1.7(4.6) 0.7984

5 2.2 (5.8) 2.1(6.0) 0.9227

6 2.0 (5.9) 1.5 (3.4) 0.3478

7 1.9 (5.2) 1.3 (3.6) 0.3205

8 1.9 (4.7) 1.0 (2.4) 0.0496

9 1.9 (5.1) 1.9 (3.7) 0.9274

10 2.3 (5.6) 2.3 (4.8) 0.9891

11 2.7 (7.1) 2.3 (5.6) 0.6888

12 3.3 (8.1) 1.9 (5.5) 0.2546

13 2.2 (5.3) 0.6 (1.9) 0.0021

14 1.7 (4.0) 2.3 (7.4) 0.7264

15 2.0 (5.3) 2.2 (4.7) 0.8784

16 2.1 (4.7) 1.2 (2.6) 0.2148

17 2.4 (5.7) 1.6 (4.3) 0.496

18 2.4 (7.1) 0.6 (1.5) 0.042

19 3.4 (9.9) 0.2 (0.4) 0.0102

SD= standard deviation; AJC = active joint

Figure 5. Mean active joint count at visit N by follow-up status at visit N+1

(1=6 months)

38

6.1.2 Missing Visits

Of the total 8324 visits entered into the database, there were 1318 missing visits (15.8%).

The missing visits were visits that were expected (6 ± 2 months) during the period of follow-up

but did not occur. These values were assumed to be missing at random (MAR), and handled that

way by the Mplus software.

6.1.3 Cohort baseline demographics

The baseline demographics of the 659 study participants are presented in Table 3. Of the

35.7% of participants with the oligoarthritis subtype, 12.8% (30/235) had an extended course.

Medications started before or at the first visit included non-steroidal anti-inflammatory drugs

(NSAIDs) for 394 (60%), DMARD for 20 (3%), oral corticosteroids for 18 (3%) and

intraarticular steroid injections for 70 (11%). No patient was taking biologic therapy at the first

visit. During the follow-up period, 615 (93%) of patients were treated with NSAIDs, 223 (34%)

with intraarticular injections, 94 (14%) with oral corticosteroids, 208 (32%) with DMARD and

34 (5%) with biologic therapies.

39

Table 3. Cohort Demographics*

Characteristic Result

Female sex (n)(%) 402 (61)

ANA positive (n)(%) 286 (45.5)

Age at diagnosis (mean)(SD) 8.9 (4.9)

Diagnostic delay (median)(25th

%-75th

%) 4.9 months (2.2-12.3)

ILAR diagnosis (n)(%)

Systemic arthritis 45 (6.8)

Oligoarthritis 235 (35.7)

Polyarthritis – RF negative 87 (13.2)

Polyarthritis – RF positive 23 (3.5)

Psoriatic arthritis 54 (8.2)

Enthesitis-related arthritis 134 (20.3)

Undifferentiated 81 (12.3)

Individual ILAR criteria (n)(%)

Systemic Fever 44 (6.7)

Psoriasis 46 (8.9)

Dactylitis 33 (7.9)

RF positive 34 (5.6)

Lumbosacral back pain 147 (28.0)

Enthesitis 148 (31.8)

HLA-B27 positive 93 (16.6)

1st degree family history of psoriasis 99 (17.6)

Family history of HLA-B27 associated disease 91 (16.3)

Ethnicity (n)(%)

Caucasian

America∆

Asia

Africa & Caribbean Islands

Other

Missing data

338 (51.3)

53 (8.0)

5 (0.8)

2 (0.3)

3 (0.5)

258 (39.2)

ESR at first visit (median)(25th

%-75th

%) 21.00 (9.0-40.5)

CRP at first visit (median)(25th

%-75th

%) 4.00 (0-12.0)

N= number of subjects; SD=standard deviation; ∆

=Aboriginal Inuit, Aboriginal North American

Indian, Latin American; * The number of subjects with complete data varies by variable

40

6.2 Description of the active joint count

The univariate descriptions of the trajectory descriptor variable (AJC) are presented in

Table 4. This variable did not follow a normal distribution. For all visits, the skewness and

kurtosis values were greater than 2 and tests for non-normality were highly significant (p<0.001).

The distribution was right-skewed and the preponderance of zero values is seen in the bar charts

of the AJC at each visit (Appendix E).

Table 4. Univariate description of the active joint count at each visit

Visit AJC (mean)(SD)

AJC (median)(25

th%-75

th%)

Visit AJC (mean)(SD)

AJC (median)(25

th%-75

th%)

1 2.7 (6.0) 1 (0-2) 11 2.6 (6.9) 0 (0-2)

2 2.4 (5.8) 1 (0-2) 12 3.1 (7.8) 0 (0-2.5)

3 3.4 (7.4) 1 (0-3) 13 1.9 (4.9) 0 (0-1)

4 1.9 (5.0) 0 (0-1) 14 1.8 (4.6) 0 (0-1)

5 2.2 (5.8) 0 (0-1) 15 2.1 (5.2) 0 (0-2)

6 1.9 (5.6) 0 (0-1) 16 2.0 (4.5) 0 (0-2)

7 1.8 (5.0) 0 (0-1) 17 2.3 (5.5) 0 (0-2)

8 1.8 (4.5) 0 (0-1) 18 2.0 (6.4) 0 (0-1)

9 1.9 (4.9) 0 (0-1) 19 2.8 (9.0) 0 (0-2)

10 2.3 (5.5) 0 (0-1) 20 1.9 (4.7) 0 (0-2)

SD = standard deviation; AJC = AJC

6.3 Characteristics of patients with no active joint disease

There were 95 (14.4%) of subjects with no active joint disease documented throughout

their follow-up time. These participants were followed for fewer visits than those with active

joint disease. They tended to have a longer delay to diagnosis, were older and had a higher

proportion of diagnosis of enthesitis-related arthritis (69.5%) and systemic arthritis (10.5 %)

subtypes of JIA than would have been expected based on the size of the group (Table 5). At the

first visit, none of the patients with no joint disease was taking DMARD, had intraarticular

steroids or was taking biological therapy. Fifty-four (56.8%) were taking NSAIDs and four

41

(4.2%) were taking oral corticosteroids; this is not statistically significantly different from the

group with active joint disease (60.2% on NSAIDs, p=0.5721 and 2.5% for corticosteroids,

p=0.3118).

Table 5. Comparison of the baseline demographics of the subgroup of patients with no active

joint disease with those with active joint disease*

Variable Joint disease

(n=564)

No joint disease

(n=95)

p-value

Female sex (n)(%) 364 (64.5) 38(40) <0.0001

ANA positive (n)(%) 272 (48.2) 14 (14.7) <0.0001

Age at diagnosis (mean)(SD) 8.4 ± 4.9 12.0±3.4 <0.001

Diagnostic delay (median)(25th%-75th%) 8.7±11.8 months 19.4±17.3 months <0.001

Number of visits (median)(25th%-75th%) 10 (6-15) 6 (4-9) <0.0001

ILAR diagnosis (n)(%) <0.0001

Systemic arthritis

Oligoarthritis persistent

RF negative polyarthritis

RF positive polyarthritis

Psoriatic arthritis

Enthesitis related arthritis

Undifferentiated

35 (6.2)

233 (41.3)

86 (15.2)

23(4.1)

47 (8.3)

68 (12.1)

72 (12.8)

10 (10.5)

2 (2.1)

1 (1.1)

0

7 (7.4)

66 (69.5)

9 (9.5)

Individual ILAR criteria (n)(%)

Systemic Fever 35 (6.2) 9 (9.5) 0.2508

Psoriasis 37 (6.6) 9 (9.5) 0.6213

Dactylitis 33 (26.3) 0 <0.0001

RF positive 34 (6.0) 0 0.0119

Lumbosacral back pain 87 (15.4) 60 (6.3) <0.0001

Enthesitis 76 (13.5) 72 (7.6) <0.0001

HLA-B27 positive 72 (12.8) 21 (22.1) 0.0473

1st degree relative with psoriasis 88 (15.6) 11 (11.6) 0.1470

Family hx HLA-B27 associated

disease

56 (9.9) 35 (36.8) <0.0001

ESR at first visit (mean)(SD) 31.0±27.3 22.7±26.4 0.0199

CRP at first visit (mean)(SD) 23.5±59.0 4.7±12.0 0.0011

N=number of subjects; SD=standard deviation;

* The number of subjects with complete data varies by variable

42

6.4 Model Building

6.4.1 Selection of the distribution

A series of quadratic LCGA models were fitted testing the normal, negative binomial,

Poisson and zero-inflated negative binomial (ZINB) distributions (Tables 6-9). A zero-inflated

Poisson model did not fit the data. When this distribution was used, no modeling attempts

converged on a replicated log-likelihood value irrespective of the number of random starts. To

determine the distribution that fit the data optimally, the BIC values were compared.

Table 6. Fit statistics of the latent growth curve analysis for the normal distribution

Number of

classes

AIC BIC ssABIC Entropy LMR LRT

p-value

2 39276.2 39397.4 39311.7 0.991 0.1519

3 38779.9 38919.1 38820.7 0.969 0.6234

4 38498.3 38655.5 38544.4 0.975 0.4625

5 38268.5 38443.6 38319.8 0.971 0.2261

AIC = Akaike’s information criterion; BIC=Bayesian information criterion; ssABIC =

sample size adjusted BIC; LMR LRT = Lo-Mendell-Rubin likelihood ratio test

Table 7. Fit statistic of the latent curve growth analysis for the Poisson distribution

Number of

classes

AIC BIC ssABIC Entropy LMT LRT

p-value

2 35856.2 35887.6 35865.4 0.991 0.0272

3 33194.7 33244.1 33209.2 0.974 0.2712

4 31283.3 31350.7 31303.1 0.969 0.0727

5 30586.8 30672.2 30611.8 0.958 0.6542

AIC = Akaike’s information criterion; BIC=Bayesian information criterion;

ssABIC = sample size adjusted BIC; LMR LRT = Lo-Mendell-Rubin likelihood ratio test

43

Table 8. Fit statistics of the latent curve growth analysis for the negative binomial distribution

Number of

classes

AIC BIC ssABIC Entropy LMR LRT

p-value

2 20591.3 20712.5 20626.8 0.874 0

3 20326.7 20465.9 20367.5 0.766 0

4 20238.2 20395.4 20284.3 0.714 0.3364

5 20163.7 20338.8 20215.0 0.791 0.0054

AIC = Akaike’s information criterion; BIC=Bayesian information criterion; ssABIC = sample

size adjusted BIC; LMR LRT = Lo-Mendell-Rubin likelihood ratio test

Table 9. Fit statistics of the latent curve growth analysis for the zero-inflated negative binomial

distribution

Number of

classes

AIC BIC ssABIC Entropy LMT LRT

p-value

2 20530.6 20665.3 20570.1 0.889 0

3 20206.3 20359.0 20251.0 0.789 0

4 20106.2 20276.9 20156.2 0.734 0

5 20005.2 20193.8 20060.4 0.730 0.0482

AIC = Akaike’s information criterion; BIC=Bayesian information criterion; ssABIC = sample

size adjusted BIC; LMR LRT = Lo-Mendell-Rubin likelihood ratio test

The distribution with the best fit to the data, based on fit indices (lowest BIC = 20665.3 for 2

class model) and distribution of the active joint count (section 6.2), was a zero-inflated negative

binomial distribution.

6.4.2 Selection of the number of classes

6.4.2.1 Fit indices

The fit statistics for the LCGA and GMM examined under the ZINB distribution are

presented in Table 10. Quadratic models for 2-6 classes were fit to the data. Considering the

BIC (20193.8) and estimated proportion for the class sizes (0.1-0.44) based on the posterior

probability the five-class LCGA (Figure 9) provides the best solution for the data. The four-class

44

LCGA model (Figure 6) also provides a reasonable solution (BIC 20276.9 class size (proportion)

0.1-0.44). However in choosing the five-class solution, the additional class consists of a

clinically distinct group (“class 2” in Figure 9) comprised of patients with an initial moderate

mean AJC with progression to higher mean AJC after 5 years.

The GMM for the four-class model allowing the variance of the intercept for the non-

inflated portion of the model to estimated provides a clinically sensible solution however the

entropy is low (0.605) and class two in this model represents only 2% of the cohort (Figure 7).

The GMM for the four-class model allowing the variance of the intercept and slope growth

factor for the non-inflated portion of the model to be estimated has a smaller BIC and reasonable

class sizes but the resultant classes are not clinically sensible (Figure 8).

In selecting the final model (five-class LCGA), in addition to the BIC, the LMR-LRT

comparing the five-class LCGA to four-class LCGA was statistically significant (p=0.0482).

The bootstrapped LRT did not produce results (did not converge).

To evaluate if the fit of the quadratic term was sufficient to describe the variability in the

data, a cubic model was fit to the five-class LCGA. The cubic five-class LCGA model results in

a larger BIC (20225.376) and the fit of the estimated curves to the observed did not provide a

better fit (Figure 14). Thus the quadratic five-class LCGA was retained.

Growth mixture models of the five-class solution were run in an exploratory analysis. It

has been suggested that the k-1class GMM (here, four-class) should provide a more

parsimonious model than the k class LCGA [57] however this was the not true for our analysis.

45

The five-class GMM explored were

1) Estimation of the variance of the intercept for the non-inflated portion of the model

(subjects within a class have a different starting point but generally follow the same

trend)

2) Estimation of the variance of the intercept and slope growth factors for the non-

inflated portion of the model (subjects have a different starting point and may follow a

different curve)

3) Estimation of the variance of the intercept and slope growth factor for the non-inflated

portion of the model for class 2 and 5 (large variability was observed in the plots of the

individual values around the estimated class specific mean curves) with variance of all

growth factors for class 1,3 and 4 fixed to zero.

Although the five-class GMM models provided smaller BIC values, the class sizes were

too small (1 class with <5%) and the resulting classes (Figure 10-12) were not clinically sensible.

A six-class LCGA model (k +1 class than makes sense clinically) did have a smaller BIC value

(20148.033) however, the additional class size was small (3%) (Figure 13).

46

Table 10. Fit statistics of the latent curve growth analysis and growth mixture models with the zero-inflated negative binomial

distribution

2-class

LCGA

3-class

LCGA

4-class

LCGA

4-class

GMM

4-class

GMM

5-class

LCGA

5-class

GMM

5-class

GMM

5-class

GMMb

6-class

LCGA

Variance of

growth

factors

I S Q F F F F F F F F F R F F R R F F F F R F F R R F R R F F F F

Ii Si Qi F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F

Sample sizea

(proportion)

C = 1

C = 2

C = 3

C = 4

C = 5

C = 6

0.73

0.26

-

-

-

-

0.25

0.18

0.57

-

-

-

0.18

0.44

0.20

0.18

-

-

0.12

0.02

0.65

0.21

-

-

0.16

0.60

0.13

0.10

-

-

0.44

0.10

0.18

0.18

0.10

-

0.08

0.10

0.63

0.02

0.16

-

0.15

0.13

0.10

0.01

0.60

-

0.17

0.12

0.04

0.32

0.34

-

0.03

0.09

0.17

0.18

0.10

0.42

Fit statistics # parameters

AIC

BIC

ssABIC

Entropy

LMR-LRT

p-value

30

20530.5

20665.3

20570.1

0.889

0.00

34

20206.3

20359.0

20251.0

0.789

0

38

20106.2

20276.9

20156.2

0.734

0

39

19933.6

20108.7

19984.9

0.605

0

41

19858.4

20042.5

19912.3

0.695

0.0215

42

20005.2

20193.8

20060.4

0.730

0.0482

43

19890.5

20083.6

19947.1

0.565

0.1678

45

19846.7

20048.8

19905.9

0.723

0

45

19857.3

20059.4

19916.5

0.596

0.3639

46

19941.5

20148.0

20002.0

0.744

0.0127

(I)=intercept; (S)=slope; (Q)=quadratic; (Ii) = inflated intercept; (Si) = inflated slope; (Qi)=inflated quadratic; (C)=class; (F)= fixed to 0;

(R)=random; a Estimated proportion for the latent classes based on the posterior probabilities;

b variance of all growth factors fixed to 0 for

classes1,3 and 4 but non-inflated intercept and slope estimated for groups 2 and 5.

47

Figure 6. Four-class latent curve growth analysis. Two growth curves are presented for each

class (○ = observed sample means; ∆ = estimated means based on the model). The percent of the

study population in each class is also presented.

Figure 7. Four-class growth mixture model (variance of the non-inflated intercept estimated)

Two growth curves are presented for each class (○ = observed sample means; ∆ = estimated

means based on the model). The percent of the study population in each class is also presented.

Act

ive

join

t co

unt

Time (1 = 6 months)

Time (1=6 months)

Act

ive

join

t co

unt

48

Figure 8. Four-class growth mixture model (variance of the non-inflated intercept and slope

estimated). Two growth curves are presented for each class (○ = observed sample means; ∆ =

estimated means based on the model). The percent of the study population in each class is also

presented.

Figure 9. Five-class latent curve growth analysis. Two growth curves are presented for each



Act

ive

join

t co

unt

Act

ive

join

t co

unt

Time (1 = 6 months)

Time (1=6 months)

49

Figure 10. Five-class growth mixture model (variance of the non-inflated intercept estimated).

Two growth curves are presented for each class (○ = observed sample means; ∆ = estimated

means based on the model). The percent of the study population in each class is also presented.

Figure 11. Five-class growth mixture model (variance of the non-inflated intercept and slope

estimated). Two growth curves are presented for each class (○ = observed sample means; ∆ =

estimated means based on the model). The percent of the study population in each class is also

presented.

Time (1=6 months)

Act

ive

join

t co

unt

Act

ive

join

t co

unt

Time (1=6 months)

50

Figure 12. Five-class growth mixture model (non-inflated intercept and slope estimated for

groups 2 and 5). Two growth curves are presented for each class (○ = observed sample means; ∆

= estimated means based on the model). The percent of the study population in each class is also

presented.

Figure 13. Six-class latent curve growth analysis. Two growth curves are presented for each



Act

ive

join

t co

unt

Time (1=6 months)

Act

ive

join

t co

unt

Time (1 = 6 months)

51

Figure 14. Five-class cubic latent curve growth analysis. Two growth curves are presented for

each class. (○ = observed sample means; ∆ = estimated means based on the model)

6.4.2.2 Classification quality

A second important consideration in the selection of the optimal number of classes is the

classification quality. Based on the fit indices and class size, the four and five class solutions

were superior thus these models were evaluated. The average posterior probabilities of the four

and five class solutions are presented in Table 11 and Table 12. The five-class LCGA had a

mean posterior probability of 0.817 (0.758-0.896) and the four-class LCGA was 0.849 (0.751-

0.959). These were the 2 best fitting models as identified by the BIC and class size in section

6.4.2.1. The four and five-class LCGA models had the highest entropy values of all four and

five class solutions (0.734 and 0.730 respectively) (Table 10).

When considering the classification quality of the model, one must also evaluate the

impact of misclassification. In the five-class LCGA model, the class with the lowest average

posterior probability is class 3 (0.758), of all patients most likely to be in class 3, 23.3% may be

Time (1=6 months)

Act

ive

join

t co

unt

52

classified into class 1. For the four-class model, the class with the lowest average posterior

probability is class 4 (0.751), of all patients most likely to be in class 4, 24% may be classified

into class 2. In both models, the potential for misclassification is from the “no active joint

disease group” into the “minimal active joint disease group (initial mean approximately 1).”

Table 11. Average posterior probabilities for five-class solutions

Five-class LCGA (variance of all growth factors fixed to 0)

Class 1 2 3 4 5

1 0.845 0.004 0.066 0.084 0.001

2 0 0.763 0.00 0.092 0.144

3 0.233 0.001 0.758 0.008 0

4 0.062 0.089 0 0.824 0.024

5 0 0.091 0 0.013 0.896

Five-class GMM (variance of non-inflated intercept estimated)

Class 1 2 3 4 5

1 0.775 0.012 0.199 0 0.015

2 0.002 0.712 0.141 0.007 0.137

3 0.078 0.054 0.741 0.011 0.116

4 0 0.002 0.011 0.950 0.038

5 0 0.087 0.172 0.014 0.728

Five-class GMM (variance of non-inflated intercept and slope estimated)

Class 1 2 3 4 5

1 0.660 0.001 0.001 0.031 0.307

2 0 0.690 0.235 0 0.075

3 0 0.247 0.703 0 0.050

4 0 0 0 1.000 0.000

5 0.055 0.035 0.028 0.001 0.881

Five-class GMM (variance of non-inflated intercept and slope estimated for class 2 and 5)

Class 1 2 3 4 5

1 0.727 0.005 0 0.202 0.066

2 0 0.746 0.051 0 0.203

3 0 0.142 0.736 0 0.122

4 0.061 0.021 0 0.683 0.236

5 0 0.145 0.017 0.065 0.773

Each row contains information for individuals who were most likely to be in the class

represented by that row; LCGA=latent curve growth analysis; GMM=growth mixture model

53

Table 12. Average posterior probabilities for four-class solutions

Four-class LCGA (variance of all growth factors fixed to 0)

Class 1 2 3 4

1 0.959 0 0.041 0

2 0.002 0.848 0.085 0.065

3 0.096 0.066 0.838 0

4 0.001 0.240 0.008 0.751

Four-class GMM (variance of non-inflated intercept estimated)

Class 1 2 3 4

1 0.702 0.006 0.091 0.202

2 0.003 0.951 0.003 0.043

3 0.044 0.010 0.823 0.123

4 0.174 0.016 0.152 0.657

Four-class GMM (variance of non-inflated intercept and slope estimated)

Class 1 2 3 4

1 0.690 0.308 0.001 0.001

2 0.054 0.882 0.035 0.029

3 0 0.069 0.699 0.232

4 0 0.051 0.247 0.702

Each row contains information for individuals who were most likely to be in the class

represented by that row; LCGA=latent curve growth analysis; GMM=growth mixture model

6.4.2.3 Clinical sensibility

Finally and most importantly, the model selection must be considered in terms of clinical

sensibility and usefulness. As outlined above, the final modeled selection represents a possible

solution but this needs to be considered in relation to important predictors (clinical, genetic) and

to distal outcomes. A summary of the model fit qualities is presented in Table 13. This table

highlights the final two models – four-class LCGA and five-class LCGA. The five-class LCGA

model was selected based on clinical relevance and usefulness. The five-class model provides

one further clinically meaningful class (“Class 2”, Figure 9) that represents 10% of the total

population. Given our clinical knowledge of the disease course, this class represented a subset of

patients that present with moderate active joint disease but progress to a more aggressive disease

course. The four-class growth mixture model with the variance of the non-inflated slope

54

estimated could also fit with physicians’ impression of disease course however the entropy and

posterior probabilities are lower and “Class 2” represents only 2% of the population (Figure 7) .

For these reasons, the five-class LCGA was retained for further analysis. The growth curves of

the estimated means based on the model for each class are presented in Figures 15-19 and in

Figure 20; all five classes are presented together. The five classes are described below with

clinical relevance are described below:

1. “Persistent high” (9.8% of study population) – initial mild polyarthritis (mean AJC 14.1)

followed by a gradual decrease in mean AJC over years.

Figure 15. Five-class latent growth curve analysis. Growth curve of estimated class means

based on the model. “Persistent high class” (9.8%)

55

2. “Moderate increasing” (10% of study population) - initial mean AJC of 5.5 followed by an

increasing AJC at 5 years (mean AJC 9.7).

Figure 16. Five-class latent growth curve analysis. Growth curve of estimated class means based

on the model. “Moderate increasing class” (10%)

3. “Persistent moderate” (18.5% of study population) – initial mean AJC 3.2 followed by

persistent moderate AJC)


on the model. “Persistent moderate class” (18.5%)

56

4. “Persistent low” (43.6% of study population) - minimal disease activity (initial mean AJC

0.9) followed by improvement.


on the model. “Persistent low” (43.6%)

5. “No joint activity” (18.2% of study population) - minimal to no active joint disease

throughout course (initial mean AJC 0.3).


on the model. “No joint activity” (18.2%)

57

Figure 20. Five-class latent growth curve analysis. Growth curves for all five classes based on

the estimated sample means from the model. Class size is also presented.

6.4.2.4 Association of baseline characteristics with identified trajectories

The association of baseline characteristics, stratified by class is presented in Table 14.

The evaluation of differences based on preexisting characteristics is important to consider in

evaluating clinical usefulness of models [48]. Enthesitis, dactylitis, ethnicity, ESR and CRP

were not considered in this analysis as these variables had >20% missing values. There was a

statistically significant difference between the classes for all variables considered (p-values for

F-test or χ2 all < 0.05) except for psoriasis (p=0.2622).

The persistent low class contained the highest proportion of the ILAR subtype

oligoarthritis-persistent. The proportion of ANA positive subjects was higher in the persistent

low, moderate increasing and persistent moderate classes than the persistent high or no joint

activity classes. Polyarthritis RF negative and positive patients were mostly in the moderate

increasing and persistent high classes that are clearly two distinct trajectories. The no joint

activity class participants were more likely to have lumbosacral back pain, be HLA-B27 positive

58

and have a family history of HLA-B27 associated diseases but less likely to be ANA positive.

This class contained a significant portion of subjects with systemic arthritis however as did the

persistent high class. The persistent moderate group contained a higher proportion of patients

with diagnoses of RF negative polyarthritis and psoriatic arthritis.

The most significant finding when examining the characteristics of the subjects in each

class is that the ILAR subtypes are dispersed among the classes. This finding supports the

concept that the variability in disease course over time is not adequately explained by the ILAR

classification alone.

59

Table 13. Summary of model qualities

Quality

∆AICa

∆BICa Class size

adequate

LMR LRT

p-value

Entropy Posterior Probability

(mean)(range)

Clinical

sensibility

Model

2-class LCGA Y Y Y Y Y 0.968 (0.960-0.975) N

3-class LCGA Y Y Y Y Y 0.907 (0.850-0.955) N

4-class LCGA Y Y Y Y Y 0.849 (0.751-0.959) Y

4-class GMM (variance I estimated)

Y Y N Y N 0.783 (0.657-0.951) Y

4-class GMM (variance I/S estimated)

Y Y Y Y Y 0.743 (0.690-0.882) N

5-class LCGA Y Y Y Y Y 0.817 (0.763-0.896) Y

5-class GMM (variance I estimated)

Y Y N Y N 0.781 (0.712-0.950) N

5-class GMM (variance I/S estimated)

Y Y N Y Y 0.787 (0.690-1.0) N

5-class GMM (variance I/S estimated for class 2/5)

Y Y N Y N 0.733 (0.683-0.773) N

6-class LCGA Y Y N Y Y 0.807 (0.731-0.901)

(I)=non-inflated intercept; (S)=non-inflated slope; AIC= Akaike’s information criterion; BIC=Bayesian information criterion;

LMR LRT= Lo=Mendell-Rubin likelihood ratio test; aAIC/BIC significantly better than k-1 model

60

Table 14. Characteristics of the study participants, stratified by trajectory *

Persistent

low

Moderate

increasing

No joint activity Persistent

moderate

Persistent

high

Number of subjects in class 295 65 133 108 58

Variable p-value

Female sex (n/N)(%) 175 (59.3) 46 (70.8) 61 (45.9) 79 (73.2) 41 (70.7) <0.0001

Age at diagnosis (mean)(SD) 8.5 (4.8) 10.7 (4.7) 10.0 (4.7) 7.7 (5.3) 8.8 (4.7) <0.0001

ANA positive (n/N)(%) 137/277 (49.5) 32/61 (52.5) 32/130 (24.6) 59/103 (57.3) 26/58 (44.8) <0.0001

Diagnostic delay, months (median)(25th%-75th%) 4.3 (2-10.2) 5.2 (3.2-13.1) 10.1 (2.8-24.4) 4.4 (2.4-11.7) 4.4 (2.6-9.5) 0.0004

ILAR diagnosis (n/N)(%) < 0.0001

Systemic arthritis

Oligoarthritis

RF negative polyarthritis

RF positive polyarthritis

Psoriatic arthritis

Enthesitis related arthritis

Undifferentiated

14 (4.8)

166 (56.3)

12 (4.1)

1 (0.3)

16 (5.4)

49 (16.6)

37 (12.5)

4 (6.2)

4 (6.2)

27 (41.5)

8 (12.3)

6 (9.2)

5 (7.7)

11 (16.9)

12 (9.0)

27 (20.3)

1(0.8)

0

9 (6.8)

70 (52.6)

14 (10.5)

6 (5.6)

35 (32.4)

26 (24.1)

4 (3.7)

19 (17.6)

8 (7.4)

10 (9.3)

9 (15.5)

3 (5.2)

21 (36.2)

10 (17.2)

4 (6.9)

2 (3.5)

9 (15.5)

No active joint disease 0 0 95 (100) 0 0 <0.0001

Individual ILAR criteria (n/N)(%)

Systemic Fever 11/292 (3.8) 4/65 (6.2) 12/133 (9.0) 7/108 (6.5) 11/57 (19.3) 0.0084

Psoriasis 15/220 (6.8) 3/43(7.0) 12/124 (9.7) 13/89 (14.6) 3/43 (7.0) 0.2622

RF positive 6/267 (2.2) 10/60 (16.7) 0 5/98 (5.1) 13/58 (22.4) <0.0001

Lumbosacral back pain 52/232 (22.4) 6/49 (12.2) 65/127 (51.2) 16/77 (20.8) 8/41 (19.5) <0.0001

HLA-B27 positive 47/242 (19.4) 2/50 (4.0) 23/125 (18.4) 18/93 (19.4) 3/49 (6.1) 0.0190

1st degree family hx psoriasis 34/251 (13.5) 15/52 (28.8) 17/126 (13.5) 25/86 (29.1) 8/49 (16.3) 0.0022

Family hx HLA-B27 assoc disease 35/248 (14.1) 7/50 (14.0) 39/129 (30.2) 9/81 (11.1) 1/50 (2.0) <0.0001

Treatment during follow-up (n/N)(%)

DMARD

Biologic therapy

Intraarticular injection

Oral corticosteroid therapy

65 (22.0)

9 (3.1)

129 (43.7)

33 (11.2)

35 (53.9)

6 (9.2)

15 (23.1)

13 (20.0)

4 (3.0)

2 (1.5)

17 (12.8)

11 (8.3)

58 (53.7)

7 (6.5)

49 (45.4)

14 (13.0)

46 (79.3)

10 (17.2)

13 (22.4)

23 (39.7)

<0.0001

<0.0001

<0.0001

<0.0001

* Percentages are expressed as % of class; n/N= number positive/number tested, when N is not specified there was no missing data for the variable and N = class

size

61

7. DISCUSSION

7.1 Overview and strengths of study

In this 10-year retrospective cohort study (8324 patient visits), we explored the use of a

novel statistical approach to characterize disease course in JIA. This study is the first to use a

longitudinal growth modeling technique to identify homogenous subsets (classes) of patients that

follow a similar pattern of disease activity over time. As outlined above, we argue that in a

chronic and fluctuating illness like JIA, the appropriate outcome measure is disease course rather

than a fixed outcome assessed at one point in time. We identified 5 clinically and statistically

distinct trajectories of disease course. The subsets of patients within each class were different

from those described by the ILAR classification criteria. The results of this study are important

and provide evidence that clinical heterogeneity observed between and within the ILAR subtypes

is not adequately captured by the ILAR criteria.

7.2 Key findings

7.2.1 Successful application of a novel longitudinal data modeling technique

The current study successfully used a novel approach to longitudinal growth curve

modeling. The main objective of this study was to apply latent growth curve modeling

methodology to characterize the disease course in JIA. This first attempt used the active joint

count as the trajectory descriptive variable, however, future work needs to examine other

components of disease activity. Despite the fact that latent growth curve modeling is in its

infancy, it does show promise in the identification of homogeneous subsets within our

heterogeneous population. The results from this study support the use of this methodology in a

chronic disease with a fluctuating course like JIA.

62

A five-class latent curve growth analysis solution was chosen as the final model. It

should be reiterated that the selected model in our study does not represent the “final” answer but

one possible solution. The five classes need to be evaluated to see if they differ based on

preexisting characteristics, subsequent outcomes or their response to treatment. These aspects are

most important to determine clinical usefulness. Within the scope of this study, the baseline

characteristics were evaluated with univariate techniques. However, the association of a

trajectory class with biological predictors, response to treatment or to a more distal outcome

needs to be addressed.

7.2.2 Clinical interpretation of the trajectory groups and characteristics of the trajectory

group members

The persistent low and no joint activity classes may represent those subjects whose joint

disease was relatively easy to control with therapy. It is also probable for the no joint disease

group that there are other aspects of the disease (enthesitis, fever, rash, inflammatory back pain)

not captured by using active joint count as a marker of disease activity. The moderate persistent

class represents those that may have required ongoing or intensification in therapy. The

moderate increasing and persistent high groups were characterized by a more refractory disease;

these groups may represent those subjects who required an escalation in therapy (Figures 15 &

16).

If we consider the distribution of the ILAR criteria, certain subtypes are primarily found

in one class (oligoarthritis) whereas others are dispersed among several classes (systemic and

polyarthritis) (Table 14). This is consistent with clinicians’ impression regarding the

heterogeneity of the disease course of patients within a class. An observational study has

63

supported this concept. This small study (n=45) of systemic arthritis patients described the

course of disease as monophasic (one episode of active disease not lasting more than 24 months),

polycyclic (≥ 1 active and inactive disease) and persistent (active disease for more than 24

months). In this study active disease was any of joint disease, systemic features or inflammatory

blood work [39].

With regard to the polyarthritis RF negative patients, at least two distinct subtypes are

recognized. The first is a form similar to adult-onset RF negative RA and is characterized by

symmetric synovitis of large and small joints, onset in school age and negative ANA. The second

resembles oligoarthritis (asymmetric arthritis, early age at onset, female predominance, ANA

positivity) except for the number of joints affected in the first six months [22, 78]. This is

consistent with our results as the RF negative polyarthritis patients are distributed among three

classes (moderate increasing, persistent moderate and persistent high).

The oligoarthritis patients are primarily in the persistent low class. A long-term outcome

study reporting on 207 oligoarticular onset patients, found that at the end of 6 years of follow-up,

the probability of a polyarticular course was 50%. [31]. Our cohort had 12.8% (30/235) of the

oligoarthritis patients who followed a polyarticular course. There were only 4 (6.2%)

oligoarthritis patients in the moderately increasing group. There are a few potential reasons why

this occurred. First, if the polyarticular extension was mild (low total active joint count), this

may not have been detected by the LCGA that identifies average change over time within a

group. Second, if the patient was treated and quickly returned to low active joint counts, this

rapid transition may not have been detected with 6 monthly visits as the effect of medication was

not accounted for in our study.

64

A few groups have proposed that psoriatic subtype should not be part of the classification

criteria as the course of oligoarticular and polyarticular onset psoriatic arthritis patients may not

be clinically different from the oligoarthritis and polyarthritis (non-psoriatic) subtype patients

[78-79] . We found that the proportion of patient with psoriasis in each class was not

significantly different between the classes supporting the concept that the presence of psoriasis is

not sufficient to determine a homogeneous group of patients (p=0.2622).

It would be important to assess the baseline characteristics in a multinomial predictive

model to determine the key factors that determine membership in a trajectory. This would

provide an unbiased estimate of the effect of each of the predictors while controlling for the

known confounders. Novel genetic and biologic markers could similarly be assessed in this way.

7.3 Limitations

As with all studies, our study has limitations that are important to address. These include

limitations related to the statistical software, retrospective study design, issues of data

availability and quality, lack of account for the effect of medication on trajectories and the use of

the AJC to describe the disease course.

7.3.1 Limitations of the Mplus software

The Mplus software was developed 10-years ago and is in its sixth version but this software

is not as user friendly as other commercially available software. The user guide provides limited

information for language to code the models and provides virtually no theoretical background

information [59]. The developer of the software, however, is quite helpful to troubleshoot technical

aspects. There were still difficulties encountered in deciding on the correct distribution to describe

65

the data, issues with model selection and model non-convergence. Two illustrative examples are 1)

the zero-inflated Poisson distribution did not converge for any model and 2) the BLRT failed to

execute. The BLRT is proposed to be the optimal method to choose the number of latent classes

between the final two models [60]. In a private correspondence, Muthen suggested using only the

BIC and not the bootstrapped LRT for “models of this complexity” [80]. When a model fails to

execute, it is not possible to determine if it related to the data or to the computational burden on the

software. These technical difficulties highlight the relative immaturity of this software for latent

growth curve modeling.

In addition to technical difficulties, it is a challenge to properly navigate the process of model

building with the latent growth curve approach. When exploring the random effects models (growth

mixture modeling), there is a multiplicity of combinations of random intercept, slope and quadratic

terms. Furthermore, when adding more than a random intercept and slope term to the model, the

interpretation of the results becomes increasingly more difficult. Concrete guidelines for model

building need to be established to ensure a parsimonious and reproducible model is found.

7.3.2 Limitations of the study design

This longitudinal cohort study was a retrospective chart review. As such, there is a lack of

control over how and what data were originally recorded which may lead to inaccurate data.

Additionally, as with any study, missing data is a concern. The study visits were arbitrary and set to 6

months +/- 2 months from the previous visit. If a visit did not occur during this interval, a missed

visit was recorded. There were 15.8% (1318/8254) missing visits. However, for the visits that were

recorded, the outcome variable (AJC) was missing in only 33. Given a cohort of this size, there was a

relatively small number of missing information for the AJC, which was the major focus of this study.

66

There was a significant percentage of missing data for the baseline characteristics of the

study subjects. The variables ESR, CRP, enthesitis and dactylitis had >25% missing and were not

evaluated in the univariate analysis. The following variables had 14-20% missing data: psoriasis,

family history of HLA-B27 associated diseases, family history of psoriasis, lumbosacral back pain

and HLA-B27 positivity). Certainly, the lack of precision in documenting this information could

have impaired the detection of true relationships between the baseline predictors and class

membership.

Medication data was not included in the latent growth curve modeling. This was due to

technical complexity and interpretability of the results of a time-varying covariate in the context of

the relatively immature latent growth curve modeling technique and software. Medication exposure

is an important determinant of the shape of the trajectory as well as individual class membership. The

results of our study do not reflect the natural history of JIA but rather a treated group of patients.

To minimize the effect of medications started before study entry, we sought to create an

inception cohort that was treatment naïve. Despite the exclusion criteria of patients that were

diagnosed more than 90 days before seeing the rheumatologist, at the first visit 60% were taking

NSAIDs, 3% DMARD, 3% oral corticosteroids for 18 (3%) and 11% had intraarticular steroid

injections. There were 95 subjects with no joint disease throughout their follow-up time and many

zeros at each time point for the other 564 subjects (Appendix D). The shape of trajectories is

affected by medication however, medication exposure occurs as a result of disease activity that

required more or less treatment. Thus, the class shape we observed is confounded by the effect of

medication. It is impossible to separate the relative contribution of medication effect and disease

activity in determining class membership without properly accounting for the effect of medication in

the latent growth curve models.

67

7.3.3 Limitation of AJC as outcome measure

JIA is a multidimensional disease and the active joint count does not represent all important

aspects of disease activity. Other important aspects of disease to consider include components of the

ACR JIA disease core set (Child health assessment questionnaire, Parent/patient global assessment,

Physician global assessment, ESR) [67]. Additional factors that may also be important for disease

activity are the size and symmetry of the joints involved [34]. For patients with systemic onset JIA,

both systemic features and inflammatory markers are important determinants of disease activity and

disease course [32]. In using the active joint count as the disease activity measure, the course of

disease in those subjects whose disease manifested as enthesitis, uveitis or systemic features would

not be adequately characterized.

Given the successful application of this technique using the active joint count as the

prototype, other components of disease activity can now be evaluated. The other components could

be used as the trajectory descriptive variable or as a covariate in the model with the active joint

count. Muthen has suggested that an analysis without covariates can be useful to study different

growth in different trajectory classes. However, it is expected that the class distribution or

individual classification will not remain the same when adding covariates [58].

7.4 Future directions

Future studies should address the identified shortcomings with a prospective cohort study

to minimize missing data as well as collect information for other important aspects of disease

activity. In addition, the final few models (five-class latent curve growth analysis, four-class

latent curve growth analysis and four-class growth mixture model with variance of the non-

inflated intercept estimate) need further evaluation. The latent classes from these models should

be evaluated for their predictive ability for distal outcomes and for their relationship to important

68

biological predictors. For each model, the clinical usefulness needs to be evaluated by

experienced clinicians in pediatric rheumatology.

This method should be replicated in another cohort to evaluate for consistency in the

shape and number of trajectories. Data from a large prospective Canadian multicenter cohort

study (Research in Arthritis in Canadian Children focusing on outcome, ReACCH-Out) [18] will

be used to validate the technique and in particular, the five-class solution. Importantly, the effect

of medication on shape and number of trajectories needs to be evaluated in future studies.

To enable longitudinal growth curve modeling to be used more commonly in clinical

medicine further work needs to be done. Formalized criteria need to be developed to aid in

model building and statistical model selection processes. Additionally, there needs to be more

documentation for the software program to inform users of these techniques.

69

8. REFERENCES

1. Tucker, L.B., et al. Canadian National Surveillance for Juvenile Idiopathic Arthritis. in Arthritis & Rheumatism. 2008.

2. Manners, P. and D. Diepeveen, Prevalence of juvenile chronic arthritis in a population of 12-year-old children in urban Austrailia. Pediatrics, 1996. 98: p. 84-90.

3. Malleson, P.N., M.Y. Fung, and A.M. Rosenberg, The incidence of pediatric rheumatic diseases: results from the Canadian Pediatric Rheumatology Association Disease Registry. Journal of Rheumatology, 1996. 23(11): p. 1981-7.

4. Packham, J.C. and M.A. Hall, Long-term follow-up of 246 adults with juvenile idiopathic arthritis: education and employment. Rheumatology, 2002. 41(12): p. 1436-9.

5. Packham, J.C. and M.A. Hall, Long-term follow-up of 246 adults with juvenile idiopathic arthritis: functional outcome. Rheumatology, 2002. 41(12): p. 1428-35.

6. Packham, J.C., M.A. Hall, and T.J. Pimm, Long-term follow-up of 246 adults with juvenile idiopathic arthritis: predictive factors for mood and pain. Rheumatology, 2002. 41(12): p. 1444-9.

7. Foster, H.E., et al., Outcome in adults with juvenile idiopathic arthritis: a quality of life study. Arthritis & Rheumatism, 2003. 48(3): p. 767-75.

8. Packham, J.C. and M.A. Hall, Long-term follow-up of 246 adults with juvenile idiopathic arthritis: social function, relationships and sexual activity. Rheumatology, 2002. 41(12): p. 1440-3.

9. Minden, K., et al., Long-term outcome in patients with juvenile idiopathic arthritis. Arthritis Rheum, 2002. 46(9): p. 2392-401.

10. Bernatsky, S., et al., Economic impact of juvenile idiopathic arthritis. Arthritis & Rheumatism, 2007. 57(1): p. 44-8.

11. Petty, R.E., et al., International League of Associations for Rheumatology classification of juvenile idiopathic arthritis: second revision, Edmonton, 2001. J Rheumatol, 2004. 31(2): p. 390-2.

12. European League Against Rheumatism. EULAR Bulletin No. 4: nomenclature and classification of arthritis in children. 1977, National Zeitung AG: Basel.

13. Brewer, E.J., et al., Current proposed revision of JRA criteria. Arthritis & Rheumatism, 1977. 20(Supplement 2): p. 195-9.

14. Petty, R. and T. Southwood, Classifcation of childhood arthritis:divide and conquer. Journal of Rheumatology, 1998. 25(10): p. 1869-70.

15. Petty, R.E., Exclusivity versus the hierarchy, or fear and loathing of the undefined. Journal of Rheumatology, 2003. 30: p. 1663-4.

16. Kirshner B and G. G, A methodological framework for assessing health indices. J Chronic Dis, 1985. 38(1): p. 27-36.

17. Ravelli, A. and A. Martini, Juvenile idiopathic arthritis. Lancet, 2007. 369: p. 767-78. 18. Oen, K., et al., Predictors of early inactive disease in a juvenile idiopathic arthritis cohort: results

of a Canadian multicenter, prospective inception cohort study. Arthritis & Rheumatism, 2009. 61(8): p. 1077-86.

19. Adib, N., A. Silman, and W. Thomson, Outcome following onset of juvenile idiopathic inflammatory arthritis: II. predictors of outcome in juvenile arthritis. Rheumatology (Oxford), 2005. 44(8): p. 1002-7.

20. Oen, K., Long-term outcomes and predictors of outcomes for patients with juvenile idiopathic arthritis. Best Practice & Research in Clinical Rheumatology, 2002. 16(3): p. 347-60.

70

21. Selvaag, A.M., et al., Early disease course and predictors of disability in juvenile rheumatoid arthritis and juvenile spondyloarthropathy: a 3 year prospective study. Journal of Rheumatology, 2005. 32(6): p. 1122-30.

22. Martini A and L. D.J., Juvenile idiopathic arthritis: state of the art and future perspectives. Ann Rheum Dis, 2010. 69(7): p. 1260-3.

23. Adib, N., A. Silman, and W. Thomson, Outcome following onset of juvenile idiopathic inflammatory arthritis: I. frequency of different outcomes. Rheumatology, 2005. 44(8): p. 995-1001.

24. Oen, K., et al., Disease course and outcome of juvenile rheumatoid arthritis in a multicenter cohort. Journal of Rheumatology, 2002. 29(9): p. 1989-99.

25. Bowyer, S.L., et al., Health status of patients with juvenile rheumatoid arthritis at 1 and 5 years after diagnosis. Journal of Rheumatology, 2003. 30(2): p. 394-400.

26. Ruperto, N., et al., Long-term health outcomes and quality of life in American and Italian inception cohorts of patients with juvenile rheumatoid arthritis. II. Early predictors of outcome. Journal of Rheumatology, 1997. 24(5): p. 952-8.

27. Oen, K., et al., Early predictors of longterm outcome in patients with juvenile rheumatoid arthritis: subset-specific correlations. Journal of Rheumatology, 2003. 30(3): p. 585-93.

28. Pinals, R.S., A.T. Masi, and R.A. Larsen, Preliminary criteria for clinical remission in rheumatoid arthritis. Arthritis & Rheumatism, 1981. 24(10): p. 1308-15.

29. Schneider, R., et al., Prognostic indicators of joint destruction in systemic-onset juvenile rheumatoid arthritis. J Pediatr, 1992. 120(2 Pt 1): p. 200-5.

30. Flato, B., et al., Prognostic factors in juvenile rheumatoid arthritis: a case-control study revealing early predictors and outcome after 14.9 years. J Rheumatol, 2003. 30(2): p. 386-93.

31. Guillaume, S., et al., Long-term outcome and prognosis in oligoarticular-onset juvenile idiopathic arthritis. Arthritis Rheum, 2000. 43(8): p. 1858-65.

32. Spiegel, L.R., et al., Early predictors of poor functional outcome in systemic-onset juvenile rheumatoid arthritis: a multicenter cohort study. Arthritis & Rheumatism, 2000. 43(11): p. 2402-9.

33. Al-Matar, M.J., et al., The early pattern of joint involvement predicts disease progression in children with oligoarticular (pauciarticular) juvenile rheumatoid arthritis. Arthritis Rheum, 2002. 46(10): p. 2708-15.

34. Berntson, L., et al., Assessment of disease activity in juvenile idiopathic arthritis. The number and the size of joints matter. J Rheumatol, 2007. 34(10): p. 2106-11.

35. Shiff, N.J., et al., Factors Associated with a Longer Time to Access Pediatric Rheumatologists in Canadian Children with Juvenile Idiopathic Arthritis. Journal of Rheumatology, 2010.

36. Foster, H.E., et al., Delay in access to appropriate care for children presenting with musculoskeletal symptoms and ultimately diagnosed with juvenile idiopathic arthritis. Arthritis & Rheumatism, 2007. 57(6): p. 921-7.

37. Ravelli, A., et al., Patients with antinuclear antibody-positive juvenile idiopathic arthritis constitute a homogeneous subgroup irrespective of the course of joint disease. Arthritis Rheum, 2005. 52(3): p. 826-32.

38. Felici, E., et al., Course of joint disease in patients with antinuclear antibody-positive juvenile idiopathic arthritis. J Rheumatol, 2005. 32(9): p. 1805-10.

39. Singh-Grewal, D., et al., Predictors of disease course and remission in systemic juvenile idiopathic arthritis: significance of early clinical and laboratory features. Arthritis & Rheumatism, 2006. 54(5): p. 1595-601.

40. Berntson, L., et al., HLA-B27 predicts a more extended disease with increasing age at onset in boys with juvenile idiopathic arthritis. J Rheumatol, 2008. 35(10): p. 2055-61.

71

41. Thomson, W., et al., Juvenile idiopathic arthritis classified by the ILAR criteria: HLA associations in UK patients. Rheumatology, 2002. 41(10): p. 1183-9.

42. Prahalad S and G. DN, A comprehensive review of the genetics of juvenile idiopathic arthritis. Pediatric Rheumatol Online J, 2008. 21(6): p. 11.

43. Hunter, P.J. and L.R. Wedderburn, Pediatric rheumatic disease: can molecular profiling predict the future in JIA? Nat Rev Rheumatol, 2009. 5(11): p. 593-4.

44. Hunter, P.J., et al., Biologic predictors of extension of oligoarticular juvenile idiopathic arthritis as determined from synovial fluid cellular composition and gene expression. Arthritis & Rheumatism, 2010. 62(3): p. 896-907.

45. Griffin, T.A., et al., Gene expression signatures in polyarticular juvenile idiopathic arthritis demonstrate disease heterogeneity and offer a molecular classification of disease subsets. Arthritis & Rheumatism, 2009. 60(7): p. 2113-23.

46. Raudenbush, S.W. and A.S. Bryk, Hierarchical linear models: applications and data analysis methods 2nd ed. 2002, Thousand Oaks, CA: Sage Publications.

47. Ram, N. and K.J. Grimm, Growth mixture modeling: a method for identifying difference in longitudianl change among unobserved groups. International Journal of Behavioral Development, 2009. 33(6): p. 565-576.

48. Nagin, D.S. and C.L. Odgers, Group-based trajectory modeling in clinical research. Annu Rev Clin Psychol, 2010. 6: p. 109-38.

49. Yeates, K.O., et al., Longitudinal trajectories of postconcussive symptoms in children with mild traumatic brain injuries and their relationship to acute clinical status. Pediatrics, 2009. 123(3): p. 735-43.

50. Gill, T.M., et al., Trajectories of disability in the last year of life New England Journal of Medicine, 2010. 362(13): p. 1173-1180.

51. McLachlan, G.J. and D. Peel, Finite Mixture Models. 2000, New York John Wiley. 52. Jung, T. and K.A. Wickrama, An introduction to latent class growth analysis and growth mixture

modeling. Social and Personality Psychology Compass, 2008 2(1): p. 302-317. 53. Muthen, B. and L.K. Muthen, Integrating person-centered and variable-centered analyses:

growth mixture modeling with latent trajectory classes. Alcohol Clin Exp Res, 2000. 24(6): p. 882-91.

54. Nagin, D.S. and K.C. Land, Age, criminal careers, and population heterogeneity: specification and estimation of a nonparametric, mixed Poisson model. Criminology, 1993. 31: p. 327-362.

55. Jones, B.L. and D.S. Nagin, Advances in Group-based trajectory modeling and an SAS procedure for estimating them. Sociological Methods Research, 2007. 35(4): p. 542-571.

56. Jones, B.L., D.S. Nagin, and K. Roeder, A SAS procedure based on mixture models for estimating developmental trajectories. Sociological Methods Research, 2001 29(3): p. 374-393.

57. Nagin, D.S. and R.E. Tremblay, Developmental trajectory groups: fact or a useful statistical fiction Criminology, 2005. 43(4): p. 873-904.

58. Muthen, B., Latent variable analysis: growth mixture modeling and related techniques for longitudinal data, in Handbook of Quantitative Methodology for the Social Sciences, D. Kaplan, Editor. 2004, Sage Newbury Park. p. 345-368.

59. Muthen, L.K. and B. Muthen, Mplus user's guide Version 6. 2010, Los Angeles: Author. 60. Nyland K.L. , Asparouhov T, and B.O. Muthen, Deciding on the number of classes in latent class

analysis and growth mixture modeling: a monte carlo simulation study. Structural Equation Modeling, 2007. 14(4): p. 535-569.

61. Jung, T. and K.A.S. Wickrama, An introduction to latent class growth analysis and growth mixture modeling. Social and Personality Psychology Compass, 2008. 2(1): p. 302-317.

72

62. Celeux, G. and G. Soromenho, An entropy criterion for assessing the number of clusters in a mixture model. Journal of Classification 1996. 13: p. 195-212.

63. Muthen, B., Statistical and substantive checkign in growth mixture modeling: comment on Bauer and Curran. Psychological Methods, 2003. 8(369-377).

64. Bauer, D.J. and P.J. Curran, Distributional assumptions of growth mixture models: implications for overextraction of latent trajectory classes. Psychological Methods, 2003. 8: p. 338-363.

65. Thomas, E., et al., Subtyping of juvenile idiopathic arthritis using latent class analysis. British Paediatric Rheumatology Group. Arthritis Rheum, 2000. 43(7): p. 1496-503.

66. Thomas, E., et al., Subtyping of juvenile idiopathic arthritis using latent class analysis. British Paediatric Rheumatology Group. Arthritis & Rheumatism, 2000. 43(7): p. 1496-503.

67. Giannini EH, et al., Preliminary definition of improvement in juvenile arthritis. Arthritis & Rheumatism, 1997. 40(7): p. 1202-9.

68. Palmisani, E., et al., Correlation between juvenile idiopathic arthritis activity and damage measures in early, advanced, and longstanding disease. Arthritis Rheum, 2006. 55(6): p. 843-9.

69. Magni-Manzoni, S., et al., Prognostic factors for radiographic progression, radiographic damage, and disability in juvenile idiopathic arthritis. Arthritis Rheum, 2003. 48(12): p. 3509-17.

70. Welsing PM, et al., The relationship between disease activity and radiologic progression in patients with rheumatoid arthritis: a longitudinal analysis. Arthritis & Rheumatism, 2004. 50(7): p. 2082-93.

71. van der Heide, A., et al., Prediction of progression of radiologic damage in newly diagnosed rheumatoid arthritis. Arthritis & Rheumatism, 1995. 38(10): p. 1466-74.

72. Salaffi F, et al., Inter-observer agreement of standard joint counts in early rheumatoid arthritis: a comparison with grew scale ultrasonography -- a preliminary study. Rheumatology, 2008. 47(1): p. 54-8.

73. Guzman, J., et al., Reliability of the articular examination in children with juvenile rheumatoid arthritis: interobserver agreement and sources of disagreement. Journal of Rheumatology, 1995. 22(12): p. 2331-6.

74. Dempster, A.P., N.M. Laird, and D.B. Rubin, Maximum likelihood from incmomplete data via the EM algorithm Journal of the Royal Statistical Society, 1977 Series B, 39 (1): p. 1-38.

75. Liu, W.S. and J. Cela. Paper 371-2008 Count data models in SAS. in SAS Institute Inc. 2008. Proceedings of the SAS® Global Forum 2008 Conference. . 2008. Cary, NC: SAS Institute Inc.

76. Jones, B.L., D.S. Nagin, and K. Roeder, A SAS procedure based on mixture models for estimating developmental trajectories. Sociological Methods Research, 2001. 28: p. 374-393.

77. Xie, H., R. Drake, and G. McHugo, Are there distinctive trajectory groups in substance abuse remission over 10 years? An application of the group-based modeling approach. Administration and Policy in Mental Health & Mental Health Services Research, 2006. 33: p. 423-432.

78. Martini, A., Are the number of joints involved or the presence of psoriasis still useful tools to identify homogeneous disease entities in juvenile idiopathic arthritis? Journal of Rheumatology, 2003 30: p. 1900-1903.

79. Butbul, Y.A., et al., Comparison of patients with juvenile psoriatic arthritis and nonpsoriatic juvenile idiopathic arthritis: how different are they? Journal of Rheumatology, 2009. 36(9): p. 2033-41.

80. Muthen, L.K. Mplus support http://statmodel.com/ 2010 October 14, 2010]. 81. Textbook of pediatric rheumatology. 5th Edition ed, ed. J.T. Cassidy, et al. 2005, Philadelphia:

Elsevier.

http://statmodel.com/

73

9. APPENDICES

Appendix A: Comparison of the ILAR, JRA and JCA criteria [81]

Comparison of Classifications of Chronic Arthritis in Children

Juvenile Rheumatoid

Arthritis (ACR)

Juvenile Chronic Arthritis

(EULAR)

Juvenile Idiopathic Arthritis

(ILAR)

Systemic Systemic Systemic

Polyarticular Polyarticular Polyarticular RF-negative

Oligoarticular (pauciarticular) Juvenile rheumatoid arthritis Polyarticular RF-positive

Pauciarticular Oligoarticular

Persistent

Extended

Juvenile psoriatic arthritis Psoriatic arthritis

Juvenile ankylosing

spondylitis

Enthesitis-related arthritis

Other arthritis

74

Appendix B: ILAR Classification of JIA: Second Revision, Edmonton, 2001[11]

Exclusions

a. Psoriasis or a history of psoriasis in the patient or first degree relative

b. Arthritis in an HLA-B27 positive male beginning after the 6th

birthdat

c. Ankylosing spondylitis, Enthesitis related arthritis, sacroiliitis with inflammatory bowel

disease, Reiter’s syndrome, or acute anterior uveitis, or a history of one of these disorders

in a first-degree relative.

d. The presence of IgM rheumatoid factor on at least 2 occasions at least 3 months apart

e. The presence of systemic JIA in the patients

Categories

1. Systemic arthritis

Definition: arthritis in one or more joints with or preceded by fever of at least 2 weeks’ duration

that is documented to be daily (“quotidian”) for at least 3 days, and accompanied by one of the

following:

1. Evanescent (nonfixed) erythematous rash

2. Generalized lymph node enlargement

3. Serositis

Exclusions: a, b, c, d

2. Oligoarthritis

Definition: Arthritis affecting one to 4 joints during the first 6 months of disease. Two

subcategories are recognized:

1. Persistent oligoarthritis: affecting not more than 4 joints throughout the disease course

2. Extended oligoarthritis: affecting a total of more than 4 joints after the first 6 months of

disease

Exclusions: a, b, c, d, e

3. Polyarthritis (RF negative)

Definition: arthritis affecting 5 or more joints during the first 6 months of disease: a test for RF is

negative.

Exclusions: a, b, c, d, e

4. RF negative Polyarthritis

Definition: arthritis affecting 5 or more joints during the first 6 months of disease; 2 or more tests

for RF at least 3 months apart during the first 6 months of disease are positive.

Exclusions: a, b, c, e

75

5. Psoriatic arthritis

Definition: Arthritis and psoriasis, or arthritis and at least 2 of the following:

1. Dactylitis

2. Nail pitting or onycholysis

3. Psoriasis in a first-degree relative

Exclusions: b, c, d, e

6. Enthesitis related arthritis

Definition: arthritis and Enthesitis, or arthritis or Enthesitis with at least 2 of the following:

1. The presence of or a history of sacroiliac joint tenderness and/or inflammatory

lumbosacral pain

2. The presence of HLA-B27 antigen

3. Onset of arthritis in a male over 6 years of age

4. Acute (symptomatic) anterior uveitis

f. History of ankylosing spondylitis, enthesitis related arthritis, sacroiliitis with

inflammatory bowel disease, Reiter’s syndrome, or acute anterior uveitis in a first-degree

relative.

Exclusions: a, d, e

7. Undifferentiated arthritis

Definition: Arthritis that fulfills criteria in no category or in 2 or more of the above categories.

76

Appendix C: Data abstraction form

I. PATIENT ID, DEMOGRAPHICS

Patient study identification number: Sex: (Circle One) M F

Date of birth (DD-MM-YYYY)_____________

RACE: (CHECK ONE)

□ 01 ARAB

□ 02 BLACK- AFRICAN, CARRIBEAN

□ 11 ABORIGINAL, INUIT

□ 12 ABORIGINA,L NORTH AMERICAN INDIAN

□ 21 CHINESE, 23 KOREAN, or 24 JAPANESE

□ 22 FILIPINO

□ 26 SOUTH ASIAN (INDIAN SUBCONTINENT)

□ 27 SOUTHEAST ASIAN (VIETNAMESE, BURMESE, CAMBODIAN, THAI, MALAY, INDONESIAN)

□ 31-36 CAUCASIAN EUROPEAN

□ OTHER

DIAGNOSIS:

Initial arthritis diagnosis (made by physician) Check One If diagnosis revised later to another

category, enter relevant date

pauciarticular JRA

polyarticular JRA-RF negative

polyarticular JRA-RF positive

systemic JRA

Psoriatic arthritis

SEA syndrome

Juvenile Ankylosing spondylitis

Systemic JIA (ILAR)

Oligoarticular JIA , persistent (ILAR)

Oligoarticular JIA, extended (ILAR)

Polyarticular JIA, RF negative (ILAR)

Polyarticular JIA, RF positive (ILAR)

Psoriatic JIA (ILAR)

Enthesitis-related arthritis JIA (ILAR)

Undifferentiated JIA (ILAR)

Date of symptom onset of arthritis (fever if systemic JIA) (DD-MM-YYYY)

Date of initial diagnosis (DD-MM-YYYY)

Date of first visit: at this paediatric rheumatology centre (DD-MM-YYYY)

77

II. DIAGNOSTIC CRITERIA, COMORBIDITY

CIRCLE ONE

IF YES, DATE

FIRST NOTED

(DD-MM-YYYY)

Diagnostic criteria, exclusions, and other lab: YES NO

Rheumatoid factor (RF) positive x 2, > 3 months apart YES NO

Fever≥2 weeks, quotidian pattern, documented >3 days YES NO

Systemic JIA rash YES NO

Lymphadenopathy YES NO

Hepatosplenomegaly YES NO

Serositis YES NO

Psoriasis YES NO

Dacytilitis YES NO

Nail pitting YES NO

HLA-B27 positive YES NO

Male with onset of arthritis after 6th

birthday YES NO

Acute uveitis YES NO

Enthesitis YES NO

Sacroiliac (SI )joint tenderness and/or lumbosacral pain YES NO

First degree relative with Anklyosing spondylitis (AS) YES NO

First degree relative with sacroiliitis YES NO

First degree relative with acute uveitis YES NO

First degree relative with psoriasis/ psoriatic arthritis YES NO

First degree relative with inflammatory bowel disease

(IBD)

YES NO

First degree relative with reactive arthritis YES NO

Antinuclear Antibody (ANA) positive, ever YES NO

Co-morbidity (e.g. diabetes, osteoporosis, cataracts, asthma, allergic rhinitis, hypertension,

seizure disorder, hypothyroidism, attention deficit/hyperactivity syndrome, developmental delay,

macrophage activation syndrome)

DATE FIRST NOTED

(DD-MM-YYYY)

1.

2.

3.

4.

5.

6.

78

IV. VISITS- including first visit- Joint count Visit 1

Date

______________

Visit 2

Date

______________

Visit 3

Date

______________

Visit 4

Date

______________

Number of active joints

Systemic Fever YES NO YES NO YES NO YES NO

Systemic Rash YES NO YES NO YES NO YES NO

Hepatosplenomegaly YES NO YES NO YES NO YES NO

Lymphadenopathy YES NO YES NO YES NO YES NO

Serositis (pleuritis/pericarditis) YES NO YES NO YES NO YES NO

Enthesitis present YES NO YES NO YES NO YES NO

DMARDs at time of visit Visit 1 Visit 2 Visit 3 Visit 4

Methotrexate

Currently on:

YES □

NO □

Currently on:

YES □

NO □

Currently on:

YES □

NO □

Currently on:

YES □

NO □

Sulfasalazine Currently on:

YES □

NO □

Currently on:

YES □

NO □

Currently on:

YES □

NO □

Currently on:

YES □

NO □

Hydroxychloroquine

(Plaquenil) Currently on:

YES □

NO □

Currently on:

YES □

NO □

Currently on:

YES □

NO □

Currently on:

YES □

NO □

Other DMARD

(e.g. intramuscular gold

[aurothiomalate], oral gold,

leflunomide [Arava],cyclosporine

[Neoral], mycophenolate [Cellcept],

chloroquine, chlorabmucil, etc.) or

biologic (e.g. Enbrel, Remicade,

Anakinra, rituximab,)

DMARD name

______________

Currently on:

YES □

NO □

DMARD name

______________

Currently on:

YES □

NO □

DMARD name

______________

Currently on:

YES □

NO □

DMARD name

______________

Currently on:

YES □

NO □

DMARD name

______________

Currently on:

YES □

NO □

DMARD name

______________

Currently on:

YES □

NO □

DMARD name

______________

Currently on:

YES □

NO □

DMARD name

______________

Currently on:

YES □

NO □

79

Other Medications Visit 1 Visit 2 Visit 3 Visit 4

NSAIDS (e.g. Ibuprofen [Motrin/

Advil], diclofenac[Votaren],

indomethacin [Indocid],

naproxen[Naprosyn], etc

Currently on:

YES □

NO □

Currently on:

YES □

NO □

Currently on:

YES □

NO □

Currently on:

YES □

NO □

Prednisone Currently on:

YES

NO □

Currently on:

YES □

NO □

Currently on:

YES □

NO □

Currently on:

YES □

NO □

Laboratory data value at time of visit or within 3 months before visit date

Visit 1

Lab date

___________

Visit 2 Lab date ___________



Erythrocyte sedimentation rate

(ESR)

C-reactive protein (CRP)

Hemglobin

Platelet count

RF titre

ANA titre

Joint injections, Intravenous immune globulin (IVIG), methylprednisolone

Joint injected: Record dates of injections, if applicable

Shoulder_right

Shoulder_left

Elbow_right

Elbow_left

Wrist_right

Wrist_left

Hip_right

Hip_left

Knee_right

Knee_left

Ankle_right

Ankle_left

Other joint

IVIG –Ever (circle one) YES NO

IF YES: Start date

(DDMMYYYY)

Stop date

(DDMMYYYY)

Number of IVIG doses per course

Course 1

Course 2

80

Appendix D. Flow chart of patients and exclusions

1074 subjects from Winnipeg and Saskatoon

730 subjects

728 subjects

713 sujbects

659 subjects (361 Saskatoon, 298 Winnipeg)

344 subjects, < 3 visits with

rheumatologist

2 subjects, no

first visit

15 subjects, not JIA

54 subjects, diagnosis

>90 days before first visit

81

Appendix E. Frequency of AJC at each visit. Each unit of timeframe is 6 months of follow-up.

82

83

84

85