Date post: | 25-Dec-2015 |
Category: |
Documents |
Upload: | philip-hunt |
View: | 213 times |
Download: | 0 times |
Endpoints
(also Outcomes, Major Response
Variables)
How do you determine whether the treatment is effective?
Randomization (√)
Blinding (√)
Well-defined target population (√)
Appropriate control (√)
Excellent follow-up (√)
Adequate sample size (√)
Appropriate interim monitoring (√)
Endpoints (?)
Outline
• General endpoint considerations
• Surrogate endpoints
• Composite endpoints and recurrent events
• Safety outcomes (adverse events)
Major Problems that Limit Interpretation of Randomized Trials
• Inappropriate controls
• Endpoints which are not clinically relevant
• Inadequately powered studies
• Poor follow-up
• Improper interim analyses
Protocol and Trial Report - 1
• Endpoints should be pre-specified:– Written down in the protocol before trial begins– Defines the trial objectives (in part)– Basis for sample size
• Example: Herpes Zoster Vaccine Protocol– Hypotheses and Objectives: “The primary objective of
this study is to determine whether immunization with live-attenuated varicella-zoster vaccine (OKA/Merck Strain) can reduce the incidence and/or severity of herpes zoster (HZ) and its complications, primarily postherpetic neuralgia (PHN) in persons 60 years of age and older. This will be accomplished by comparing a measure of the burden of illness due to HZ and PHN in vaccine and placebo recipients.
Protocol and Trial Report - 2
• Example: Herpes Zoster Vaccine Trial Report (N Engl J Med 2005)– “The Shingles and Prevention Study (Department of
Veterans Affairs [VA] Cooperative Study No. 403) was conducted to determine whether vaccination with a live attenuated VZV vaccine would decrease the incidence, severity, or both of herpes zoster and post-herpetic neuralgia in adults 60 years of age or older.”
– “The primary endpoint was the burden of illness due to herpes zoster, a severity-by-duration measure of the total pain and discomfort associated with herpes zoster in the population of study subjects.”
Primary Endpoint
– Usually one outcome is specified as most important (primary endpoint)
– Key variable in design (follows from objective)
– Basis for sample size
– A focus of interim monitoring and QA
– Response variable given major attention in trial report
– Usually, but not always, e.g., Cox-2 trials on GI bleeding, relates to efficacy. In some studies the primary endpoint encompasses efficacy and safety, e.g., mortality in CHF study, lipid study, HIV treatment study
– “A clinical endpoint that provides evidence sufficient to fully characterize the effect of a treatment in a manner that would support a regulatory claim for treatment” (O’Neill RT, Cont Clinical Trials, 1997;18:550-556)
Secondary Endpoints
• There are usually several efficacy endpoints and these are commonly referred to as secondary endpoints or secondary efficacy endpoints.
• Safety endpoints must also be specified and are usually consider secondary– Discontinuation of study treatment– Side effects/adverse events– Serious adverse events– O’Neill (FDA) defines a secondary endpoint as one that
“provides additional characterization of treatment effect but that is not sufficient to characterize fully the benefit or to support a claim for a treatment effect”.
Characteristics Desired for Endpoint
• Relevant; easy to interpret
• Easy to diagnose
• Can be ascertained and classified in an unbiased manner
• Sensitive to treatment differences
• Measurable within a reasonable period of time
General Considerations - 1
More commonly occurring endpoints (high incidence) will result in smaller sample sizes than less frequent events (low incidence) as long as expected relative difference between treatment groups is similar • CHD + non-fatal MI vs. CHD death for lipid-lowering trial• Progression-free survival vs survival for cancer trial
Continuous response variables usually result in smaller sample sizes than binary or time to event• BP change vs. % with normal BP• HIV RNA change vs. % < 50 copies/mL• Weight change vs. % who lose > 5% of baseline
weight
General Considerations - 2
More serious events should be considered along with less serious ones • Count CHD deaths along with non-fatal MIs
Related to this, some events may have to be included to avoid misinterpretation due to informative censoring (this could result in a loss of power)
• Non-arrhythmic deaths along with arrhythmic deaths (DEFINITE, NEJM 2004)
• Death and missing data along with change in exercise duration (PICO, Heart 1996)
• Progression to AIDS or death from any cause (SMART, NEJM 2006)
Endpoint Examples - 1
MRFIT (JAMA 1982 and Amer J Cardiol 1986)
Primary: CHD DeathSecondary: CHD Death or non-fatal MI
CVD mortalityAll-cause mortality
NuCombo HIV Study (N Engl J Med 1996)
Primary: Progression to AIDS or death from any cause
Secondary: Death
AMIS (JAMA 1982)
Primary: All deathsSecondary: CHD death or non-fatal MI
Endpoint Examples - 2
• New BP-lowering drug – Systolic BP change
• CHF device – change in NYHA class; 6-minute walk
• New antiretroviral drug – HIV RNA suppression at 24 and 48 weeks
Special considerations: longitudinal measurements; informative missing data; left censoring
Choice of Endpoint in TOMHS[Antihypertensive Drugs]
BP, Side Effects, Quality of Life
Echocardiographic and Electrocardiographic Changes (Asymptomatic CVD)
Non-fatal MI, Stroke, Angina, Peripheral Artery Disease (Symptomatic CVD)
CVD Death• CHD• Stroke
Total Mortality
Choice of Endpoint in Antiretroviral Trials
CD4+ count; viral load
Genotypic/phenotypic resistance;Loss of drug options
Clinical disease progression (AIDS)
Serious AIDS and non-AIDS events
Survival
Choice of Endpoint in HIV Vaccine Trials
HIV Infection
Durable control of viremia post-infection (viralload set point)
CD4+ decline/ART
AIDS or death
Endpoints Used to Approve Cancer Drugs and Biologics
• Survival• Symptom endpoints (patient reported
outcomes)• Disease-free survival (e.g., time to tumor
recurrence or death)• Objective response rate (e.g., proportion of
patients with tumor size reduction of a pre-defined amount for a minimum time period)
Guidance for Industry. Clinical Trial Endpoints for theApproval of Cancer Drugs and Biologics. May 2007.
Requirements for FDA Approval Vary
• Antiretroviral drugs for HIV – viral load (regimen failure)
• Antihypertensive drug – BP
• CHF drug – morbidity and mortality
• Device for CHF – functional status
• Osteoporosis drugs – bone density
For Other Areas There is Uncertainty
• Treatments for community acquired bacterial pneumonia (CABP)– Should focus be on clinical endpoints that
capture how a patient feels or should outcomes incorporate clinical signs (e.g., fever) and laboratory tests (e.g., WBC count)?
– FDA position in November 2011: “improvement in at least 2 symptoms attributable to CABP… at a minimum cough, sputum production, chest pain, and shortness of breath at an early time point (i.e., day 3 or 5 after enrolment).
Anti-Infective Drugs Advisory Committee Briefing DocumentNovember 2011
Choice of Endpoint: General Hierarchical Categorization
• Clinical outcome (morbidity and mortality)
• Surrogate for clinical outcome (may be hard to establish)
• Intermediate outcome that is likely to predict clinical benefit (non-validated surrogate)
• Biomarker which measures biologic activity
Types of Endpoints from an Analytic Point of View
• Binary
• Ordered categorical
• Continuous (single point in time, repeated measures, slope)
• Counts
• Time to event and rates
Other Endpoint Considerations
1. Training of evaluators (e.g., BP measurement)
2. Ongoing quality assurance (e.g., laboratory QC)
3. Endpoint classification committee
4. Blinding of endpoint determination (as noted previously this can be done even in open- label studies)
5. Methods for reducing missing data
– Informed consent
– Training
– Quality assurance procedures
– Collect identifying information at entry
– National Death Index
Outline
• General endpoint considerations
• Surrogate endpoints
• Composite endpoints
• Safety outcomes (adverse events)
Surrogate Endpoint(Definition)
• Failure of treatment (relative to control) to influence the surrogate implies failure to influence “true” endpoint (Prentice, Stat Med, 1989)
• Surrogate must be able to capture “full” dependence of “true” endpoint rate on randomization group, i.e., no pathways whereby randomization assignments affect “true” endpoint that bypass the surrogate response
• Occurs with greater frequency than “true” endpoint
• Occurs sooner after treatment than “true” endpoint
Other Definitions of Surrogate Endpoints
• “a laboratory measurement or clinical sign used as a substitute for a clinically meaningful endpoint that measures directly how a patient feels functions or survives” (Temple).
• “ A surrogate endpoint is expected to expected to predict clinical benefit (or harm or lack of benefit or harm) based on epidemiologic, therapeutic, pathophysiologic, or other scientific evidence” (Biomarker Definitions Working Group; also, IOM report).
• “an outcome measure that substitutes for a clinical event of true importance” (Grimes and Schulz).
Institute of Medicine Report: Evaluation of Biomarkers and Surrogate Endpoints in
Chronic Disease• Biomarker evaluation should consist of 3 steps:
– Analytical validation (e.g., performance of an assay)
– Qualification (assessment of association of biomarker with disease and the effect of interventions on biomarkers)
– Utilization (contextual analysis of available evidence on the specific use proposed). A biomarker may be used as a surrogate for some disease states and not in others, e.g., HIV viral load in setting where suppression is complete versus partial
IOM 2010, National Academies Press
Relationship Between Treatment (A or B), a Surrogate Marker (S), and the Clinical
Outcome (T)
A or B
S
T
OtherMechanisms of Action Surrogate Marker
Treatment
Clinical Outcomes
Treatment has many mechanisms of action
Fleming and DeMets, Ann Int Med, 1996
Surrogate Endpoint:Not in Causal Pathway of Disease Process
Disease Surrogate True Clinical Endpoint Endpoint
Causal Pathway
Fleming and DeMets, Ann Int Med, 1996
A correlate does not a surrogate make!
Fleming TR and DeMets DL, Ann Int Med, 1996.
0
1 0
2 0
3 0
4 0
5 0
6 0
7 0
8 0
9 0
< 1 2 0 1 2 0 -1 2 9
1 3 0 -1 3 9
1 4 0 -1 5 9
1 6 0 -1 7 9
1 8 0 -2 0 9
2 1 0 +
Ag
e-a
dju
ste
d C
HD
Mo
rtal
ity
(pe
r 1
0,0
00
pe
rso
n-y
ea
rs)
Systolic Blood Pressure (mmHg)
Data from the MRFIT Study
Blood Pressure is Considered An Acceptable Surrogate Endpoint by the FDA
• Substantial epidemiological and clinical trial data.
• Demonstration that diverse BP-lowering agents provide benefit.
• Considered to be the principal causal pathway.
ALLHAT Results on Doxazosina: Cumulative 4-Year Rate (%)
Doxazosin Chlorthalidone
No. Patients 9,067 15,268
Fatal/Non-Fatal CHD 6.3%6.3%
Fatal/Non-Fatal Stroke 4.2%3.6%
Congestive Heart Failure 8.1%4.5%
All Cause Mortality 9.6%9.1%
a JAMA; 283:1967-1975, 2000
Surrogate True ClinicalEndpoint Endpoint
Intervention
Disease
True Clinical Endpoint
Surrogate Endpoint
Disease
Intervention
Multiple Pathways of the Disease Process
Surrogate True ClinicalEndpoint Endpoint
Intervention
Disease
Best Situation for Assessing Surrogacy
CD4+ AIDS or Count Death
IL-2
Disease
• IL-2: known to increase CD4+ cell count
• IL-2: known to be associated with toxicities
• Unknown whether IL-2 is increasing functional CD4+ cells
Interleukin -2 Trials for HIV
Deaths and Serious AIDS Event Rates by Latest CD4+ Count Following Initiation of ART
< 200 1327 29 4.29 907 74 8.16
200-350 2624 18 0.84 2284 38 1.66
350-499 3532 21 0.42 3228 18 0.56
500+ 8425 22 0.30 7964 21 0.26
CD4+ Level PY Events Rate PY Events Rate
All-Cause Mortality Serious AIDS
CASCADE Collaboration http://www.cascade-collaboration.org
ESPRIT Study Design
Patients taking ART with CD4+ counts ≥ 300/μL
N = 2071 N = 2040
IL-2
ART plus:
• 3 cycles of IL-2 (7.5 MIU twice daily for 5 days, 8 wks apart)
• additional cycles to maintain goal (2x baseline or ≥ 1000 CD4+ cells)
Control
ART without IL-2
Plan: 320 primary events
Closure date 15 Nov 2008
323 primary events observed
Median follow-up = 7 years
N Engl J Med 2009 361:1548-1559.
Median CD4+ During Follow-up
0
100
200
300
400
500
600
700
800
0 1 2 3 4 5 6 7Year
CD
4+ IL-2
Control
Avg Difference:160 cells, p<.001
Time spent IL-2 Control
< 300 cells 6% 9%
> 600 cells 57% 36%
IL-2: 2071 1846 1829 1797 1757 1721 1410 878
Control: 2040 1928 1861 1803 1739 1648 1350 824
No. pts
Primary EndpointOpportunistic Disease or Death
IL-2 Control
No. Rate* No. Rate* HR (95% CI) p-value
158 1.13 165 1.21 0.93 (0.75, 1.16) 0.52
Predicted HR based on CD4+ difference = 0.74
* rate per 100 person years
Surrogate True Clinical Endpoint EndpointDisease
Intervention
Interventions having Mechanisms of Action Independent of the Disease
Process
Concorde Study Results(Lancet 341:889-90, 1993)
Deaths 95 76
AIDS or Death 175 171
ARC, AIDS or Death 263 284
ImmediateZDV
DeferredZDV
CD4 difference over 3 yearsof follow-up (immediate - deferred) =
30 cells (p < 0.0001)
0
5
10
15
20
25
30
35
40
< 25 25 -49
50 -99
10 0 -19 9
20 0 -49 9
12-
mo
nth
Cu
mu
lati
ve
Mo
rta
lity
(%)
Baseline CD4+ Lymphocyte Count(cells/mm3)
CID 1996; 22:513-520
CD4+ Count and Mortality: Pre-HAART Era
Overview of Trialsof ZDV vs. Placebo (Immediate vs.
Deferred)
Total 4431 3291
No. deaths 734 617
No. AIDS/deaths 1026 882
ImmediateZDV
DeferredZDV (placebo)
Risk ratio = 1.04
Risk ratio = 0.96
Lancet 353:2014-2025, 1999.
Operational Criteriafor Valid Surrogates
• The surrogate (S) must predict the clinical event (T)
• Treatment must effect surrogate
• The surrogate (S) must fully capture the effect of treatment on the clinical event (T)
Prentice R, Stat Med, 1989.
Evaluation of Surrogacy
Determine relative risk (treatment versus control) of long-term clinical outcome
Show that relative risk when adjusted for marker in each treatment group is one
Not an optimal approach. Ideally, results from several studies with surrogate and clinical
outcomes would be compared.
Statistical Analysis for Single Study
• Fit logistic regression models:
1) log odds (long-term clinical outcome) =
a + b (trt)
2) log odds (long-term clinical outcome) =
a + ba (trt) + c (marker change)
• If marker change fully explains treatment effect, then ba would be zero
• (b-ba)/b measures proportion of effect on long-term clinical outcome explained by effect on marker
Validation of Surrogate Endpoints
Statistical – want more than one study · Meta-analyses of clinical trials data
Clinical · Comprehensive understanding of the causal pathways and intended and unintended mechanisms of action
Overview of 16 Antiretroviral Trials State-of-the-Art Conference 1993
Sig. Diff.
Sig. Diff.
No Diff.
7 6
1 2
No Diff.
8 8
13
3
Clinical DiseaseProgression
CD4+Change
Colon Cancer Example
• Traditional endpoint is overall survival (OS)
• Hypothesis: Disease free survival (DFS), assessed after 3
yrs, is an appropriate endpoint to replace overall survival
(OS) in adjuvant colon trials
Allow more rapid completion, reporting of trials
Allow promising agents to benefit patients more quickly
• Approach: Compare difference between treatment and
control (hazard ratio) within each trial for DFS and OS
Sargent DJ et al, J Clin Oncol, 2005;23:8664-8670.
Overall and Disease-Free Survival for Adjuvant Treatment for Colon Cancer
Sargent DJ et al, J Clin Oncol, 2005;23:8664-8670.
“The validity of a surrogate endpoint should be
judged by the probability that the trial results
based on the surrogate endpoint alone are
‘concordant’ with the trial results that would be
obtained if the true endpoint were observed and
used for the analysis”
Begg and Leung, J R Statis Soc A 2000; 163:15-28
Examples
• Anti-arrhythmic treatment for sudden death
• HSV-2 suppressive treatment to prevent HIV infection
• Estrogen/progestin treatment for coronary heart disease
There are many other examples of failed surrogates (e.g., HF treatment, drugs to
raise HDL cholesterol, weight-loss drugs)
Choice of Endpoint in Cardiac Arrhythmia Suppression Trial (CAST)
[Antiarrhythmic Drugs]
Ventricular Premature Beats
Arrhythmias
Sudden Death
Total Mortality
CAST Study DesignDouble-blind, placebo controlled
Eligible patients
Open-label titration
A. Encainide
B. Flecainide
C. Moricizine
Greater 80% suppression of VPDs
Yes No, excluded
Flecainide Placebofor
Flecainide
Moricizine Placebofor
Moricizine
Encainide Placebofor
Encainide
N Engl J Med 1989; 321:406-412.
CAST Study Sample Size Assumptions
Type I error () = .025 (1-sided)
Power (1-) = 0.85
Projected mortality in placebo group = 11% over 3 years
Expected reduction in mortality due to treatment = 30%
(No difference among active treatments)
Required sample size = 4400 patients
CAST Study Results
No. patients730 725
Deaths from arrhythmia 33 9or cardiac arrest
Other cardiac death 14 6
Other deaths or 9 7unclassified cardiac arrest
TOTAL 56 22
Encainide/Flecainide Placebo
Acyclovir and HIV Acquisition
• Background– Herpes simplex virus type 2 (HSV-2 infection
is the most common cause of genital ulcers
– Observational studies indicate that HSV-2 infection is associated with 2-3 fold increased risk for HIV infection
– Acyclovir is effective in suppressing HSV-2
• Two trials were conducted to determine if acyclovir treatment reduced HIV acquisition
HSV-2/HIV Trial #1
HIV-negative HSV-2, sero-positive men and women at risk
for HIV based on sex history
3,277 randomized
1637 Acyclovir 1640 Placebo
HR (acyclovir/placebo) for genital ulcers = 0.53 (95% CI: 0.46-0.62)
HR for HIV = 1.16 (95% CI: 0.83-1.62)
Lancet 2008, 371: 2109-2119
HSV-2/HIV Trial #2
HSV-2/HIV dually infected persons in heterosexual relation
with HIV-negative partner
3,408 couples randomized
1707 acyclovir 1701 placebo
HR (acyclovir/placebo) for genital ulcers = 0.27 (95% CI: 0.20-0.36)
HR for HIV = 0.92 (95% CI: 0.60-1.41) among partners
N Engl J Med 2010; 362: 427-439
Estrogen + Progestin inHealthy Postmenopausal Women: Lipidsafter 1 year and Events after 5.2 Years
LDL -12.7%
HDL +7.3%
CHD +29%
Stroke +41%
All CVD +22%
Estrogen + Progestinvs.
Placebo
JAMA 288:321-333, 2002
Review of Trials Funded by NHLBIGordon D et al N Engl J Med 2013; 369:1926-1934.
• 244 trials completed in 2000 or 2011.– 45 (18%) with clinical endpoints– 199 with surrogates or health-related behaviors
• 156 published (23% <12 mos. and 57% < 30 mos.)• Faster time to publication (predictors in
multivariable analysis)– Clinical endpoints (median 10 vs 30 mos. for surrogate
endpoint studies) – Cost ($5 million)– Positive result
• Clinical endpoint trials accounted for 82% of citations
Can be a big payoff for what usually is many years of work!
Summary
• There is no such thing as a risk-free intervention (unintended effects of treatment are common)
• Failure of surrogates to predict clinical outcome differences are common
• Trials with clinical endpoints have a greater impact on care than trials with surrogate endpoints
• It is time-consuming (and expensive) to rigorously establish surrogacy (need trials for which both the surrogate and clinical outcome are measured)
• Some judgment about causal pathways is always needed
• Potential surrogates are important in early phase studies