+ All Categories
Home > Documents > Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question...

Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question...

Date post: 27-Oct-2019
Category:
Upload: others
View: 3 times
Download: 0 times
Share this document with a friend
46
1 Cochrane Diagnostic Test Accuracy Reviews Meta-analysis of diagnostic accuracy studies Yemisi Takwoingi UK Support Unit Department of Public Health, Epidemiology & Biostatistics University of Birmingham
Transcript
Page 1: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

1

Cochrane Diagnostic Test

Accuracy Reviews

Meta-analysis of diagnostic accuracy studies

Yemisi TakwoingiUK Support UnitDepartment of Public Health, Epidemiology & BiostatisticsUniversity of Birmingham

Page 2: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

2

Diagnostic Test Accuracy Reviews

� Framing the question

� Identification and selection of studies

� Quality assessment

� Data extraction

� Data analysis

� Interpretation of the results

Page 3: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

3

2x2 Table

Disease

(Reference test)

Present Absent

Index-

Test

+ TP FP TP+FP

- FN TN FN+TN

TP+FN FP+TNTP+FP+

FN+TN

3

Page 4: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

4

Test accuracy

� Sensitivity � describes the proportion of patients with the target condition with index test results above a threshold

� Specificity

� describes the proportion of patients without the target condition with index test results below a threshold

� Thresholds vary between studies

� Same threshold can imply different sensitivities and specificities in different groups

Page 5: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

5

2x2 Table

Disease

(Reference test)

Present Absent

Index-

Test

+ TP FP TP+FP

- FN TN FN+TN

TP+FN FP+TNTP+FP+

FN+TN

sensitivityTP / (TP+FN)

specificityTN / (TN+FP)

5

Page 6: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

6

Heterogeneity in threshold

Non-diseased Diseased

Diagnostic Threshold

TN FN FP TP

specificity=99% sensitivity=71%

Page 7: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

7

Heterogeneity in threshold

Non-diseased Diseased

Diagnostic Threshold

TN FN FP TP

specificity=97% sensitivity=86%

Page 8: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

8

Heterogeneity in threshold

Non-diseased Diseased

Diagnostic Threshold

TN FN FP TP

specificity=94% sensitivity=94%

Page 9: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

9

Heterogeneity in threshold

Non-diseased Diseased

Diagnostic Threshold

TN FN FP TP

specificity=97% sensitivity=86%

Sensitivity =97%Specificity = 86%

X X

Page 10: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

10

Heterogeneity in threshold

Non-diseased Diseased

Diagnostic Threshold

TN FN FP TP

specificity=71% sensitivity=99%

Page 11: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

11

Threshold effects

Increasing

threshold

increases

specificity but decreases sensitivity

Decreasing threshold increases

sensitivity but

decreases

specificity

0.2

.4.6

.81

se

ns

itiv

ity

0.2.4.6.81specificity

for predicting spontaneous birth

Fetal fibronectin

0.2

.4.6

.81

se

ns

itiv

ity

0.2.4.6.81specificity

for predicting spontaneous birth

Fetal fibronectin

Page 12: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

12Deeks, J. J BMJ 2001;323:157-162

Receiver characteristic operating (ROC) curve

The ROC curve represents the relationship between the true positive rate (TPR) and the false positive rate (FPR) of the test at various thresholds used to distinguish disease cases from non-cases.

Page 13: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

13

Diagnostic odds ratios

FNFP

TNTPORDiagnostic

×

×=

veLR

veLR

yspecificit

yspecificit

ysensitivit

ysensitivit

DOR−

+=

=

1

1

Ratio of the odds of positivity in the diseased to the

odds of positivity in the non-diseased

Page 14: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

14

Diagnostic odds ratios

35625472

1981617-

1589365+HPV

Test

AbsentPresent

Cervical Cancer

(Biopsy)

16793

16165DOR =

×

×=

Page 15: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

15

Diagnostic odds ratios

980118818913962311499999%

18813611717644291995%

89117181362114990%

39676361696480%

2314421954270%

1492914642260%

99199422150%

99%95%90%80%70%60%50%Specificity

Sensitivity

Page 16: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

16

Symmetrical ROC curves and diagnostic odds ratios

uninformative test

line of symmetry

0.2

.4.6

.81

Sen

sitiv

ity

0.2.4.6.81Specificity

DOR = 90

DOR = 6

DOR = 15

DOR = 3

As DOR increases, the ROC curvemoves closer to its ideal position near the upper-left corner.

ROC curve is asymmetric when test accuracy varies with threshold

Page 17: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

17

The meta-analysis process

1. Calculation of an overall summary (average) of high precision, coherent with all observed data

2. Typically a “weighted average” is used where more informative (larger) studies have more say

3. Assess the degree to which the study results deviate from the overall summary

4. Investigate possible explanations for the deviations

Page 18: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

18

Meta-analysis of studies of diagnostic accuracy

� Pair of related summary statistics for each study

� Sensitivity and specificity

� Positive and negative likelihood ratios

� Threshold effects induce correlations between

sensitivity and specificity

� Heterogeneity is the norm not the exception

� Substantial variation in sensitivity and specificity are

noted in most reviews

Page 19: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

19

� statisticians like straight lines with axes that are independent variables

� first calculate the logits of TPR and FPR

� and then graph the difference against their sum

Statistical modelling of ROC curves

Page 20: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

20

ROC curve and logit difference and sum

plot: small difference, same spread

0

20

40

60

80

100

0 20 40 60 80 100

false positive rate (%age)

tru

e p

os

itiv

e r

ate

(%

ag

e)

-2

2

6

10

-40 -20 0 20 40

logit TPR + logit FPR

log

itT

PR

-lo

git

FP

R

0

0.02

0.04

0.06

0.08

0.1

0 20 40 60 80 100

measurement

rela

tive f

req

uen

cy non-diseased diseased

Page 21: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

21

ROC curve and logit difference and sum plot:

moderate difference, same spread

0

20

40

60

80

100

0 20 40 60 80 100

false positive rate (%age)

tru

e p

osit

ive

rate

(%ag

e)

-4

0

4

8

-30 -20 -10 0 10 20 30 40

logit TPR + logit FPR

log

itT

PR

-lo

git

FP

R

0

0.02

0.04

0.06

0.08

0.1

0 20 40 60 80 100

measurement

rela

tiv

e f

req

uen

cy diseasednon-diseased

Page 22: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

22

ROC curve and logit difference and sum plot:

large difference, same spread

0

20

40

60

80

100

0 20 40 60 80 100

false positive rate (%age)

tru

e p

os

itiv

e r

ate

(%

ag

e)

-4

0

4

8

-30 -20 -10 0 10 20 30 40

logit TPR + logit FPR

log

itT

PR

-lo

git

FP

R

0

0.02

0.04

0.06

0.08

0.1

0 20 40 60 80 100

measurement

rela

tive

fre

qu

en

cy

non-diseased diseased

Page 23: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

23

ROC curve and logit difference and sum plot:

moderate difference, unequal spread

0

20

40

60

80

100

0 20 40 60 80 100

false positive rate (%age)

tru

e p

os

itiv

e r

ate

(%

ag

e)

HIGH DOR

LOW DOR

-6

-4

-2

0

2

4

6

8

10

-30 -20 -10 0 10 20 30

logit tpr + logit fpr

log

ittp

r-

log

itfp

r

0

0.02

0.04

0.06

0.08

0.1

0 20 40 60 80 100

measurement

rela

tive

fre

qu

en

cy

non-diseased diseased

Page 24: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

24

Moses-Littenberg SROC method

� Regression models can be used to fit the straight lines to model relationship between test accuracy and test threshold

D = a + bS

� Outcome variable D is the difference in the logits� Explanatory variable S is the sum of the logits� Ordinary or weighted regression – weighted by sample

size or by inverse variance of the log of the DOR

� What do the axes mean?� Difference in logits is the log of the DOR� Sum of the logits is a marker of diagnostic threshold

Page 25: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

25

Producing summary ROC curves

� Transform back to the ROC dimensions

� where ‘a’ is the intercept, ‘b’ is the slope

� when the ROC curve is symmetrical, b=0 and

the equation is simpler

Page 26: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

26

SROC regression: PSV example

Transformation linearizes relationship between

accuracy and threshold so that linear regression

can be used

0.0 0.2 0.4 0.6 0.8 1.0

1 - Specificity

0.0

0.2

0.4

0.6

0.8

1.0

Sensitiv

ity

-4 -3 -2 -1 0 1 2

S

1

2

3

4

5

6

7

unweighted

weighted

Page 27: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

27

PSV example cont.

The SROC curve is produced by using the estimates of a and b to compute

the expected sensitivity (tpr) across a range of values for 1-specificity (fpr)

inverse transformation

-4 -3 -2 -1 0 1 2

S

1

2

3

4

5

6

7

unweighted

weighted

0.0 0.2 0.4 0.6 0.8 1.0

1 - Specificity

0.0

0.2

0.4

0.6

0.8

1.0

Se

nsitiv

ity

Page 28: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

28

RevMan 5: data and analyses

� Add data by test or study

� Add covariate � Study or test level

� Continuous or categorical

� Add analysis� Single test

� Multiple tests

� Paired data

Page 29: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

29

Page 30: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

30

SROC curve, points scaled by their inverse standard error

Page 31: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

31

Problems with the Moses-LittenbergSROC method

� Poor estimation

� Tends to underestimate test accuracy due to zero-cell

corrections and bias in weights

� Validity of significance tests

� Sampling variability in individual studies not properly taken

into account

� P-values and confidence intervals erroneous

� Operating points

� knowing average sensitivity/specificity is important but

cannot be obtained

� Sensitivity for a given specificity can be estimated

Page 32: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

32

Advanced models –HSROC and Bivariate methods

� Hierarchical / multi-level

� allows for both within and between study variability, and within study correlations between diseased and non-diseased groups

� Logistic

� correctly models sampling uncertainty in the true positive proportion and the false positive proportion

� no zero cell adjustments needed

� Random effects

� allows for heterogeneity between studies

� Regression models

� used to investigate sources of heterogeneity

Page 33: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

33

0

.5

1S

ensitiv

ity

0.51Specificity

threshold

shape

accuracy

Hierarchical SROC model

Page 34: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

34

0

.5

1S

ensitiv

ity

0.51Specificity

correlation

sensitivity

specificity

Bivariate model

Page 35: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

35

Summary points or SROC curves?

� Clinical interpretation

� Need to estimate performance at a threshold, using sensitivity, specificity or/and likelihood ratios

� Single threshold or mixed thresholds?

� Summary curve describes how test performance varies across thresholds. Studies do not need to report a common threshold to contribute.

� Summary point must relate to a particular threshold. Only studies reporting a common threshold can be combined.

Page 36: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

36

RevMan 5: analyses

Page 37: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

37

Comparative analyses

� Indirect comparisons

� Different tests used in different studies

� Potentially confounded by other differences between the

studies

� Direct comparisons

� Patients receive both tests or randomized to tests

� Differences in accuracy more attributable to the tests

� Few studies may be available and may not be

representative

Page 38: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

38

Example of pilot Cochrane ReviewDown’ Syndrome screening review

72,797 19 2nd trimester - triple test (serology)

222,171 22 1st trimester - NT and serology

79,412 10 1st trimester - NT alone

ParticipantsStudies

Page 39: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

39

Page 40: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

40

Page 41: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

41

Page 42: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

42

Page 43: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

43

Page 44: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

44

NT alone

Sensitivity: 72% (63%-79%)

Specificity: 94% (91% -96%)

DOR: 39 (26-60)

NT with serology

Sensitivity: 86% (82%-90%)

Specificity: 95% (93%-96%)

DOR: 110 (84-143)

RDOR: 2.8 (1.7-4.6),

p <0.0001

Triple test

Sensitivity: 82% (76%-86%)

Specificity: 83% (77%-87%)

DOR: 21 (15-30)

RDOR: 0.5 (0.3-0.9),

p = 0.03

Indirect comparison

Page 45: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

45

DIRECT COMPARISONS

NT alone

Sensitivity: 71% (59%-82%)

Specificity: 95% (91%-98%)

DOR: 41 (16-67)

NT with serology

Sensitivity: 85% (77%-93%)

Specificity: 96% (93%-98%)

DOR: 123 (40-206)

Triple test

No paired studies available

Page 46: Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

46

Summary

� Bivariate nature of the data requires a different approach to traditional meta-analysis

� SROC approach useful for preliminary analyses

� Advanced methods required for making formal inference


Recommended