Confounding: A Big Idea V1 August 1, 2016
www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf Page 1
2015 StatChat2V1 1
Milo Schield, Augsburg CollegeMember: International Statistical Institute
US Rep: International Statistical Literacy Project
Director, W. M. Keck Statistical Literacy Project
VP. National Numeracy Network
Editor: www.StatLit.org
August 1, 2016www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf
Confounding:A Big Idea
V1 2015 StatChat2 2
Core Concepts in Intro Stats
McKenzie (2004): Survey of Educators
Goodall@RSS (2007) Big Ideas in Statistics
Garfield & Ben Zvi (2008): Big Ideas of Statistics
Gould-Miller-Peck (2012). Five Big Ideas
Blitzstein@Harvard (2013): 10 Big Ideas Stat110
Stigler (2016): Seven pillars of statistical wisdom
2
V1 2015 StatChat2 3
Ambiguity of “Importance”
Topic (randomness) or a claim: ME ~ 1/sqrt(n)
This paper focuses on claims or relationships having substantial social or cognitive consequences.
3 V1 2015 StatChat2 4
1A: Fallacies
1. Confusion of the inverse: P(A|B) = P(B|A)
2. Conjunction fallacy: P(A&B) > P(A)
3. P(A&B |C) > P(A |B&C): Three-factor fallacy
4. Individual fallacy
5. Ecological fallacy
6. Simpson’s Paradox
4
V1 2015 StatChat2 5
Contributions of Statisticsto Human Knowledge
.
5 V1 2015 StatChat2 6
#2A: Butterfly Fallacy
One should never trust a statistical association generated by an observational study.
An unknown or unmeasured confounder –regardless of size (a small as a butterfly) – can nullify or reverse an observed association.
6
Confounding: A Big Idea V1 August 1, 2016
www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf Page 2
2015 StatChat2V1
Smoking Causes Cancer:Fisher’s Argument
Observational data: Smokers are 10 times as likely to develop lung cancer as are non-smokers.
Some statisticians wanted to support the claim that smoking “caused” lung cancer.
Sir Ronald Fisher (1958) noted that “association was not causation” and that there was a difference (factor of two) in smoking preference between fraternal and identical twins.
7 2015 StatChat2V1
Smoking Causes Cancer:Cornfield’s Reply
Cornfield et al (1959) argued that to nullify or reverse the observed association, the relative risk of a confounder must exceed the relative risk of that association.
Fisher never replied.
8
“Cornfield's minimum effect size is as important to observational studies as is the use of randomized assignment to experimental studies.” Schield (1999)
2015 StatChat2V1
Cornfield Condition for Nullification or Reversal
Schield (1999) based on realistic data
9 2015 StatChat2V1
Confounder Distribution: Simple One-Parameter Model
How to deal with unknown or unmeasured confounders?
Assume: RR of confounders is distributed exponentially with a minimum RR of one and a mean RR of two.
10
2015 StatChat2V1
Effect Sizes: Relative Risk95% Confounder Resistant: Exp20
Obese vs.non-Obese
11 V1 2015 StatChat2 12
Conclusion
Students should be exposed to the major contributions of statistics to human knowledge.
Including multivariate thinking in the intro course means discussing confounding.
Introducing confounding means dealing with• the Butterfly fallacy, • the Cornfield conditions and • ranking the resilience of an association to
unknown or unmeasured confounders.
12
Confounding: A Big Idea V1 August 1, 2016
www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf Page 1
2015 StatChat2V1 1
Milo Schield, Augsburg CollegeMember: International Statistical Institute
US Rep: International Statistical Literacy Project
Director, W. M. Keck Statistical Literacy Project
VP. National Numeracy Network
Editor: www.StatLit.org
August 1, 2016www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf
Confounding:A Big Idea
Confounding: A Big Idea V1 August 1, 2016
www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf Page 2
V1 2015 StatChat2 2
Core Concepts in Intro Stats
McKenzie (2004): Survey of Educators
Goodall@RSS (2007) Big Ideas in Statistics
Garfield & Ben Zvi (2008): Big Ideas of Statistics
Gould-Miller-Peck (2012). Five Big Ideas
Blitzstein@Harvard (2013): 10 Big Ideas Stat110
Stigler (2016): Seven pillars of statistical wisdom
2
Confounding: A Big Idea V1 August 1, 2016
www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf Page 3
V1 2015 StatChat2 3
Ambiguity of “Importance”
Topic (randomness) or a claim: ME ~ 1/sqrt(n)
This paper focuses on claims or relationships having substantial social or cognitive consequences.
3
Confounding: A Big Idea V1 August 1, 2016
www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf Page 4
V1 2015 StatChat2 4
1A: Fallacies
1. Confusion of the inverse: P(A|B) = P(B|A)
2. Conjunction fallacy: P(A&B) > P(A)
3. P(A&B |C) > P(A |B&C): Three-factor fallacy
4. Individual fallacy
5. Ecological fallacy
6. Simpson’s Paradox
4
Confounding: A Big Idea V1 August 1, 2016
www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf Page 5
V1 2015 StatChat2 5
Contributions of Statisticsto Human Knowledge
.
5
Confounding: A Big Idea V1 August 1, 2016
www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf Page 6
V1 2015 StatChat2 6
#2A: Butterfly Fallacy
One should never trust a statistical association generated by an observational study.
An unknown or unmeasured confounder –regardless of size (a small as a butterfly) – can nullify or reverse an observed association.
6
Confounding: A Big Idea V1 August 1, 2016
www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf Page 7
2015 StatChat2V1
Smoking Causes Cancer:Fisher’s Argument
Observational data: Smokers are 10 times as likely to develop lung cancer as are non-smokers.
Some statisticians wanted to support the claim that smoking “caused” lung cancer.
Sir Ronald Fisher (1958) noted that “association was not causation” and that there was a difference (factor of two) in smoking preference between fraternal and identical twins.
7
Confounding: A Big Idea V1 August 1, 2016
www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf Page 8
2015 StatChat2V1
Smoking Causes Cancer:Cornfield’s Reply
Cornfield et al (1959) argued that to nullify or reverse the observed association, the relative risk of a confounder must exceed the relative risk of that association.
Fisher never replied.
8
“Cornfield's minimum effect size is as important to observational studies as is the use of randomized assignment to experimental studies.” Schield (1999)
Confounding: A Big Idea V1 August 1, 2016
www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf Page 9
2015 StatChat2V1
Cornfield Condition for Nullification or Reversal
Schield (1999) based on realistic data
9
Confounding: A Big Idea V1 August 1, 2016
www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf Page 10
2015 StatChat2V1
Confounder Distribution: Simple One-Parameter Model
How to deal with unknown or unmeasured confounders?
Assume: RR of confounders is distributed exponentially with a minimum RR of one and a mean RR of two.
10
Confounding: A Big Idea V1 August 1, 2016
www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf Page 11
2015 StatChat2V1
Effect Sizes: Relative Risk95% Confounder Resistant: Exp20
Obese vs.non-Obese
11
Confounding: A Big Idea V1 August 1, 2016
www.StatLit.org/pdf/2016-Schield-ASA-Slides.pdf Page 12
V1 2015 StatChat2 12
Conclusion
Students should be exposed to the major contributions of statistics to human knowledge.
Including multivariate thinking in the intro course means discussing confounding.
Introducing confounding means dealing with• the Butterfly fallacy, • the Cornfield conditions and • ranking the resilience of an association to
unknown or unmeasured confounders.
12