Survey Methodologies for Sensitive Questions
Kosuke Imai
Princeton University
January 30, 2015
International Methods ColloquiumRice University
Joint work withGraeme Blair, Jason Lyall, Bryn Rosenfeld, Jacob Shapiro
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 1 / 30
Introduction
I Challenge of measuring sensitive attitudes and behaviorsI social desirability biasI non-response bias
I Indirect methods becoming increasingly popularI list experimentsI endorsement experimentI randomized response
I Development of statistical methodsI multivariate regression for each survey technique (Bullock, Imai and
Shapiro 2010; Imai 2011; Blair & Imai 2012; Blair, Imai & Zhou 2014)I using responses as predictors in outcome regression (Imai, Park &
Greene In-press)
I Empirical validation studiesI validation against ground truth (Rosenfeld, Imai & Shapiro In-press)I comparison of multiple measurements (Blair, Imai & Lyall 2014)I prediction of behavior (Hirose, Imai & Lyall 2014)
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 2 / 30
The Mississippi Validation Study (Rosenfeld, Imai & Shapiro In-press)
I Estimate voting on anti-abortion referendum using:I direct questionI list experiment (item/unmatched count technique)I endorsement experimentI randomized response
I Validate estimates against official election outcome:I sample from voter history fileI county-level voting recap reports for validation
I Case selection:I a poll conducted 24 hours before the election predicts 44% no votesI the amendment was defeated 58% to 42%
I Findings:I direct question significant under-estimation though efficientI indirect methods much less biased though more variableI endorsement and randomized response least bias
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 3 / 30
Direct Question
Did you vote YES or NO on the Personhood Initiative, which
appeared on the November 2011 Mississippi General Election
ballot?
Voted Yes
Voted No
Did not vote
Don’t know
Refused
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 4 / 30
List Experiment
Here is a list of four things that some people have done and some
people have not. Please listen to them and then tell me HOW MANY
of them you have done in the past two years. Do not tell me
which you have and have not done. Just tell me how many:
Discussed politics with family or friends
Cast a ballot for Governor Phil Bryant
Paid dues to a union
Given money to a Tea Party candidate or organization
(treatment) Voted ‘YES’ on the ‘Personhood’ Initiative
How many of these things have you done in the past two years?
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 5 / 30
Endorsement Experiment
We’d like to get your overall opinion of some people in the news.
As I read each name, please say if you have a very favorable,
somewhat favorable, somewhat unfavorable, or very unfavorable
opinion of each person.
(control) Phil Bryant, Governor of Mississippi?
(treatment) Phil Bryant, Governor of Mississippi, who
campaigned in favor of the ‘Personhood’ Initiative on the
2011 Mississippi General Election ballot?
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 6 / 30
Randomized Response
To answer this question, you will need a coin. Once you have
found one, please toss the coin two times and note the results of
those tosses (heads or tails) one after the other on a sheet of
paper. Do not reveal to me whether your coin lands on heads or
tails. After you have recorded the results of your two coin
tosses, just tell me you are ready and we will begin.
Now, please answer ‘yes’ if either your second coin toss came up
heads or you voted ‘YES’ on the Personhood Initiative, which
appeared on the November 2011 Mississippi General Election
ballot.
Yes
No
Don’t know
Refused
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 7 / 30
Method for List Experiment (Imai 2011; Blair & Imai 2012)
I Setup:I Yi : observed responseI Xi : observed covariatesI Z∗i : latent response to the sensitive itemI Y ∗i : latent response to the control itemsI Ti : treatment such that Yi = Y ∗i + TiZ
∗i
I Assumptions: (1) no design effect, (2) no liar
I A total of (2× (J + 1)) “types” (Y ∗i ,Z∗i )
I Example: three control items (J = 3)
Yi Treatment group Control group
4 (3,1)3 (2,1) (3,0) (3,1) (3,0)2 (1,1) (2,0) (2,1) (2,0)1 (0,1) (1,0) (1,1) (1,0)0 (0,0) (0,1) (0,0)
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 8 / 30
Method for List Experiment (Imai 2011; Blair & Imai 2012)
I Setup:I Yi : observed responseI Xi : observed covariatesI Z∗i : latent response to the sensitive itemI Y ∗i : latent response to the control itemsI Ti : treatment such that Yi = Y ∗i + TiZ
∗i
I Assumptions: (1) no design effect, (2) no liar
I A total of (2× (J + 1)) “types” (Y ∗i ,Z∗i )
I Example: three control items (J = 3)
Yi Treatment group Control group
4 (3,1)3 (2,1) (3,0) (3,1) (3,0)2 (1,1) (2,0) (2,1) (2,0)1 ���(0,1) ���(1,0) (1,1) ���(1,0)0 ���(0,0) ���(0,1) ���(0,0)
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 9 / 30
Statistical Model for List Experiment
I Submodel for the response to the sensitive item Z ∗i :
Pr(Z ∗i = 1 | Xi ) = logit−1(α + β>Xi )
I Submodel for the responses to the control items Y ∗i :
Pr(Y ∗i = y | Xi ,Z∗i ) = Binomial(J, logit−1(γ + δ>Xi + ζZ ∗i ))
I Combine them under the likelihood framework
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 10 / 30
Model for Endorsement Experiment (Bullock, Imai & Shapiro 2011)
I Setup:I Ti : treatmentI Yi : observed (ordinal) responseI Y ∗i : latent (continuous) responseI Xi : observed covariatesI V ∗i : latent ideological positionI Z∗i : latent additional support inducted by the endorsement
I Latent measurement model:
Y ∗iindep.∼ N (β(V ∗i + TiZ
∗i )− α, 1)
with appropriate cut-points
I Hierarchical model V ∗i and Z ∗i :
V ∗iindep.∼ N (δ>Xi , 1)
Z ∗iindep.∼ N (λ>Xi , ω
2)
I Probability of positive support: Pr(Z ∗i > 0 | Xi )
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 11 / 30
Model for Randomized Response (Blair, Imai & Zhou 2014)
I Setup:I Yi : observed responseI Z∗i : latent response to the sensitive itemI Xi : covariates
I The model is,
Pr(Z ∗i = 1 | Xi ) = logit−1(α + β>Xi )
I The likelihood function is given by,
n∏i=1
{1
2· logit−1(α + β>Xi ) +
1
2
}Yi{
1− 1
2· logit−1(α + β>Xi )−
1
2
}1−Yi
I Many other designs and accompanying methods are available
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 12 / 30
Bias of the Direct Question
● ●● ●
● ● ●●
●
● ● ● ●● ● ● ●
●
●
0.0
0.2
0.4
0.6
0.8
1.0
Pro
port
ion
of 'n
o' v
otes
on
Per
sonh
ood
●
●●
●
●
● ●
●
●
●
●
●
●
●
●
●●
●
●
Car
roll
Linc
oln
Sto
ne
Han
cock
Law
renc
e
Mon
tgom
ery
Laud
erda
le
Jack
son
Pan
ola
Yazo
o
Cop
iah
Okt
ibbe
ha
For
rest
War
ren
Lafa
yette
Mad
ison
Ada
ms
Sun
flow
er
Hin
ds
Actual vote
Direct question
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 13 / 30
Pooled Analysis
seq(
from
= −
2, to
= 1
.5, l
engt
h =
12)
0.0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0E
stim
ated
pro
port
ion
of 'n
o' v
otes
on
Per
sonh
ood
●
●
●
●
actual vote share
DirectQuestion
(n = 2,655)
ListExperiment(n = 1,352)
EndorsementExperiment(n = 1,841)
RandomizedResponse(n = 943)
● UnweightedWeightedRegression Adjusted
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 14 / 30
County-level Analysis
●
● ●●
●
●
●
●
●●●
●
●●
●
●●
●●
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Actual
Est
imat
e
bias = 0.249RMSE = 0.254
●●●●● ●● ●●●
●●
●●
●● ●
●●
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Actual
Est
imat
e
bias = 0.170RMSE = 0.183
●
●
●●
●●
●
●
●●●
●●
●●●
●●
●
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Actual
Est
imat
e
bias = −0.016RMSE = 0.102
●●
●
●●
●
●
●
●●●
●
●●●
●
●●●
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Actual
Est
imat
e
bias = −0.022RMSE = 0.074
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Actual
Est
imat
e
bias = 0.215RMSE = 0.224
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Actual
Est
imat
e
bias = 0.144RMSE = 0.161
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Actual
Est
imat
e
bias = 0.030RMSE = 0.139
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Actual
Est
imat
e
bias = −0.042RMSE = 0.119
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Actual
Est
imat
e
bias = 0.198RMSE = 0.204
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Actual
Est
imat
e
bias = 0.124RMSE = 0.142
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Actual
Est
imat
e
bias = 0.118RMSE = 0.141
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Actual
Est
imat
e
bias = 0.010RMSE = 0.067
Unweighted
Weighted
RegressionAdjusted
Direct Question List Experiment Endorsement Experiment Randomized Response
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 15 / 30
Hearts and Minds in Afghanistan (Blair, Imai & Lyall, 2014)
I How do we measure civilian attitudes in a conflict setting?
I Current efforts in Afghanistan rely on direct questions:
1. USAID (TCAPF): “Who do you believe can solve your problems?”2. ISAF (ANQAR): “Over the past 6 months, do you think the Taliban
have grown stronger, grown weaker, or remained the same?”
I Why are direct questions a bad idea?
1. Threats to enumerators and respondents2. Nonresponse, social desirability bias3. Interviews are public4. Danger of selection bias in sampling locations (role of gatekeepers)
I ANQAR (November-December 2011): 50% refusal rate
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 16 / 30
Public Nature of Interviews
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 17 / 30
Surveying in the Heartland of Insurgency
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 18 / 30
Endorsement Experiments
I Script for the control group:
A recent proposal calls for the sweeping reform of the
Afghan prison system, including the construction of new
prisons in every district to help alleviate
overcrowding in existing facilities. Though expensive,
new programs for inmates would also be offered, and new
judges and prosecutors would be trained. How do you
feel about this proposal?
Strongly agree; Agree; Indifferent; Disagree;
Strongly disagree; Don’t Know; Refuse to answer
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 19 / 30
Endorsement Experiments
I Script for the treatment group:
A recent proposal by ISAF calls for the sweeping reform
of the Afghan prison system, including the construction
of new prisons in every district to help alleviate
overcrowding in existing facilities. Though expensive,
new programs for inmates would also be offered, and new
judges and prosecutors would be trained. How do you
feel about this proposal?
Strongly agree; Agree; Indifferent; Disagree;
Strongly disagree; Don’t Know; Refuse to answer
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 20 / 30
Data from the Endorsement Experiments
Control
ISAF
Taliban
Control
ISAF
Taliban
Control
ISAF
Taliban
Control
ISAF
Taliban
Control
ISAF
Taliban
Control
ISAF
Taliban
Control
ISAF
Taliban
Control
ISAF
Taliban
Control
ISAF
Taliban
Control
ISAF
Taliban
Control
ISAF
Taliban
Control
ISAF
Taliban
Direct Elections Prison ReformIndependent Election
CommissionAnti−Corruption
ReformO
vera
ll(N
= 2
754)
Hel
man
d(N
= 8
55)
Kho
st(N
= 6
30)
Kun
ar(N
= 3
96)
Loga
r(N
= 4
86)
Uro
zgan
(N =
387
)
Strongly agree Agree Indifferent Disgree Strongly
disagree Don't Know Refused
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 21 / 30
List Experiments
I Script for the control group:
I’m going to read you a list with the names of
different groups and individuals on it. After I read
the entire list, I’d like you to tell me how many of
these groups and individuals you broadly support,
meaning that you generally agree with the goals and
policies of the group or individual. Please don’t tell
me which ones you generally agree with; only tell me
how many groups or individuals you broadly support.
Karzai Government; National Solidarity Program; Local
Farmers
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 22 / 30
List Experiments
I Script for the treatment group:
I’m going to read you a list with the names of
different groups and individuals on it. After I read
the entire list, I’d like you to tell me how many of
these groups and individuals you broadly support,
meaning that you generally agree with the goals and
policies of the group or individual. Please don’t tell
me which ones you generally agree with; only tell me
how many groups or individuals you broadly support.
Karzai Government; National Solidarity Program; Local
Farmers; ISAF
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 23 / 30
Comparing and Combining List and Endorse Experiments
I Need for validation =⇒ Multiple measurement strategy
I Two measures should give similar results
I What is the probability of supporting ISAF?
1. List: prob. of saying yes to the sensitive item2. Endorsement: prob. of endorsement having a positive effect on support
for policy
I These probabilities should be similar!
I They can be estimated with a new multivariate regression method
I Endorsement and list experiments can even be combined for a jointanalysis
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 24 / 30
Statistical Analysis of Endorsement Experiments
I Item response theory to combine questions:
Pr(Yij = 1 | Ti = k) = Φ(αj + βj(xi + sijk))
I αj : average popularity of policy jI βj : how much policy j differentiates pro- and anti-reform respondentsI xi : “ideal point” = how pro-reform respondent i isI sijk : support level for combatant k in policy j
I Multi-stage sampling Multi-level modeling
sijkindep.∼ N (λk,village[i ] + Z>i λ
Zk , ω
2k,village)
λk,village[i ]indep.∼ N (λk,district[i ] + V>village[i ]λ
Vk , ω
2k,district)
λk,district[i ]indep.∼ N (λk,province[i ] + W>
district[i ]λWk , ω
2k,province)
I Same multi-level structure for ideal points xi
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 25 / 30
Descriptive Comparison: Overall
●
● ●
●●
●
●●
●
●
●●
●● ●
●
●●
● ●●●
●
●●● ●
●
●●
●
●
● ●
●● ●
●
●
●
●
●
●●
●
●
●●
●●● ●●
●
●
●
Control Group
Endorsement Experiment
List
Exp
erim
ent
01
23
4
1 2 3 4 5
ρ = 0.16
τ = 0.10
●●
●
● ●
●●
●
●
●
●● ●
●
●●● ●● ●
●
●
●
●
● ●
●●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●● ●
●
●
●
●
●●
●
●
●
● ●
●
●
●
● ●
●
●
●●
●
●
●
ISAF Treatment Group
Endorsement Experiment
01
23
41 2 3 4 5
ρ = 0.52
τ = 0.43
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 26 / 30
Descriptive Comparison: Question by Question
●●
●
●●
●
●
● ●
●●
●●
●
● ●
●
●
List
Exp
erim
ent
01
23
4
1 2 3 4 5
ρ = 0.44τ = 0.37
Direct Elections (p < .01)
ISA
F T
reat
men
t Gro
upC
ontro
l Gro
up
●
● ●●
●●●
●
●
●●●
Endorsement Experiment
List
Exp
erim
ent
01
23
4
1 2 3 4 5
ρ = 0.18τ = 0.14
●
●● ●
●
●●
●
● ●
●
●● ●
●
●
●
01
23
4
1 2 3 4 5
ρ = 0.12τ = 0.10
Prison Reform (p = 0.26)
●●
●
●●
●●●
●
●
●●
Endorsement Experiment
01
23
4
1 2 3 4 5
ρ = 0.09τ = 0.08
●
● ●
●●
● ●
●
●
●
●
●
●
●●● ●
●
01
23
4
1 2 3 4 5
ρ = 0.44τ = 0.38
Election Commission (p < .01)
●
●
●
●●
●●
●●
●
● ●
●
●
Endorsement Experiment
01
23
4
1 2 3 4 5
ρ = 0.10τ = 0.07
● ●
●●
●
●
●
●
●
●●
●●●
●
●
●
01
23
4
1 2 3 4 5
Corruption Reform (p < .01)
ρ = 0.50τ = 0.42
●
●
●●
●
●●
●
●
●
●
●
Endorsement Experiment
01
23
4
1 2 3 4 5
ρ = 0.04τ = 0.03
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 27 / 30
Overall Proportion of ISAF Supportersse
q(fr
om =
−2,
to =
1.5
, len
gth
= 1
2)
−0.25
0.00
0.25
0.50
Ove
rall
Pro
port
ion
of IS
AF
Sup
port
ers
●●
List Endorsement Difference(List − Endorse)
Combined
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 28 / 30
Concluding Remarks
I Direct question is severely biased
I All indirect methods reduce bias:I Endorsement and randomized response least biasI List experiment ceiling/floor effects, design effectsI Ease of implementation: list > endorse > randomized response
I Use of multiple-measurement strategies when truth is not available
I Future research directions:I How to balance bias, precision, and costI Use aggregate-level truth to improve individual-level estimates
I Open-source software:I list for list experiment (Blair, Imai & Park)I endorse for endorsement experiment (Shiraito & Imai)I rr for randomized response (Blair, Imai & Zhou)
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 29 / 30
Project Reference http://imai.princeton.edu/projects/sensitive.html
I Development of new methods:I “Statistical Analysis of Endorsement Experiments: Measuring Support for
Militant Groups in Pakistan.” Political AnalysisI “Multivariate Regression Analysis for the Item Count Technique.” Journal of
the American Statistical AssociationI “Statistical Analysis of List Experiments.” Political AnalysisI “Comparing and Combining List and Endorsement Experiments: Evidence
from Afghanistan.” American Journal of Political ScienceI “Using the Predicted Responses from List Experiments as Explanatory
Variables in Regression Models.” Political AnalysisI “Design and Analysis of the Randomized Response Technique.”
I Empirical applications and validations:
I “Explaining Support for Combatants during Wartime: A Survey Experiment
in Afghanistan.” American Political Science ReviewI “An Empirical Validation Study of Popular Survey Methodologies for
Sensitive Questions.” American Journal of Political ScienceI “Can Civilian Attitudes Predict Civil War Violence?”
Kosuke Imai (Princeton) Sensitive Questions January 30, 2015 (IMC) 30 / 30