2/17/2015
1
SAMPLING HARD-TO-REACH POPULATIONS
Karen Foote Retzer
February 18, 2015
www.srl.uic.edu
GENERAL INFORMATION
2
• Please hold questions until the end of the presentation
• Slides available at http://www.srl.uic.edu/SEMINARS/Spring15Seminars.htm
• Please raise your hand so that I can see that you can hear me
2/17/2015
2
OUTLINE
3
• Introduction
• Sampling Hard-to-Reach Populations using Probability Sampling
• Non-Probability Sample Designs
HARD-TO-SURVEY POPULATIONS
4
• Hard to sample
• Hard to identify
• Hard to find
• Hard to persuade
• Hard to interview
(Tourangeau, 2014)
2/17/2015
3
EXAMPLES
5
� Homeless
� Prostitutes
� Migrants
� Illegal Immigrants
� Rare racial/ethnic groups
� Visitors to a museum
� Travelers through airports
� Gay men
� Smokers
PROBABILITY VS. NON-PROBABILITY
SAMPLING
6
Probability
• Generalize to the population from which sample is drawn
• Unbiased results
• Known, non-zero probability of selection
Non-Probability
• Exploratory research
• Convenience
• Probability of selection unknown
2/17/2015
4
TRADITIONAL PROBABILITY SAMPLES
7
• Screen all cases to locate eligible respondents
• Disproportionate Stratified Sampling
• Dual Frames
PROBABILITY SAMPLE: SCREEN ALL
CASES TO FIND ELIGIBLE RESPONDENTS
8
Target population: adults who smoke cigarettes.
Sample and screen individuals in randomly selected households.
� Area probability (face-to-face)
• Address-based sample (mail)
Time consuming & expensive.
2/17/2015
5
PROBABILITY SAMPLE: DISPROPORTIONATE
STRATIFIED SAMPLING
9
Target Population: Households with income $200,000 or higher
Use census data to find blockgroups with households having higher incomes.
Sample households from all blockgroups, but oversample those with higher incomes.
Post-stratification weights.
PROBABILITY SAMPLE:
DUAL/MULTIPLE FRAMES
10
Target population: households with pets
Use two frames:
Households
Lists of people from veterinarian offices
Must determine how to weight the final dataset.
Dual frames becoming more common with
changes in telephone technology.
2/17/2015
6
DUAL FRAME: PHONE SURVEY
11
Contacting a nationwide
sample by phone
PROBABILITY SAMPLES: DUAL LANDLINE
AND CELL FRAMES
• Increase of general population that is cell only
• Cell phone usage differences within the population
• Race and ethnicity
• Age
• Income
• Region of country
12
2/17/2015
7
PROBABILITY SAMPLE: DUAL LANDLINE
AND CELL FRAMES
13
Contacting national sample using
cell phone and landline phones
(dual frame design)
PROBABILITY SAMPLE: DUAL LANDLINE
AND CELL FRAMES
• Difficult to target smaller geographies by cell phone
• Cell-phone-only individuals who have a cell number from a state, but who do not live in the state.
• Cell-phone-only individuals who live in state who do not have cell number from that state
• Smaller geographies even more difficult to target.
14
2/17/2015
8
RECAP: PROBABILITY SAMPLE
15
Each element has a known, non-zero probability of selection
Results can be generalized to the population from which the sample was drawn.
• Screening
• Disproportionate Sampling
• Dual frame
LESS TRADITIONAL
SAMPLE DESIGNS
16
• Location (venue-based)
• Time location
• Respondent-driven
• Snowball
• Network/multiplicity
• Convenience
• Surnames
2/17/2015
9
LOCATION SAMPLING
17
� Location sampling can take place at . . .
� Museums
� National Parks
� Shelters
� Street corners
� Two types of sampling
� Sampling visits
� Sampling visitors
TIME LOCATION SAMPLING
18
An extension of location sampling where
the timing of the sample selection is also
taken into consideration.
2/17/2015
10
SNOWBALL
19
� Initial sample of respondents selected
� Convenience
� Randomly
� Respondents identify others to be interviewed
RESPONDENT-DRIVEN SAMPLING
20
� Initial sample of respondents selected (similar to snowball)
� Respondents become the ‘seeds’ and are given coupons to give to others in their networks who are eligible
� Maintains privacy for those recruited
� Respondents receive monetary incentive
2/17/2015
11
MULTIPLICITY SAMPLING
• Includes non-household members with whom informant is familiar, such as close relatives
• Informant must know needed information about network members
• Informant must know size of network
• Weighting
21
SURNAME LISTS
22
� Some populations can be identified by surname
� Hispanics
� Chinese
� Korean
2/17/2015
12
CONVENIENCE
23
� Questionnaires that potential respondents can pick up at given location
� Persons who take part in studies recruited from flyers
� Hotel guests who complete questionnaires in hotels
� Respondents who are intercepted at malls, etc.
HOW TO SELECT SAMPLE DESIGN
It depends on . . .
• How common or rare your population is
• How geographically clustered/dispersed your population is
• Whether or not you want to generalize your findings
• Why the respondents are hard to reach
• Whether or not you have a list
24
2/17/2015
13
SELECTED BIBLIOGRAPHY
25
Blumberg SJ, Luke JV. Wireless substitution: Early release of estimates from the National Health Interview Survey. July-December 2012. National Center for Health Statistics. July 2013. Available from: http://www.cdc.gov/nchs/nhis.htm.
Tourangeau, R., Edwards, B., Johnson, T., Wolter, K., & Bates, N. (Eds.). (2014). Hard-to-survey populations. Cambridge: Cambridge University Press.
Wagner, J., & Lee, S. (2015). Sampling Rare Populations. In T. Johnson (Ed.), Handbook of Health Survey Methods
(pp. 77-104). Hoboken: John Wiley & Sons, Inc.
Thank you!Please be sure to complete your evaluation
when you receive it later today . . .
26