+ All Categories
Home > Documents > Mary H. Mulry U.S. Census Bureau 2011 International Total Survey Error Workshop June 21, 2011

Mary H. Mulry U.S. Census Bureau 2011 International Total Survey Error Workshop June 21, 2011

Date post: 01-Jan-2016
Category:
Upload: magee-valdez
View: 19 times
Download: 1 times
Share this document with a friend
Description:
Overview of error model for estimates of foreign-born immigration using data from the American Community Survey. Mary H. Mulry U.S. Census Bureau 2011 International Total Survey Error Workshop June 21, 2011. Background. - PowerPoint PPT Presentation
Popular Tags:
24
Overview of error model for estimates of foreign-born immigration using data from the American Community Survey Mary H. Mulry U.S. Census Bureau 2011 International Total Survey Error Workshop June 21, 2011
Transcript

Overview of error model for estimates of foreign-born immigration using data from the American Community Survey

Mary H. Mulry

U.S. Census Bureau

2011 International Total Survey Error Workshop

June 21, 2011

Background

• Two programs use estimates of foreign-born immigration based on data from the American Community Survey (ACS)– Demographic Analysis estimates of the U.S. population on

April 1, 2010 included estimates of foreign-born immigration each year between 2000 & 2010

– Population Estimates Program (PEP) estimates of the U.S. population on July 1 each year include estimates of foreign-born immigration in previous year.

• Main estimation method for foreign-born immigration uses the responses to two ACS questions – citizenship– residence one year ago (ROYA)

2

Goal

• Describe the uncertainty in the estimates of foreign-born immigration

• Design an error model that accounts for sampling and nonsampling errors

• Propose reasonable estimates based on studies of nonsampling errors in ACS for other purposes or studies of nonsampling errors in other surveys– Time limitations prevent us from conducting

studies to measure ACS nonsampling errors

3

Strategy

• Use the error model in the design of a simulation to study the propagation of errors

• Results of simulation study will produce sensitivity analyses that assess the uncertainty in estimates of foreign-born immigration

4

5

“Is this person a CITIZEN of the United States?”

Yes, U.S. citizen by naturalization – Print year of naturalization

____________ No, not a U.S citizen

ACS Question 8

Native

Foreign born

6

“Did this person live in this house or apartment 1 year ago?”

No, outside the United States and Puerto Rico – Print name of foreign country, or U.S. Virgin Islands, Guam, etc. below_____________________

ACS Question 15a

7

Estimate of foreign-born immigration T :

Notation (within a population control cell)

P = size of population

F = size of the foreign-born population (Question 8)

Y = size of foreign-born population whose residence one year ago was outside the U.S. (Question 14a)

= Postcensal Estimates Program estimate of P

= ACS estimate of proportion of household population who

are foreign-born (F/P) = ACS estimates of proportion of the foreign-born whose

residence one year ago was outside the U.S. (Y/F)

T P s rF Y

s F

rY

P

Error model for

β = bias

ε = random error

8

Model for bias in

β = βI + βD

βI = bias from inconsistency in characteristics

in ACS and PEP used in forming the control cells

βD = bias from errors in the data occurring

during collection and processing

9

Model for random error in

ε = εS + εS

εS = the random error due to sampling error

εM = the random error due to imputation for missing

data

10

Potential sources of inconsistencies in characteristics in PEP & ACS ( βI )

• ACS error in the coding of responses to race/Hispanic ethnicity questions

• Changes in reporting of race/Hispanic ethnicity since 2000• Census 2000 is the base for PEP estimates

• Differences between ACS & PEP data sources caused by errors or changes in methodology

11

Potential sources of data error in the PEP estimates ( βD )

• Coverage error in Census 2000 numbers (used as a base)

• Error in the data used to form updated population estimates during the decade

12

Potential sources of data errorin ACS collection (βD)

• Frame coverage error• Misclassification: foreign-born vs. native-born• For foreign-born, residence 1 year ago errors• Address errors

• wrong address interviewed• other errors: HU vs. not; vacant vs. occupied

• Roster errors• omissions & erroneous inclusions

 13

Estimating bias due to ACS data collection error

ACS uses 3 modes in a sequence• mail, telephone, in-person

May have error estimates by mode only

Estimator of bias due to data collection error is weighted sum of estimated biases for modes

where weights are weighted proportion of responses by each mode

14

Potential sources of data errorin ACS processing (βD)

• Error in editing • Question 8 (citizenship) • Question 15a (residence 1 year ago)

• Error in coding • Question 15a (residence 1 year ago) only

• Error in keying • Question 15a (residence 1 year ago) only

15

Random Error due to imputation(εM)

• ACS imputation for missing data uses geography-based hot-deck method• Question 8 (citizenship)• Question 15a (residence 1 year ago)

16

17

Random error due to sampling (εS)foreign-born immigration (thousands)

Period (July 1 to June 30) ACS year estimate standard error 1999-2000 2000 1,420 42.4 2000-2001 2001 1,421 35.2 2001-2002 2002 1,228 33.6 2002-2003 2003 1,025 28.82003-2004 2004 1,124 31.32004-2005 2005 1,188 17.82005-2006 2006 1,190 15.92006-2007 2007 1,114 20.32007-2008 2008 1,082 16.8

Notes: 1) Estimates are 3-year averages before 2005 due to smaller sample size 2) Standard errors calculated with weights prior to population controls

Indicators of General Quality of the ACS

• Coverage ratios for total populationACS estimate/PEP estimate

• Response ratesAccounts for ACS multi-mode collection and

selection of sub-sample for last mode

18

19

2000 2001 2002 2003 2004 2005 2006 2007 200860%

65%

70%

75%

80%

85%

90%

95%

100%

. Annual Response Rates: 2000-2008

ACS CPS NCVS NHIS SIPPYear

Rat

e

Note: ACS has longer response period and a sequence of 3 modes of collection.

20

Coverage ratios by race

FY00 FY01 FY02 FY03 FY04 FY05 FY06 FY07 FY080.750000000000002

0.800000000000002

0.850000000000002

0.900000000000002

0.950000000000002

1

Blacks: Coverage ratios by year by survey

ACS CPS NCVS SIPP NHIS

FY00 FY01 FY02 FY03 FY04 FY05 FY06 FY07 FY080.750000000000002

0.800000000000002

0.850000000000002

0.900000000000002

0.950000000000002

1

Nonblacks: Coverage ratios by year by survey

ACS CPS (white only) NCVSSIPP NHIS

21

Coverage ratios by sex

FY00 FY01 FY02 FY03 FY04 FY05 FY06 FY07 FY080.750000000000002

0.800000000000002

0.850000000000002

0.900000000000002

0.950000000000002

1

Females: Coverage ratios by year by survey

ACS CPS NCVS SIPP NHIS

FY00 FY01 FY02 FY03 FY04 FY05 FY06 FY07 FY080.750000000000002

0.800000000000002

0.850000000000002

0.900000000000002

0.950000000000002

1

Males: Coverage ratios by year by survey

ACS CPS NCVS SIPP NHIS

Coverage ratios for ACS

• Appear fairly stable

• Higher for females than for males

• Higher for nonblacks than for blacks

• Coverage ratio for Hispanics in 2007 & 2008 was 92.8 and 92.5, respectively

22

Next steps• Form estimates of the nonsampling errors,

their variances, & covariance matrix • Draw replicates from their joint distribution to

produce the distribution of foreign-born immigration.

• Resulting estimates • bias• random error

• Perform sensitivity analyses to aid in determining the most influential error sources

23

[email protected]

U.S. Census Bureau

24


Recommended