+ All Categories
Home > Documents > Regression Discontinuity Design

Regression Discontinuity Design

Date post: 25-Feb-2016
Category:
Upload: anthea
View: 79 times
Download: 1 times
Share this document with a friend
Description:
Regression Discontinuity Design. Motivating example. Many districts have summer school to help kids improve outcomes between grades Enrichment, or Assist those lagging Research question: does summer school improve outcomes Variables: x=1 is summer school after grade g - PowerPoint PPT Presentation
Popular Tags:
80
Regression Discontinuity Design 1
Transcript
Page 1: Regression Discontinuity Design

Regression Discontinuity Design

1

Page 2: Regression Discontinuity Design

Motivating example

• Many districts have summer school to help kids improve outcomes between grades– Enrichment, or– Assist those lagging

• Research question: does summer school improve outcomes

• Variables: – x=1 is summer school after grade g– y = test score in grade g+1

2

Page 3: Regression Discontinuity Design

LUSDINE

• To be promoted to the next grade, students need to demonstrate proficiency in math and reading – Determined by test scores

• If the test scores are too low – mandatory summer school

• After summer school, re-take tests at the end of summer, if pass, then promoted

3

Page 4: Regression Discontinuity Design

Situation

• Let Z be test score – Z is scaled such that• Z≥0 not enrolled in summer school• Z<0 enrolled in summer school

• Consider two kids• #1: Z=ε• #2: Z=-ε• Where ε is small

4

Page 5: Regression Discontinuity Design

Intuitive understanding

• Participants in SS are very different• However, at the margin, those just at Z=0

are virtually identical• One with z=-ε is assigned to summer

school, but z= ε is not• Therefore, we should see two things

5

Page 6: Regression Discontinuity Design

• There should be a noticeable jump in SS enrollment at z=0.

• If SS has an impact on test scores, we should see a jump in test scores at z=0 as well.

6

Page 7: Regression Discontinuity Design

Variable Definitions

• yi = outcome of interest

• xi =1 if NOT in summer school, =1 if in

• Di = I(zi≥0) -- I is indicator function that equals 1 when true, =0 otherwise

• zi = running variable that determines eligibility for summer school. z is re-scaled so that zi=0 for the lowest value where Di=1

• wi are other covariates

7

Page 8: Regression Discontinuity Design

8

Key assumption of RDD models

• People right above and below Z0 are functionally identical– Random variation puts someone above Z0

and someone below– However, this small different generates big

differences in treatment (x)– Therefore any difference in Y right at Z0 is due

to x

Page 9: Regression Discontinuity Design

Limitation

• Treatment is identified for people at the zi=0

• Therefore, model identifies the effect for people at that point

• Does not say whether outcomes change when the critical value is moved

9

Page 10: Regression Discontinuity Design

Table 1

10

Page 11: Regression Discontinuity Design

11

Page 12: Regression Discontinuity Design

12

Page 13: Regression Discontinuity Design

13

Z

Pr(Xi=1 | z)

0

1

Z0

FuzzyDesign

SharpDesign

Page 14: Regression Discontinuity Design

14

E[Y|Z=z]

Z0

E[Y1|Z=z]

E[Y0|Z=z]

Page 15: Regression Discontinuity Design

z0 z

Y

y(z0)

y(z0)+α

z0+h1z0-h1

1hy

1hy

z0+2h1z0-2h1

2 1hy

2 1hy

Page 16: Regression Discontinuity Design

Chay et al.

16

Page 17: Regression Discontinuity Design

17

Page 18: Regression Discontinuity Design

18

Page 19: Regression Discontinuity Design

19

Page 20: Regression Discontinuity Design

20

Page 21: Regression Discontinuity Design

21

Page 22: Regression Discontinuity Design

22

FixedEffectsResults

RD Estimates

Page 23: Regression Discontinuity Design

Table 2

23

Page 24: Regression Discontinuity Design

24

Card et al., QJE

Page 25: Regression Discontinuity Design

25

Page 26: Regression Discontinuity Design

26

Page 27: Regression Discontinuity Design

27

Page 28: Regression Discontinuity Design

28

Page 29: Regression Discontinuity Design

29

Page 30: Regression Discontinuity Design

30

Page 31: Regression Discontinuity Design

Oreopoulos, AER

• Enormous interest in the rate of return to education

• Problem:– OLS subject to OVB– 2SLS are defined for small population (LATE)

• Comp. schooling, distance to college, etc.• Maybe not representative of group in policy

simulations)

• Solution: LATE for large group31

Page 32: Regression Discontinuity Design

• School reform in GB (1944)– Raised age of comp. schooling from 14 to 15– Effective 1947 (England, Scotland, Wales)– Raised education levels immediately– Concerted national effort to increase supplies

(teachers, buildings, furniture)• Northern Ireland had similar law, 1957

32

Page 33: Regression Discontinuity Design

33

Page 34: Regression Discontinuity Design

34

Page 35: Regression Discontinuity Design

35

Page 36: Regression Discontinuity Design

36

Page 37: Regression Discontinuity Design

37

Page 38: Regression Discontinuity Design

38

Page 39: Regression Discontinuity Design

39

Page 40: Regression Discontinuity Design

40

Percent Died within 5 years of Survey, Males NLMS

IncomeGroup

35-54 years of age

55-64 years of age

65-74 years of age

0 to $25,000

3.1 10.8 20.6

$25,001 to $50,000

1.8 6.8 15.3

$50,001 + 1.4 5.1 12.3

-42% -25%-37%

-22% -25% -19%

Page 41: Regression Discontinuity Design

41

Percent Died within 5 years of Survey, Males NLMS

EducationGroup

35-54 years of age

55-64 years of age

65-74 years of age

Less than high school

3.8 11.7 22.1

High school graduate

2.4 8.5 18.7

College graduate

1.4 6.5 13.7

Page 42: Regression Discontinuity Design

42

Percent Died within 5 years of Survey, Females NLMS

EducationGroup

35-54 years of age

55-64 years of age

65-74 years of age

Less than high school

2.0 6.0 11.7

High school graduate

1.3 4.3 9.7

College graduate

0.9 4.0 8.0

Page 43: Regression Discontinuity Design

43

18-64 year olds, BRFSS 2005-2009(% answering yes)

EducLevel

Fair or poor

health

No exer in past

30 daysCurrent smoker Obese

Any bad mental

hlth days past mth

<12 years

40.9 45.8 37.8 43.6 43.7

12-15 years

17.8 27.3 26.5 34.7 38.4

16+ years

7.2 13.5 10.8 24.8 34.2

Page 44: Regression Discontinuity Design

44

Page 45: Regression Discontinuity Design

45

Page 46: Regression Discontinuity Design

Clark and Royer (AER, forthcoming)

• Examines education/health link using shock to education in England

• 1947 law – Raised age of comp. schooling from 14-15

• 1972 law– Raised age of comp. schooling from 16-17

46

Page 47: Regression Discontinuity Design

• Produce large changes in education across birth cohorts

• if education alters health, should see a structural change in outcomes across cohorts as well

• Why is this potentially a good source of variation to test the educ/health hypothesis?

47

Page 48: Regression Discontinuity Design

48

Page 49: Regression Discontinuity Design

49

Page 50: Regression Discontinuity Design

50

Page 51: Regression Discontinuity Design

51

Page 52: Regression Discontinuity Design

52

Page 53: Regression Discontinuity Design

Angrist and Lavy, QJE

Page 54: Regression Discontinuity Design

• 1-39 students, one class• 40-79 students, 2 classes• 80 to 119 students, 3 classes

• Addition of one student can generate large changes in average class size

Page 55: Regression Discontinuity Design

eS= 79, (79-1)/40 = 1.95, int(1.95) =1, 1+1=2, fsc=39.5

Page 56: Regression Discontinuity Design
Page 57: Regression Discontinuity Design
Page 58: Regression Discontinuity Design
Page 59: Regression Discontinuity Design
Page 60: Regression Discontinuity Design

IV estimates reading = -0.111/0.704 = -0.1576IV estimates math = -0.009/0.704 = -0.01278

Page 61: Regression Discontinuity Design
Page 62: Regression Discontinuity Design
Page 63: Regression Discontinuity Design

63

Page 64: Regression Discontinuity Design

64

Urquiola and Verhoogen, AER 2009

Page 65: Regression Discontinuity Design

65

Page 66: Regression Discontinuity Design

66

Page 67: Regression Discontinuity Design

67

Camacho and Conover, forthcoming AEJ: Policy

Page 68: Regression Discontinuity Design

68

Page 69: Regression Discontinuity Design

Sample CodeCard et al., AER

69

Page 70: Regression Discontinuity Design

70

Page 71: Regression Discontinuity Design

71

Page 72: Regression Discontinuity Design

72

Page 73: Regression Discontinuity Design

73

* eligible for Medicare after quarter 259;gen age65=age_qtr>259;

* scale the age in quarters index so that it equals 0;* in the month you become eligible for Medicare;gen index=age_qtr-260;gen index2=index*index;gen index3=index*index*index;gen index4=index2*index2;

gen index_age65=index*age65;gen index2_age65=index2*age65;gen index3_age65=index3*age65;gen index4_age65=index4*age65;

gen index_1minusage65=index*(1-age65);gen index2_1minusage65=index2*(1-age65);gen index3_1minusage65=index3*(1-age65);gen index4_1minusage65=index4*(1-age65);

Page 74: Regression Discontinuity Design

74

* 1st stage results. Impact of Medicare on insurance coverage;* basic results in the paper. cubic in age interacted with age65;* method 1;reg insured male white black hispanic _I* index index2 index3 index_age65 index2_age65 index3_age65 age65, cluster(index);

* 1st stage results. Impact of Medicare on insurance coverage;* basic results in the paper. quadratic in age interacted with;* age65 and 1-age65;* method 2;reg insured male white black hispanic _I* index_1minus index2_1minus index3_1minus index_age65 index2_age65 index3_age65 age65, cluster(index);

Page 75: Regression Discontinuity Design

75

Linear regression Number of obs = 46950 F( 21, 79) = 182.44 Prob > F = 0.0000 R-squared = 0.0954 Root MSE = .25993

(Std. Err. adjusted for 80 clusters in index)------------------------------------------------------------------------------ | Robust insured | Coef. Std. Err. t P>|t| [95% Conf. Interval]-------------+---------------------------------------------------------------- male | .0077901 .0026721 2.92 0.005 .0024714 .0131087 white | .0398671 .0074129 5.38 0.000 .0251121 .0546221

delete some results

index | .0006851 .0017412 0.39 0.695 -.0027808 .0041509 index2 | 1.60e-06 .0001067 0.02 0.988 -.0002107 .0002139 index3 | -1.42e-07 1.79e-06 -0.08 0.937 -3.71e-06 3.43e-06 index_age65 | .0036536 .0023731 1.54 0.128 -.0010698 .0083771index2_age65 | -.0002017 .0001372 -1.47 0.145 -.0004748 .0000714index3_age65 | 3.10e-06 2.24e-06 1.38 0.171 -1.36e-06 7.57e-06 age65 | .0840021 .0105949 7.93 0.000 .0629134 .1050907 _cons | .6814804 .0167107 40.78 0.000 .6482186 .7147422------------------------------------------------------------------------------

Method 1

Page 76: Regression Discontinuity Design

76

Linear regression Number of obs = 46950 F( 21, 79) = 182.44 Prob > F = 0.0000 R-squared = 0.0954 Root MSE = .25993

(Std. Err. adjusted for 80 clusters in index)------------------------------------------------------------------------------ | Robust insured | Coef. Std. Err. t P>|t| [95% Conf. Interval]-------------+---------------------------------------------------------------- male | .0077901 .0026721 2.92 0.005 .0024714 .0131087 white | .0398671 .0074129 5.38 0.000 .0251121 .0546221

delete some results index_1mi~65 | .0006851 .0017412 0.39 0.695 -.0027808 .0041509index2_1m~65 | 1.60e-06 .0001067 0.02 0.988 -.0002107 .0002139index3_1m~65 | -1.42e-07 1.79e-06 -0.08 0.937 -3.71e-06 3.43e-06 index_age65 | .0043387 .0016075 2.70 0.009 .0011389 .0075384index2_age65 | -.0002001 .0000865 -2.31 0.023 -.0003723 -.0000279index3_age65 | 2.96e-06 1.35e-06 2.20 0.031 2.79e-07 5.65e-06 age65 | .0840021 .0105949 7.93 0.000 .0629134 .1050907 _cons | .6814804 .0167107 40.78 0.000 .6482186 .7147422------------------------------------------------------------------------------

Method 2

Page 77: Regression Discontinuity Design

77

Page 78: Regression Discontinuity Design

78

Page 79: Regression Discontinuity Design

79

Results for different outcomesCubic term in Index

OutcomeCoef (std error) on AGE 65

Have Insurance 0.084 (0.011)In good health -0.0022 (0.0141)Delayed medical care -0.0039 (0.0088)Did not get medical care 0.0063 (0.0053)Hosp visits in 12 months 0.0098 (0.0074)

Page 80: Regression Discontinuity Design

80

Sensitivity of results to polynomial

Order Insured In goodHealth

Delayedmed care

Hosp. visits

1 0.094(0.008)

0.0132(0.0093)

-0.0110(0.0054)

0.0238(0.0084)

2 0.091(0.009)

0.0070(0.0102)

-0.0048(0.0064)

0.0253(0.0085)

3 0.084(0.011)

-0.0222(0.0141)

-0.0039(0.0088)

0.0098(0.0074)

4 0.0729(0.013)

0.0048(0.0171)

-0.0120(0.0101)

0.0200(0.0109)

Means age 64

0.877 0.763 0.069 0.124


Recommended