1 / 58

Introduction of Regression Discontinuity Design (RDD)

Introduction of Regression Discontinuity Design (RDD). This Talk Will:. I ntroduce the history and logic of RDD, Consider conditions for its internal validity, Considers its sample size requirements, Consider its dependence on functional form, Illustrate some specification tests for it,

chakra
Download Presentation

Introduction of Regression Discontinuity Design (RDD)

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Introduction of Regression Discontinuity Design (RDD)

  2. This Talk Will: • Introduce the history and logic of RDD, • Consider conditions for its internal validity, • Considers its sample size requirements, • Consider its dependence on functional form, • Illustrate some specification tests for it, • Describe an application. • Consider limits to its external validity, • Consider how to deal with noncompliance,

  3. RDD History • In the beginning there was Thislethwaite and Campbell (1960) • This was followed by a flurry of applications to Title I (Trochim, 1984) • Only a few economists were involved initially (Goldberger, 1972) • Then RDD went into hibernation • It recently experienced a renaissance among economists (e.g. Hahn, Todd and van derKlaauw, 2001; Jacob and Lefgren, 2002) • Tom Cook has written about this story

  4. RDD Logic • Selection on an observable (a rating) • A tie-breaking experiment • Modeling close to the cut-point • Modeling the full distribution of ratings

  5. Many different rules work like this. Examples: • Whether you pass a test • Whether you are eligible for a program • Who wins an election • Which school district you reside in • Whether some punishment strategy is enacted • Birth date for entering kindergarten This last one should look pretty familiar-Angrist and Krueger’s quarter of birth was essentially a regression discontinuity idea

  6. The key insight is that right around the cutoff we can think of people slightly above as identical to people slightly below Formally we can write it the model as: if is continuous then the model is identified (actually all you really need is that it is continuous at x = x*)

  7. To see it is identified not that Thus That it

  8. There is nothing special about the fact that Ti was binary as long as there is a jump in the value of Ti at x* • This is what is referred to as a “Sharp Regression Discontinuity” • There is also something called a “Fuzzy Regression Discontinuity” • This occurs when rules are not strictly enforced

  9. The size of the discontinuity at the cutoff is the size of the effect.

  10. Conditions for Internal Validity • The outcome-by-rating regression is a continuous function (absent treatment). • The cut-point is determined independently of knowledge about ratings. • Ratings are determined independently of knowledge about the cut-point. • The functional form of the outcome-by-rating regression is specified properly.

  11. RDD Statistical Model where: Yi = outcome for subject i, Ti = one for subjects in the treatment group and zero otherwise, Ri = rating for subject i, ei = random error term for subject i, which is independently and identically distributed

  12. Sample Size Implications • Because of the substantial multi-collinearity that exists between its rating variable and treatment indicator, an RDD requires 3 to 4 times as many sample members as a corresponding randomized experiment

  13. Specification Tests • Using the RDD to compare baseline characteristics of the treatment and comparison groups • Re-estimating impacts and sequentially deleting subjects with the highest and lowest ratings • Re-estimating impacts and adding: • a treatment status/rating interaction • a quadratic rating term • interacting the quadratic with treatment status • Using non-parametric estimation

  14. Here we see a discontinuity between the regression lines at the cutoff, which would lead us to conclude that the treatment worked. But this conclusion would be wrong because we modeled these data with a linear model when the underlying relationship was nonlinear

  15. Here we see a discontinuity that suggests a treatment effect. However, these data are again modeled incorrectly, with a linear model that contains no interaction terms, producing an artifactualdiscontinuity at the cutoff…

  16. Example: State Pre-K • Pre-K available by birth date cutoff in 38 states, here scaled as 0 (zero) • 5 chosen for study and summed here • How does pre-K affect PPVT (vocabulary) and print awareness (pre-reading)

  17. Correct specification of the regression line of assignment on outcome variable

  18. Best case scenario –regression line is linear and parallel (NJ Math)

  19. Sometimes, form is less clear

  20. So, what to do?

  21. Graphical approaches

  22. Parametric approaches • Alternate specifications and samples • Include interactions and higher order terms • Linear, quadratic, & cubic models • Look for statistical significance for higher order terms • When functional form is ambiguous, overfit the model (Sween1971; Trochim1980) • Truncate sample to observations closer to cutoff • Bias versus efficiency tradeoff

  23. Non-parametric approaches • Eliminates functional form assumptions • Performs a series of regressions within an interval, weighing observations closer to the boundary • Use local linear regression because it performs better at the boundaries • What depends on selecting correct bandwidth? Key tradeoff in NP estimates: bias vs precision–How do you select appropriate bandwidth?–Ocular/sensitivity tests • Cross-validation methods • “Leave-one-out” method

  24. State-of-art is imperfect • So we test for robustness and present multiple estimates

  25. Example I

  26. Example II

  27. Do Better Schools Matter? Parental Valuation ofElementary Education Sandra Black, QJE, 1999 In the Tiebout model parents can “buy” better schools for their children by living in a neighborhood with better public schools How do we measure the willingness to pay? Just looking in a cross section is difficult: Richer parents probably live in nicer houses in areas that are better for many reasons

  28. Black uses the school border as a regression discontinuity • We could take two families who live on opposite side of the same street, but are zoned to go to different schools • The difference in their house price gives the willingness to pay for school quality.

More Related