I want to do a non parametric RDD type analysis to know the impact of an intervention (a single dummy variable) on an outcome variable. I have several 'boundaries' (which are actually different geographical locations) around which I will be picking observations.
Consider that I have only 2 observations at each boundary- one with the intervention and one without it. I can assume that all other relevant variables are the same for the pair of observations at each boundary.
Can I then regress the outcome variable on the intervention dummy along with boundary fixed effects? My main concern is regarding use of fixed effects with just 2 observations at each boundary — I am not sure if that is legit.
Best Answer
For reasons explained in my comment, you will get identical estimates for the treatment coefficient.
Here's a numerical example of "hard" RDD using Stata. We will use a experimental dataset of 12 cars. Each car was run once without a beneficial fuel additive (condition 1) and once with (condition 2). The outcome is miles per gallon. This setup is similar to your cross-border pairs, where one member is treated, but they are otherwise similar.
All 3 methods yield an estimated marginal improvement of 1.75 miles per gallon, with a standard error of 0.78.