ECON 628 – 3rd Empirical Assignment
Due on Tuesday, November 29 2005
This is a follow-up on the first assignment based on Lalonde’s data. The
goal of this assignment is to use propensity score and matching methods to
compare experimental and non-experiment estimates of the impact of training
programs on earnings. For this
assignment, simply focus on “PSID-1” as the (non-experimental) control
group. There is no need to perform the
below analysis using the experimental control group.
- Estimate
the propensity score using a logit model and
compare the distribution of the propensity score for the treatment and
control group using an histogram.
- Now
use the propensity score to estimate the treatment effect using regression
methods. First, estimate a
regression model without interaction terms between the propensity score
and the treatment dummy. Do you
find that a cubic specification for the propensity score is adequate or do
you need higher order terms?
Second, estimate a model with interactions between the propensity
score and the treatment dummy. For
the cubic model, do you reject the more restrictive model without
interactions? Finally, compute the “treatment effect for the treated” and
the ATE (assuming that PSID-1 represents the whole population) in the
model with interactions. Do you get
different treatment effects for these two groups? Discuss.
- Dehajia and Wahba use
various “matching” methods to estimates that ATE. There are now several programs available
in Stata to perform matching estimation in
practice. In particular, the “match”
procedure is available on Guido Imbens home page
at http://emlab.berkeley.edu/users/imbens/estimators.shtml.
Other procedures, including a
propensity score matching approach (psmatch2), are also available on the web. Using one of the available approach (or an algorithm of your own!), estimate the
ATE using a matching approach and compare your results with those obtained
using the above regression approach.
- An
alternative and simpler (in my view) estimator is based on re-weighting
observations using the propensity score.
Let p(x) be the propensity score. Reweighting
the control group by p(x)/(1-p(x)) yields a
counterfactual sample that now has the same distribution of the propensity
score as the treatment group. The
treatment effect (on the treated) can now be computed as a simple
difference in mean outcomes between the treatment group and this
counterfactual group. Similarly, reweighting the treatment group by (1-p(x))/p(x)
yields a counterfactual sample that now has the same distribution of the
propensity score as the control group.
It is then easy to compute the ATE by assuming, as in 2, that PSID-1
represents the whole population.
Use the reweighting approach to compute
the treatment effect on the treated and the ATE, and compare your results
to those in 2.
- Use
the reweighting approach to compute the effect
of the training programs on the distribution, as opposed to just the mean,
of log earnings. First compute the
effect of the treatment on the variance of log earnings. Then compute the 10th, 25th,
50th, 75th, and 90th centiles
of log earnings and show the impact of the treatment at each of these points
in the earnings distribution.
Finally, compute kernel density estimates of the distribution of
log earnings for the treatment and the (reweighted)
control group using the “kdensity” procedure in Stata.