INDIVIDUAL
ASSIGNMENT - DEADLINE 17-May-2024 at 23:59.
Please bring a draft to
our session next week 8-May-2024 and submit your final report using this form.
The goal of this lab is to introduce you to the Neyman-Rubin’s potential outcome framework. You can find the template for your report on posit cloud
Table 1 below displays a small data set that is used for practice
purposes only. Two unobserved (and imagined) potential outcomes are
recorded for each patient, denoting years of post-treatment survival
under each of two treatments. Suppose the “perfect doctor” knows each
patient’s potential outcomes and as a result chooses the best treatment
for each patient. She admisters bedrest to patients who
would benefit from it most (\(D = 0\))
and ventilators to those who would benefit from it more
(\(D = 1\)).
table 1 shown below, inserting values
appropriately for the three empty colums: (i) The column labeled \(\delta_i\): please enter each patient’s
treatment effect (ii) The column labeled \(D\): the optimal treatment for this patient
(iii) The column labeled \(Y\): the
observed outcomes. Calculate the average treatment effect (ATE) and the
average treatment effect for the treated (ATT) when comparing the
outcome of the ventilators treatment with that of the
bedrest treatment and comment as to which type of
intervention is more effective on average. Finally, explain under which
conditions might SUTVA be violated
for treatments of covid-19 in the scenario described above.| patient | \(Y^{(0)}\) | \(Y^{(1)}\) | Age | \(\delta\) | D | Y |
|---|---|---|---|---|---|---|
| 1 | 10 | 1 | 29 | |||
| 2 | 5 | 1 | 35 | |||
| 3 | 4 | 1 | 19 | |||
| 4 | 6 | 5 | 45 | |||
| 5 | 1 | 5 | 65 | |||
| 6 | 7 | 6 | 50 | |||
| 7 | 8 | 7 | 77 | |||
| 8 | 10 | 7 | 18 | |||
| 9 | 2 | 8 | 85 | |||
| 10 | 6 | 9 | 96 | |||
| 11 | 7 | 10 | 77 |
Table 1. consists of data about eleven patients, each of whom is infected with coronavirus. There are two treatments: ventilators would lead to the potential outcome \(Y^{(1)}\) and bedrest would lead to the potential outcome \(Y^{(0)}\) .
Calculate the simple difference in outcomes (SDO),
showing the details of your calculation. Is the SDO a good
estimation for the ATE? Finally, check whether the SDO is
equal to the sum of the ATT and the selection bias, \(E[Y(0)|T=1] - E[Y(0)|T=0]\).
Compare the treatment effect for both groups: for those treated with a ventilator and for those treated with bedrest. What explains the difference in the average effect? Now compare all four measures of effects. What are the advantages and disadvantages of each? Is the ATE equal to the mean of the ATU and the ATT? Why or why not?
The following exercises demonstrate that regression is a useful tool to estimate average outcomes and treatment effects in the different groups. Notice that in contrast to the role that regressions play in traditional statistics, here standard errors and significance are not of primary concern. Instead, we are interested in using regression to calculate average effects proper.
bedrest treatment \(\mathbb{E}[Y|D=0]\). Now estimate the
following regression, comparing the coefficients \(\alpha\) and \(\delta\) to the statistics you’ve
previously calculated. What did you find? How would you explain these
finding?\[ Y_i=\alpha~+~\delta\cdot D_i~+~\epsilon_i \]
\[ Y_i=\alpha+\delta\cdot D_i+\beta\cdot X_{age}+\epsilon_i \]
\[\begin{equation} Y_i=\alpha_0+\delta_0\cdot D_i+\beta_0\cdot X_{age}+\epsilon_{0i} \\ D_i=\alpha_1+\beta_1\cdot X_{age} +\epsilon_{1i} \\ Y_i=\alpha_2+\delta_2\cdot \tilde{D}_i +\epsilon_{2i} \\ \end{equation}\]