Frontiers in Difference-in-Differences

More Complicated Treatment Regimes

Brantly Callaway

University of Georgia

October 19, 2023

Introduction

\(\newcommand{\E}{\mathbb{E}} \newcommand{\E}{\mathbb{E}} \newcommand{\var}{\mathrm{var}} \newcommand{\cov}{\mathrm{cov}} \newcommand{\Var}{\mathrm{var}} \newcommand{\Cov}{\mathrm{cov}} \newcommand{\Corr}{\mathrm{corr}} \newcommand{\corr}{\mathrm{corr}} \newcommand{\L}{\mathrm{L}} \renewcommand{\P}{\mathrm{P}} \newcommand{\independent}{{\perp\!\!\!\perp}} \newcommand{\indicator}[1]{ \mathbf{1}\{#1\} }\) The discussion (and much of the recent DID literature) has focused on the setting with staggered treatment adoption.

However, this certainly does not cover the full range of possible treatments. In Part 3, we’ll primarily consider two leading extensions:

A treatment that is multi-valued or continuous (e.g., length of school closures during Covid on student test scores)
A treatment that can turn on and off (e.g., union status)

A couple of things to notice as we go along:

I’m not going to cover much on TWFE regressions here. They have even more sources of things that can go wrong.
Try to pay attention to the pattern. Even though the arguments are getting more complicated, we are still following the idea of (i) target disaggregated parameters, (ii) combine them into lower dimensional objects, (3) here there will be some additional interpretation issues that are worth emphasizing

Continuous Treatment Notation

Potential outcomes notation

Two time periods: \(t^*\) and \(t^*-1\)
- No one treated until period \(t^*\)
- Some units remain untreated in period \(t^*\)
Potential outcomes: \(Y_{it^*}(d)\)
Observed outcomes: \(Y_{it^*}\) and \(Y_{it^*-1}\)

\[Y_{it^*}=Y_{it^*}(D_i) \quad \textrm{and} \quad Y_{it^*-1}=Y_{it^*-1}(0)\]

Parameters of Interest (ATT-type)

Level Effects (Average Treatment Effect on the Treated)

\[ATT(d|d) := \E[Y_{t^*}(d) - Y_{t^*}(0) | D=d]\]

Interpretation: The average effect of dose \(d\) relative to not being treated local to the group that actually experienced dose \(d\)
This is the natural analogue of \(ATT\) in the binary treatment case

Parameters of Interest (ATT-type)

Slope Effects (Average Causal Response on the Treated)

\[ACRT(d|d) := \frac{\partial ATT(l|d)}{\partial l} \Big|_{l=d}\]

Interpretation: \(ACRT(d|d)\) is the causal effect of a marginal increase in dose local to units that actually experienced dose \(d\)

We can view \(ACRT(d|d)\) as the “building block” here. An aggregated version of it (into a single number) is \[\begin{align*} ACRT^O := \E[ACRT(D|D)|D>0] \end{align*}\]

\(ACRT^O\) averages \(ACRT(d|d)\) over the population distribution of the dose
Like \(ATT^O\) for staggered treatment adoption, \(ACRT^O\) is the natural target parameter for the TWFE regression in this case