This chapter introduces instrumental variable (IV) estimation, a method for identifying causal effects when there is unmeasured confounding. Unlike the methods in previous chapters, IV estimation does not rely on conditional exchangeability. Instead, it uses a special variable (the instrument) that affects treatment but not the outcome directly.
An instrumental variable \(Z\) must satisfy three conditions to identify causal effects.
Definition 1 (Instrumental Variable) A variable \(Z\) is an instrumental variable for the effect of \(A\) on \(Y\) if:
Where \(Y^{a,z}\) denotes the potential outcome under treatment \(A = a\) and instrument \(Z = z\).
Statement: \(Z\) is associated with \(A\)
Meaning: The instrument must actually affect treatment assignment.
Example: Randomized encouragement
Testing: Relevance can be tested empirically by checking \(\Pr[A = 1 \mid Z = 1] \neq \Pr[A = 1 \mid Z = 0]\)
Statement: \(Y^{a,z} \perp\!\!\!\perp Z\) (or \(Y^a \perp\!\!\!\perp Z\) under exclusion)
Meaning: The instrument is “as good as randomly assigned” with respect to potential outcomes.
Example: Randomized encouragement
Testing: Exchangeability generally cannot be tested (involves unmeasured confounders)
Statement: \(Y^{a,z} = Y^a\) for all \(a, z\)
Meaning: The instrument affects the outcome ONLY through its effect on treatment.
Example: Randomized encouragement
Testing: Exclusion generally cannot be tested (untestable assumption)
Under the three IV conditions, we can identify a causal effect.
Setting: Binary instrument \(Z\), binary treatment \(A\), outcome \(Y\)
IV estimand:
\[\frac{E[Y \mid Z = 1] - E[Y \mid Z = 0]}{E[A \mid Z = 1] - E[A \mid Z = 0]}\]
This is the Wald estimator or ratio estimator.
Interpretation: The effect of \(Z\) on \(Y\), divided by the effect of \(Z\) on \(A\).
Numerator: \(E[Y \mid Z = 1] - E[Y \mid Z = 0]\)
Denominator: \(E[A \mid Z = 1] - E[A \mid Z = 0]\)
Ratio: Under additional assumptions (see next section), this estimates the average causal effect in a specific subgroup.
Design: Randomize individuals to receive encouragement to exercise (\(Z\))
IV estimate:
\[\frac{\text{Mean health in encouraged} - \text{Mean health in not encouraged}}{\Pr[\text{Exercise} \mid \text{Encouraged}] - \Pr[\text{Exercise} \mid \text{Not encouraged}]}\]
If 60% exercise when encouraged vs 30% when not, and mean health differs by 6 points:
\[\frac{6}{0.60 - 0.30} = \frac{6}{0.30} = 20\]
Effect of exercise on health (in a subgroup) is 20 points.
IV estimation is like an imperfect randomized experiment.
If everyone complied with their assigned treatment (\(A = Z\)):
When \(A \neq Z\) for some individuals:
Definition 2 (Principal Strata) Individuals can be classified into principal strata based on potential treatments \(A^{z=1}\) and \(A^{z=0}\):
Monotonicity assumption: No defiers exist.
Under IV conditions plus monotonicity:
\[\text{IV estimand} = E[Y^{a=1} - Y^{a=0} \mid \text{Complier}]\]
This is the average causal effect in compliers, not in the full population.
Interpretation: IV tells us the effect of treatment for those who would comply with the instrument.
Limitation: We don’t know who the compliers are (unobservable principal stratum).
Two-stage least squares (2SLS) is the most common IV method for continuous outcomes.
Stage 1: Regress treatment on instrument (and covariates if present)
\[A_i = \alpha_0 + \alpha_1 Z_i + \epsilon_i\]
Obtain predicted treatment: \(\hat{A}_i = \hat{\alpha}_0 + \hat{\alpha}_1 Z_i\)
Stage 2: Regress outcome on predicted treatment
\[Y_i = \beta_0 + \beta_1 \hat{A}_i + \eta_i\]
The coefficient \(\hat{\beta}_1\) is the 2SLS estimate of the causal effect.
Intuition:
Mathematical equivalence: For binary \(Z\) and \(A\), 2SLS equals the Wald estimator.
With measured confounders \(L\) that confound \(A \to Y\) but not \(Z \to A\):
Stage 1: \[A_i = \alpha_0 + \alpha_1 Z_i + \alpha_2^{\top} L_i + \epsilon_i\]
Stage 2: \[Y_i = \beta_0 + \beta_1 \hat{A}_i + \beta_2^{\top} L_i + \eta_i\]
Including \(L\) can improve efficiency even if not necessary for identification.
IV estimation can be combined with adjustment for measured confounders.
Scenario 1: Confounders of \(A \to Y\) that don’t affect \(Z\)
Scenario 2: Confounders of \(Z \to Y\)
Modified IV conditions:
Estimation: Use 2SLS with \(L\) as covariates, then standardize over \(L\).
How do IV estimates compare to regression-based estimates?
Regression (e.g., outcome regression or IP weighting):
IV estimation:
If IV and regression give different estimates:
Interpretation: Differences suggest either unmeasured confounding or effect heterogeneity (or both).
IV methods can be extended to handle survival outcomes and time-to-event data.
Issue: With time-to-event outcomes, some individuals are censored before the event.
Question: How do we interpret IV estimates when the outcome is survival time?
Definition 3 (Survivor Average Causal Effect) The survivor average causal effect (SACE) is:
\[E[Y^{a=1} - Y^{a=0} \mid S^{a=1} = 1, S^{a=0} = 1]\]
where \(S^a\) is an indicator for surviving (or remaining uncensored) under treatment \(a\).
This is the effect in always-survivors - those who would survive under both treatment and control.
Setting: Survival \(S\) is affected by treatment \(A\), and outcome \(Y\) is only observed if \(S = 1\).
IV approach: Under IV conditions plus additional assumptions (monotonicity for survival), IV can identify SACE.
Interpretation: Effect of treatment on the outcome for those who would survive regardless of treatment.
Key concepts:
When to use IV methods:
Common instruments:
| Setting | Instrument | Treatment | Outcome |
|---|---|---|---|
| Randomized encouragement | Encouragement | Behavior change | Health |
| Geographic variation | Distance to facility | Healthcare use | Health |
| Mendelian randomization | Genetic variant | Biomarker | Disease |
| Draft lottery | Lottery number | Military service | Earnings |
| Physician preference | Physician tendency | Treatment choice | Outcome |
Assumptions to check:
Advantages:
Limitations: