This chapter extends causal inference methods to survival analysis and time-to-event outcomes. We define causal effects for survival data, address challenges posed by censoring and competing events, and show how to estimate causal survival curves using IP weighting and the parametric g-formula.
We begin by reviewing key concepts from survival analysis.
Definition 1 (Survival Analysis Terms) Event time \(T\): Time from baseline until an event occurs (e.g., death, disease onset)
Censoring time \(C\): Time from baseline until censoring (e.g., loss to follow-up, study end)
Observed time \(Y = \min(T, C)\): The time we actually observe
Event indicator \(D = I(T \leq C)\): 1 if event occurred, 0 if censored
Risk at time \(t\): \(\Pr[T \leq t]\) (cumulative incidence)
Survival at time \(t\): \(S(t) = \Pr[T > t] = 1 - \Pr[T \leq t]\)
Hazard at time \(t\): \(\lambda(t) = \lim_{dt \to 0} \frac{\Pr[t \leq T < t + dt \mid T \geq t]}{dt}\)
Hazard: Instantaneous rate of event occurrence among those at risk
Risk (cumulative incidence): Probability of event by time \(t\)
The survival function \(S(t)\) and cumulative hazard \(\Lambda(t)\) are key quantities.
\[\Lambda(t) = \int_0^t \lambda(s) ds\]
Interpretation: Total accumulated hazard up to time \(t\).
Survival from cumulative hazard:
\[S(t) = \exp(-\Lambda(t))\]
Risk from survival:
\[\Pr[T \leq t] = 1 - S(t) = 1 - \exp(-\Lambda(t))\]
Nonparametric estimator of survival function:
\[\hat{S}(t) = \prod_{t_i \leq t} \left(1 - \frac{d_i}{n_i}\right)\]
where: - \(t_i\) are the ordered event times - \(d_i\) is the number of events at \(t_i\) - \(n_i\) is the number at risk just before \(t_i\)
Assumption: Censoring is independent of event time (given covariates).
Censoring creates missing data problems in survival analysis.
Administrative censoring: Study ends before all events occur
Loss to follow-up: Participants drop out before event or study end
Competing risks: Individuals experience a different event that precludes the event of interest
Definition 2 (Independent Censoring) Censoring is independent if:
\[T \perp\!\!\!\perp C \mid L\]
where \(T\) is event time, \(C\) is censoring time, \(L\) are measured covariates.
Interpretation: Among individuals with the same covariate values, censoring is unrelated to their (unobserved) event time.
When violated: Standard survival methods (Kaplan-Meier, Cox regression) are biased.
Solution: Use IP weighting for censoring (Section 17.5).
The hazard ratio is commonly used to quantify treatment effects in survival analysis.
Definition 3 (Hazard Ratio) The hazard ratio comparing treatment \(a\) to \(a'\) is:
\[HR^{a,a'}(t) = \frac{\lambda^a(t)}{\lambda^{a'}(t)}\]
where \(\lambda^a(t)\) is the hazard function under treatment \(a\).
Cox proportional hazards model: Assumes \(HR^{a,a'}(t) = HR^{a,a'}\) (constant over time).
Collapsibility: Unlike risk differences and risk ratios, hazard ratios are non-collapsible.
Built-in selection bias: Hazards condition on survival to time \(t\), creating selection bias when effects are heterogeneous.
We can use IP weighting to estimate causal survival curves.
Definition 4 (Causal Survival Function) The causal survival function under treatment \(a\) is:
\[S^a(t) = \Pr[T^a > t]\]
where \(T^a\) is the potential event time under treatment \(a\).
Causal risk at time \(t\): \(\Pr[T^a \leq t] = 1 - S^a(t)\)
Goal: Estimate \(S^a(t)\) adjusting for confounding.
Method: IP weighted Kaplan-Meier estimator
\[\hat{S}^a(t) = \prod_{t_i \leq t} \left(1 - \frac{\sum_{j: A_j = a} W_j^A I(Y_j = t_i, D_j = 1)}{\sum_{j: A_j = a} W_j^A I(Y_j \geq t_i)}\right)\]
Result: Estimate of causal survival curve under treatment \(a\).
Joint weights for treatment and censoring:
\[W^{A,C}_i = \frac{1}{\Pr[A_i \mid L_i]} \times \frac{1}{\Pr[C_i > t \mid A_i, L_i, \bar{Y}_i(t)]}\]
where \(\bar{Y}_i(t)\) represents the history of being at risk up to time \(t\).
Stabilized weights can improve stability:
\[SW^{A,C} = \frac{\Pr[A]}{\Pr[A \mid L]} \times \frac{\Pr[C > t \mid A]}{\Pr[C > t \mid A, L, \bar{Y}(t)]}\]
The parametric g-formula can also be used for survival outcomes.
Model: Hazard at each time point \(t\)
\[\Pr[T = t \mid T \geq t, A, L] = \text{expit}(\alpha_0(t) + \alpha_1 A + \alpha_2^{\top} L)\]
This can be fit using pooled logistic regression:
G-formula algorithm:
For continuous time, use parametric survival models:
Competing risks occur when multiple types of events can prevent observation of the event of interest.
Definition 5 (Competing Risks) A competing risk is an event that precludes the occurrence of the event of interest.
Example: When studying heart attack incidence, death from cancer is a competing risk.
If someone dies from cancer, they can never have a heart attack (or at least we can’t observe it).
Censoring is not independent: Individuals who experience competing events may have different risk of the event of interest.
Standard survival methods fail: Treating competing events as censoring leads to biased estimates.
1. Cause-specific hazards:
2. Subdistribution hazards (Fine-Gray model):
3. Parametric g-formula:
Key concepts:
Methods for causal survival analysis:
| Method | Approach | Advantages | Disadvantages |
|---|---|---|---|
| IP weighted KM | Weight observations | Nonparametric, robust | Needs correct treatment/censoring models |
| Parametric g-formula | Model hazards | Efficient, handles competing risks | Needs correct outcome model |
| Cox model | Model conditional hazard | Familiar, flexible | HR not causally interpretable |
Causal estimands:
NOT generally causal:
Assumptions:
Practical recommendations: