Doubly Robust Estimator
Definition
Doubly Robust (DR) Estimator는 outcome regression과 propensity score 모델을 결합하여, 둘 중 하나만 올바르게 specified되어도 consistent한 추정량.
ATE에 대한 DR Estimator:
여기서 pseudo-outcome (efficient influence function):
또는 equivalent form:
Intuitive Understanding
세 가지 추정 전략의 결합:
- Outcome Regression (OR):
- Inverse Propensity Weighting (IPW):
- Doubly Robust: OR + IPW correction term
OR만 사용: μ̂ 틀리면 biased
IPW만 사용: π̂ 틀리면 biased
DR 사용: μ̂ OR π̂ 중 하나만 맞아도 consistent!
왜 “Doubly Robust”인가?
- (outcome model correct): augmentation term의 기댓값이 0
- (propensity model correct): weighting이 정확하여 bias 상쇄
- 둘 다 틀려도 bias가 오차의 곱에 비례:
Key Properties
Double Robustness Property
Theorem: 다음 두 조건 중 하나가 성립하면 는 consistent:
- Outcome model이 correctly specified:
- Propensity model이 correctly specified:
Semiparametric Efficiency
DR estimator는 semiparametrically efficient:
- Efficient influence function을 기반으로 구성
- Semiparametric efficiency bound 달성
- 가장 낮은 asymptotic variance
Rate Doubly Robust
Product rate condition 하에서 -consistent:
예: 각각 rate면 충분
Mathematical Derivation
Efficient Influence Function
ATE 의 efficient influence function:
Properties:
- semiparametric variance bound
- Neyman orthogonal:
Bias Analysis
→ Product of errors 형태
Comparison: OR vs IPW vs DR
| Aspect | Outcome Regression | IPW | Doubly Robust |
|---|---|---|---|
| Model needed | Both | ||
| Consistency | If correct | If correct | If either correct |
| Efficiency | Not efficient | Not efficient | Semiparametrically efficient |
| Variance | Low if good | High with extreme | Best of both |
| With ML | Regularization bias | Variance issues | Robust to both |
Extensions
CATE Estimation
DR-Learner: DR pseudo-outcome을 에 대해 regression
ATT Estimation
Longitudinal Settings
Time-varying treatments, g-computation과 결합
Related Concepts
- Pseudo-outcome - DR estimator의 핵심 구성요소
- DR-Learner - CATE를 위한 DR 확장
- Influence Function - DR의 이론적 기반
- Neyman-Orthogonal Score - Orthogonality 속성
- Propensity Score - Treatment assignment probability
- Double-Debiased ML - 관련 framework
Historical Context
- Robins, Rotnitzky, Zhao (1994): 최초 doubly robust estimator 제안
- Bang & Robins (2005): “Doubly Robust Estimation” 명명
- Scharfstein, Rotnitzky, Robins (1999): Semiparametric theory 연결
- Chernozhukov et al. (2018): ML과의 결합 (DML)
Implementation
Python (econml):
from econml.dr import LinearDRLearner
dr = LinearDRLearner()
dr.fit(Y, T, X=X, W=W)
ate = dr.ate(X)
R (AIPW package):
library(AIPW)
AIPW_SL <- AIPW$new(Y = Y, A = A, W = W,
Q.SL.library = c("SL.glm", "SL.ranger"),
g.SL.library = c("SL.glm", "SL.ranger"))
AIPW_SL$fit()
AIPW_SL$summary()
References
- Robins, Rotnitzky, Zhao (1994) - Original DR estimator
- kennedyOptimalDoublyRobust2023 - Optimal DR for CATE
- chernozhukovDoubleDebiasedMachine2018 - DML framework
- Bang & Robins (2005) - “Doubly Robust Estimation in Missing Data and Causal Inference Models”