#doubly-robust

노트 7개

AIPW (Augmented Inverse Probability Weighting) - $\hat{\mu}t(X)$: Outcome 모델 ($E[Y|T=t, X]$)
Double/Debiased Machine Learning (DML) 고차원 nuisance parameter $\eta0$ 존재 하에서 저차원 관심 parameter $\theta0$에 대한 유효한 통계적 추론을 수행하기 위한 방법론.
Doubly Robust Estimator Doubly Robust (DR) Estimator는 outcome regression과 propensity score 모델을 결합하여, 둘 중 하나만 올바르게 specified되어도 consistent한 추정량.
DR-Learner DR-Learner는 CATE 추정을 위한 2단계 doubly robust estimator로, Pseudo-outcome을 covariate에 대해 regression하는 방식.
ESCM² (Entire Space Counterfactual Multi-Task Model) ESMM의 두 가지 이론적 한계 — Inherent Estimation Bias (IEB)와 Potential Independence Priority (PIP) — 를 해결하기 위해, Inverse Propensity Score (IPS) 및 Doubly Robust Estimator 기반…
Off-Policy Evaluation (OPE) 다른 behavior policy $\pib$로 수집한 로그로 target policy $\pie$의 가치 $V(\pie)=E{\pie}[\sum r]$를 추정.
Optimal Targeting Policy Optimal Targeting Policy는 covariate $x$를 처치 결정 $\pi(x)\in\{0,1\}$로 사상해 정책 가치를 극대화하는 규칙이다: