IPW (Inverse Propensity Weighting)

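Advantage	Description
Simple	Intuitive and easy to implement
Nonparametric in the outcome	No outcome-model assumptions are needed
Theoretical justification	Consistent when the propensity score (PS) model is correctly specified
Flexible	Applies to a range of estimands (e.g. ATE, ATT)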

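Disadvantages

Disadvantage	Description
Depends on PS estimation	A misspecified PS model biases the estimate
Sensitive to extreme PS	Unstable where $e(X) \approx 0$ or $e(X) \approx 1$
High variance	Especially when overlap is weak
Hard in high dimensions	The PS itself becomes difficult to estimate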
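The variance cost of weak overlap can be made concrete with the Kish effective sample size, $(\sum_i w_i)^2 / \sum_i w_i^2$. The sketch below is illustrative only: the propensity scores are made-up values and `effective_sample_size` is a hypothetical helper, not part of any library mentioned in this note.

```python
def effective_sample_size(weights):
    """Kish effective sample size: (sum of w)^2 / (sum of w^2)."""
    total = sum(weights)
    total_sq = sum(w * w for w in weights)
    return total * total / total_sq


# Made-up propensity scores for four treated units (weights are 1 / e).
# Good overlap: scores near 0.5 give nearly equal weights.
balanced = [1 / e for e in (0.45, 0.50, 0.55, 0.50)]
# Weak overlap: one unit with e(X) = 0.01 dominates the weighted sum.
skewed = [1 / e for e in (0.45, 0.50, 0.55, 0.01)]

ess_balanced = effective_sample_size(balanced)  # close to the nominal n = 4
ess_skewed = effective_sample_size(skewed)      # close to 1
```

An effective sample size far below the nominal one signals that a handful of observations carry most of the estimate, which is where the high variance comes from.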

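The Extreme Propensity Score Problem

Problem

When $e(X) \to 0$ or $e(X) \to 1$:

Weights explode: $1/e(X) \to \infty$ for treated units as $e(X) \to 0$, and $1/(1 - e(X)) \to \infty$ for control units as $e(X) \to 1$
The estimator becomes unstable, because a few observations dominate the weighted average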

Solutions

Trimming: drop samples with extreme propensity scores
Overlap weighting: use weights that stay bounded
Weight clipping: cap the weights at an upper bound
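The three remedies above can be sketched in a few lines. This is a minimal illustration rather than a reference implementation: the trimming thresholds (0.05, 0.95) and the weight cap (10) are arbitrary assumptions, the helper names are hypothetical, and the overlap weights use the common form $1 - e(X)$ for treated units and $e(X)$ for controls.

```python
def ipw_weights(t, e):
    """ATE weights: 1/e for treated units, 1/(1-e) for controls."""
    return [1 / ei if ti == 1 else 1 / (1 - ei) for ti, ei in zip(t, e)]


def trim_mask(e, lo=0.05, hi=0.95):
    """Trimming: flag the units whose PS lies inside [lo, hi]."""
    return [lo <= ei <= hi for ei in e]


def clip_weights(w, cap=10.0):
    """Weight clipping: cap each weight at `cap`."""
    return [min(wi, cap) for wi in w]


def overlap_weights(t, e):
    """Overlap weighting: 1-e for treated, e for controls (always in [0, 1])."""
    return [1 - ei if ti == 1 else ei for ti, ei in zip(t, e)]


# Toy data: units 0 and 3 have extreme propensity scores.
t = [1, 1, 0, 0]
e = [0.01, 0.60, 0.40, 0.99]

raw = ipw_weights(t, e)      # the two extreme units get weights near 100
kept = trim_mask(e)          # only the two middle units survive trimming
clipped = clip_weights(raw)  # extreme weights are capped at 10
ow = overlap_weights(t, e)   # every weight stays between 0 and 1
```

Note that trimming and overlap weighting change the target population: the resulting estimate describes units with good overlap rather than the full sample.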

Implementation

Python (EconML)

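from econml.dr import LinearDRLearner
from sklearn.linear_model import LogisticRegression

# Note: LinearDRLearner is a doubly robust estimator, not pure IPW. It combines
# the propensity model below with an outcome model (model_regression, left at
# its default here).
model = LinearDRLearner(model_propensity=LogisticRegression())
model.fit(Y, T, X=X)  # X is a keyword-only argument
ate = model.effect(X).mean()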
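For comparison, a plain IPW estimate that uses no outcome model can be written directly from the weights. The sketch below is a minimal, standard-library illustration under stated assumptions: the data are simulated with a true ATE of 2.0, the true propensity score is treated as known (in practice it would be estimated), and `ipw_ate` is a hypothetical helper, not an EconML function.

```python
import math
import random


def ipw_ate(y, t, e, normalize=False):
    """IPW estimate of the ATE from outcomes y, binary treatments t, and PS e."""
    w1 = [ti / ei for ti, ei in zip(t, e)]
    w0 = [(1 - ti) / (1 - ei) for ti, ei in zip(t, e)]
    n = len(y)
    # Horvitz-Thompson divides by n; the normalized (Hajek) form divides by
    # the sum of the weights in each arm.
    d1, d0 = (sum(w1), sum(w0)) if normalize else (n, n)
    mu1 = sum(w * yi for w, yi in zip(w1, y)) / d1
    mu0 = sum(w * yi for w, yi in zip(w0, y)) / d0
    return mu1 - mu0


# Simulated confounded data: x raises both the treatment probability and the outcome.
rng = random.Random(0)
y, t, e = [], [], []
for _ in range(20000):
    x = rng.gauss(0, 1)
    ps = 1 / (1 + math.exp(-x))  # true propensity score, known in this simulation
    ti = 1 if rng.random() < ps else 0
    y.append(2.0 * ti + x + rng.gauss(0, 1))
    t.append(ti)
    e.append(ps)

n_treated = sum(t)
naive = (sum(yi for yi, ti in zip(y, t) if ti) / n_treated
         - sum(yi for yi, ti in zip(y, t) if not ti) / (len(t) - n_treated))
ate_ht = ipw_ate(y, t, e)                     # Horvitz-Thompson form
ate_hajek = ipw_ate(y, t, e, normalize=True)  # normalized form
```

In this setup the naive difference in means is biased upward by the confounder, while both IPW variants should land near the simulated effect of 2.0.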

R

library(WeightIt)

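# Propensity score weights for the ATE
w_out <- weightit(treat ~ x1 + x2, data = df, method = "ps", estimand = "ATE")

# Weighted outcome regression; the coefficient on treat is the IPW estimate.
# The standard errors reported by lm() do not account for the weighting.
fit <- lm(y ~ treat, data = df, weights = w_out$weights)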

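Application: Correcting Win Selection Bias in RTB

In RTB (real-time bidding), a model trained only on won impressions suffers from win selection bias, because outcomes are observed only for the auctions the bidder won. IPW corrects for this by weighting each won impression by the inverse of its win probability:

$$w_i = \frac{1}{p_{\text{win}}(x_i, b_i)}, \quad p_{\text{win}}(x_i, b_i) = P(\text{win} \mid X = x_i, \text{bid} = b_i)$$

The win propensity can be estimated with survival analysis (Kaplan-Meier) or gradient boosting. Weight stabilization (clipping, normalization) is essential, because small win probabilities produce very large weights. For details, see Multi-Task Learning (IPW-ESCM²).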
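As a rough illustration of the weighting and stabilization steps, the sketch below computes clipped, normalized win weights and uses them for a weighted click-through rate. Everything here is an assumption made for illustration: the win probabilities and clicks are made-up values, the probability floor of 0.05 is arbitrary, and `win_ipw_weights` and `weighted_ctr` are hypothetical helpers.

```python
def win_ipw_weights(p_win, p_floor=0.05):
    """w_i = 1 / p_win(x_i, b_i), stabilized in two steps:
    flooring p_win (equivalent to clipping the weight at 1 / p_floor),
    then rescaling so the weights average to 1 (normalization)."""
    raw = [1 / max(p, p_floor) for p in p_win]
    mean_w = sum(raw) / len(raw)
    return [w / mean_w for w in raw]


def weighted_ctr(clicks, weights):
    """Click-through rate where each won impression counts with its IPW weight."""
    return sum(w * c for w, c in zip(weights, clicks)) / sum(weights)


# Made-up won impressions: estimated win probability and observed click.
p_win = [0.9, 0.8, 0.2, 0.1, 0.01]
clicks = [1, 1, 0, 0, 0]

w = win_ipw_weights(p_win)
naive_ctr = sum(clicks) / len(clicks)  # treats won impressions as representative
ipw_ctr = weighted_ctr(clicks, w)      # up-weights impressions that were unlikely to be won
```

The same weights could be passed as per-sample weights to a training loss. Because the floor caps the largest weight, it trades a small bias for lower variance.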

References

yaoSurveyCausalInference2021, Section 3.1.3
Rosenbaum, P. R., & Rubin, D. B. (1983). The central role of the propensity score in observational studies for causal effects
Horvitz, D. G., & Thompson, D. J. (1952). A generalization of sampling without replacement from a finite universe
Zhang et al. (2016). Bid-aware Gradient Descent (KDD)
