Tae Hyun Kim (Lowell)

HTE (Heterogeneous Treatment Effects)

4 min read #causal-inference#hte

Definition

The phenomenon in which the treatment effect varies with an individual’s characteristics

τ(x)=E[Y(1)Y(0)X=x]\tau(x) = E[Y(1) - Y(0) \mid X=x]

If τ(x)\tau(x) varies with xx, then heterogeneous treatment effects (HTE) are present.


Intuitive Understanding

Homogeneous vs Heterogeneous

TypeDefinitionExample
Homogeneousτ(x)=τ\tau(x) = \tau (constant)The same drug effect for all patients
Heterogeneousτ(x)\tau(x) depends on xxA larger drug effect for younger patients

Why Does It Matter?

  1. Personalized decision-making: deciding whom to treat
  2. Policy targeting: prioritizing groups with large effects
  3. Resource optimization: efficient allocation of limited resources
  4. Scientific understanding: insight into mechanisms

ConceptDefinitionRelationship
ITEYi(1)Yi(0)Y_i(1) - Y_i(0)Individual-level effect
CATEE[Y(1)Y(0)X=x]E[Y(1)-Y(0) \mid X=x]Functional representation of HTE
HTEVariability of CATEIf CATE varies, HTE is present
ATEE[Y(1)Y(0)]E[Y(1)-Y(0)]The average of CATE

HTE Estimation Methods

1. Meta-learners

CATE estimation leveraging existing ML algorithms:

MethodApproachCharacteristics
S-LearnerSingle modelSimple; suitable when HTE is negligible
T-LearnerSeparate model per treatmentFlexible; no information sharing
X-LearnerTwo-stage imputationSuitable for imbalanced data
R-LearnerResidualized regressionTheoretical guarantees

2. Tree-based Methods

MethodCharacteristics
Causal ForestHeterogeneity-based splitting, provides confidence intervals
BARTBayesian uncertainty quantification

3. Representation Learning

MethodCharacteristics
CFRBalanced representation learning
CEVAEVAE-based latent confounder inference
BNNBalancing Neural Network
GANITEGAN-based ITE estimation

4. Multi-task Learning

MethodCharacteristics
CMGPGaussian Process multi-task
DragonnetTargeted regularization

HTE Analysis Workflow

1. Exploring Effect Heterogeneity

# Estimate CATE with a Causal Forest
from econml.dml import CausalForestDML

cf = CausalForestDML()
cf.fit(Y, T, X)
cate = cf.effect(X)

# Check heterogeneity
print(f"CATE std: {cate.std():.3f}")
print(f"CATE range: [{cate.min():.3f}, {cate.max():.3f}]")

2. Subgroup Analysis

# Effect analysis by characteristic
for group in ['young', 'old']:
    idx = X['age_group'] == group
    print(f"{group}: CATE = {cate[idx].mean():.3f}")

3. Variable Importance

# Which variables contribute to heterogeneity?
importance = cf.feature_importances_

Cautions in Interpreting HTE

1. The Subgroup Hunting Problem

  • Finding high-effect groups via post-hoc analysis → multiple-comparisons problem
  • Separate validation data required

2. Checking Confidence Intervals

  • Do not draw conclusions from point estimates alone
  • Account for uncertainty

3. Causal Interpretation

  • CATE estimation ≠ optimal treatment decision
  • A causal interpretation of the interventional effect is needed

Applications

1. Personalized Medicine

  • “Which treatment is optimal for this patient?”
  • Side-effect risk vs. effect trade-off

2. Marketing Targeting

  • “For which customers is the promotion effective?”
  • Uplift modeling

3. Policy Design

  • “To which group should the policy be applied first?”
  • Optimization under resource constraints

4. Recommender Systems

  • “For which users is this content effective?”
  • Connected to MOC-LLM4RecSys, etc.

Evaluation Metrics

MetricDefinitionUse
PEHEE[(τ(X)τ^(X))2]\sqrt{E[(\tau(X) - \hat{\tau}(X))^2]}CATE estimation accuracy
ATE Estimation Error$\text{ATE} - \widehat{\text{ATE}}
Qini coefficientBased on the uplift curveTargeting efficiency

  • Treatment Effects Overview - integrated overview of treatment effects
  • CATE - functional definition of CATE
  • ITE - individual-level effect
  • Meta-learners - HTE estimation framework
  • Causal Forest - tree-based HTE
  • CFR - representation-learning-based HTE
  • PEHE - HTE evaluation metric

References

  • yaoSurveyCausalInference2021 - Section 2.2, 3.4, 3.5
  • kunzelMetalearnersEstimatingHeterogeneous2019 - Meta-learners
  • Athey, S., & Imbens, G. (2016). Recursive partitioning for heterogeneous causal effects
  • Wager, S., & Athey, S. (2018). Estimation and inference of HTE using random forests

Local graph