Causal Inference · Tae Hyun Kim (Lowell)

Causal Inference

CATE · counterfactual · causal discovery · SCM · semiparametric · partial ID

Identifying what causes what — heterogeneous treatment effects and counterfactuals — with structural causal models, semiparametric estimation, and sensitivity / partial-identification under unobserved confounding.

51 notes

Dunnhumby — Track 2: Causal Targeting via Heterogeneous Treatment Effects

Meta-learner / Causal Forest CATE under severe positivity violation (PS AUC 0.989); an OPE-validated policy targets ~31% of customers and surfaces counter-intuitive negative-CATE segments. Hypothesis-generating on public data.

2026-06-13 #causal-inference#targeting#uplift
Applied Causal Inference for Pricing — CATE & SCM Across Public Datasets

An applied case study using only public datasets (LendingClub, iPinYou) that combines CATE estimation for price-sensitivity heterogeneity with SCM-based moderator analysis to design individual-level, risk-based pricing and RTB bidding policies — all findings illustrative and projected, not proprietary.

2026-06-12 #pricing#causal-inference#cate
Causal Inference Under Partial Identification — Sensitivity and Evidence Hierarchies

When real-world data fail strong ignorability, point identification gives way to bounds, proxies, and sensitivity analysis — an honest hierarchy of evidence that connects credible causal claims to semiparametric efficiency.

2026-06-12 #causal-inference#partial-identification#sensitivity
From Estimation to Action — How HTE Drives Personalized Policy Across Domains

One methodological spine — estimate heterogeneous treatment effects and turn them into individual-level policies — powers both clinical sequential treatment decisions and industrial targeting, pricing, and recommendation.

2026-06-12 #personalization#causal-inference#decision-making
Marketing Attribution at Scale — From Simulation to Causal Inference

A case study comparing 10+ multi-touch attribution methods against a known-ground-truth simulator, then scaling them on the public Criteo dataset, closing the loop with budget off-policy evaluation for channel allocation.

2026-06-12 #causal-inference#decision-making#attribution
Anytime-Valid Inference Overview

Game-theoretic statistics that resolves the "peeking" problem of fixed-sample hypothesis testing. The mathematical foundation for real-time monitoring of identification-validity drift.

2026-06-11 #experiments#causal-inference#anytime-valid
Efficient Influence Function

Among the influence functions of regular asymptotically linear (RAL) estimators in a (semi)parametric model, the one with the smallest variance is the efficient influence function (EIF), and its variance equals the semiparametric efficiency bound (the supremum of the Cramér-Rao bounds over all parametric submodels)…

2026-06-11 #causal-inference#semiparametric#eif
Influence Function

If an estimator $\hat\psi$ of a functional parameter $\psi:\mathcal{P}\to\mathbb{R}$ is asymptotically linear, then an influence function (IF) $\phi$ exists such that
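
For reference, asymptotic linearity is usually stated as follows. This is the standard textbook form rather than a quotation from the note; the observation symbol $O_i$, the sample size $n$, and the data distribution $P$ are notation assumed here.

```latex
\sqrt{n}\,\bigl(\hat\psi - \psi(P)\bigr)
  = \frac{1}{\sqrt{n}} \sum_{i=1}^{n} \phi(O_i) + o_P(1),
\qquad
\mathbb{E}_P[\phi(O)] = 0, \quad \mathbb{E}_P[\phi(O)^2] < \infty .
```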

2026-06-11 #causal-inference#influence-functions#eif
Negative Control Outcome (NCO)

An NCO is an outcome variable known a priori to be unaffected by the treatment, yet still affected by the same unmeasured confounder $U$. By contrast, an NCE (negative control exposure) is an exposure with no causal effect on the outcome. A nonzero "apparent effect" on an NCO signals unmeasured confounding (detection), which can then be corrected via proximal methods.

2026-06-11 #causal-inference#proximal
One-step Estimator

Corrects first-order bias by adding the empirical mean of the estimated EIF to the plug-in $\psi(\hat P)$:
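
In the usual notation, the correction described above can be sketched as the display below. The symbols $\hat\phi$ (the EIF evaluated at the estimated distribution $\hat P$), $O_i$ (the $i$-th observation), and $n$ (the sample size) are assumed here and may differ from the note's own notation.

```latex
\hat\psi_{\text{os}} = \psi(\hat P) + \frac{1}{n} \sum_{i=1}^{n} \hat\phi(O_i)
```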

2026-06-11 #causal-inference#semiparametric#aipw
Partial Identification

When point identification is impossible due to a lack of assumptions, we only know that the parameter lies in the identified set $\Theta_I$ (often an interval $[\theta_L,\theta_U]$) compatible with the data plus assumptions. Manski's assumption-free / worst-case bounds are the starting point. sharp bounds =…
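
As a rough illustration of worst-case bounds, the sketch below bounds the average treatment effect for a binary treatment and a bounded outcome, with no assumptions about how treatment was selected. The function name `manski_ate_bounds`, the toy data, and the outcome range `[y_min, y_max]` are assumptions made for this example only.

```python
import numpy as np

def manski_ate_bounds(y, t, y_min=0.0, y_max=1.0):
    """Worst-case (no-assumption) bounds on the ATE for an outcome in [y_min, y_max]."""
    y = np.asarray(y, dtype=float)
    t = np.asarray(t, dtype=int)
    p1 = t.mean()            # P(T = 1)
    p0 = 1.0 - p1            # P(T = 0)
    m1 = y[t == 1].mean()    # E[Y | T = 1]
    m0 = y[t == 0].mean()    # E[Y | T = 0]
    # The unobserved potential outcomes are replaced by the extremes of the range.
    ey1_lo, ey1_hi = m1 * p1 + y_min * p0, m1 * p1 + y_max * p0
    ey0_lo, ey0_hi = m0 * p0 + y_min * p1, m0 * p0 + y_max * p1
    return ey1_lo - ey0_hi, ey1_hi - ey0_lo

# Hypothetical toy data: binary outcome, half the units treated.
y = np.array([1, 1, 0, 1, 0, 0, 1, 0])
t = np.array([1, 1, 1, 1, 0, 0, 0, 0])
lo, hi = manski_ate_bounds(y, t)
print(lo, hi)
```

Without further assumptions the interval always has width `y_max - y_min` and therefore always contains zero, which is why these bounds serve as a starting point to be tightened rather than as a final answer.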

2026-06-11 #causal-inference#partial-identification
Proximal Causal Inference

When unmeasured confounding $U$ is present, the causal effect is identified using two types of proxies:

2026-06-11 #causal-inference#proximal
TMLE (Targeted Maximum Likelihood Estimation)

A procedure that corrects (targets) a plug-in estimator toward the target parameter:

2026-06-11 #causal-inference#tmle#eif
ESCM² (Entire Space Counterfactual Multi-Task Model)

A model that integrates a counterfactual risk regularizer based on the Inverse Propensity Score (IPS) and the Doubly Robust estimator into ESMM, in order to address ESMM's two theoretical limitations — Inherent Estimation Bias (IEB) and Potential Independence Priority (PIP).

2026-03-25 #recsys#causal-inference#doubly-robust
AIPW (Augmented Inverse Probability Weighting)

- $\hat{\mu}_t(X)$: Outcome model ($E[Y|T=t, X]$)

2025-01-29 #causal-inference#doubly-robust
ATT (Average Treatment Effect on the Treated)

Average treatment effect for the group that actually received treatment

2025-01-28 #causal-inference#potential-outcomes
Back-door Criterion

The Back-door Criterion (Pearl, 1993) is a graphical criterion for identifying a causal effect from observational data. It determines whether adjusting for a set of variables $Z$ is sufficient to identify the causal effect of $X$ on $Y$.
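
When a set $Z$ satisfies the criterion, the interventional distribution is given by the standard back-door adjustment formula, sketched here for discrete $Z$ (the lowercase symbols $x$, $y$, $z$ for values of the variables are assumed notation):

```latex
P\bigl(y \mid do(x)\bigr) = \sum_{z} P(y \mid x, z)\, P(z)
```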

2025-01-28 #causal-inference#scm#dag
BART (Bayesian Additive Regression Trees)

A Bayesian ensemble method that models the outcome as a sum of many trees

2025-01-28 #causal-inference#tree-based#bart
CATE (Conditional Average Treatment Effect)

The Conditional Average Treatment Effect (CATE) is the average treatment effect given covariates $X=x$:
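
In potential-outcomes notation this is commonly written as below. The symbols $\tau(x)$, $Y(1)$, $Y(0)$, and $T$ are assumed here; the second equality holds only under strong ignorability.

```latex
\tau(x) = \mathbb{E}\bigl[\,Y(1) - Y(0) \mid X = x\,\bigr]
        = \mathbb{E}[\,Y \mid T = 1, X = x\,] - \mathbb{E}[\,Y \mid T = 0, X = x\,]
```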

2025-01-28 #causal-inference#cate#hte
Causal Forest

Causal Forest is a causal-inference application of the Generalized Random Forest (GRF) proposed by Athey, Tibshirani, and Wager (2019), splitting so as to maximize the heterogeneity of treatment effects.

2025-01-28 #causal-inference#hte#causal-forest
CEVAE (Causal Effect Variational Autoencoder)

A method that uses a VAE to infer latent confounders and estimate causal effects.

2025-01-28 #causal-inference#representation-learning
CFR (Counterfactual Regression)

A deep learning method that learns balanced representations via IPM (Integral Probability Metric) regularization

2025-01-28 #causal-inference#representation-learning
Collider

A collider is a variable affected by both the treatment (X) and the outcome (Y) (a common effect). In the structure X → C ← Y, C is a collider.
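
A small simulation can make the practical consequence concrete: conditioning on a collider induces an association between its causes even when none exists. This is a sketch under assumed linear-Gaussian mechanisms; the variable names, the noise scale, and the selection threshold `c > 1.0` are illustrative choices, not taken from the note.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100_000

x = rng.normal(size=n)                 # "treatment"
y = rng.normal(size=n)                 # "outcome", generated independently of x
c = x + y + 0.5 * rng.normal(size=n)   # collider: x -> c <- y

# Marginally, x and y are uncorrelated.
corr_all = np.corrcoef(x, y)[0, 1]

# Selecting on the collider (keeping only units with large c) induces a
# negative association between x and y.
sel = c > 1.0
corr_sel = np.corrcoef(x[sel], y[sel])[0, 1]

print(f"corr(x, y) overall:       {corr_all:.3f}")
print(f"corr(x, y) given c > 1.0: {corr_sel:.3f}")
```

This is the reason a collider should not be included in an adjustment set: adjusting for it opens a non-causal path rather than closing one.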

2025-01-28 #causal-inference#scm#dag
Confounder

A confounder is a variable that affects both the treatment (X) and the outcome (Y) (a common cause), creating a spurious (non-causal) association between X and Y.

2025-01-28 #causal-inference#scm#dag
Constraint-Based Methods Overview

Constraint-based methods recover the causal graph by testing conditional independence (CI) relations in the data. Under the faithfulness assumption, they exploit the correspondence between CI relations and d-separation.

2025-01-28 #causal-inference#causal-discovery#constraint-based
d-separation

d-separation (directional separation) is a graphical criterion in a DAG for determining whether two sets of variables are conditionally independent given a third set.

2025-01-28 #causal-inference#scm#dag
DAG (Directed Acyclic Graph)

A DAG (Directed Acyclic Graph) is a graph that visually represents the causal relationships among variables. It is a core tool in causal inference for grasping confounding structure and deciding an identification strategy.

2025-01-28 #causal-inference#scm#dag
do-operator

The do-operator is Pearl's formalization of intervention.

2025-01-28 #causal-inference#scm
Double/Debiased Machine Learning (DML)

A methodology for performing valid statistical inference on a low-dimensional parameter of interest $\theta_0$ in the presence of a high-dimensional nuisance parameter $\eta_0$.

2025-01-28 #causal-inference#double-ml#doubly-robust
Doubly Robust Estimator

The Doubly Robust (DR) Estimator combines an outcome-regression model and a propensity-score model, remaining consistent as long as just one of the two is correctly specified.
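
The double-robustness property can be checked numerically with the AIPW form of the estimator. The sketch below is illustrative only: the data-generating process, the true effect of 2, and the helper name `aipw_ate` are assumptions, and known nuisance functions stand in for fitted models so that "correctly specified" and "misspecified" are unambiguous.

```python
import numpy as np

def aipw_ate(y, t, mu1, mu0, e):
    """AIPW estimate of the ATE from outcome predictions (mu1, mu0) and propensities e."""
    psi = mu1 - mu0 + t * (y - mu1) / e - (1 - t) * (y - mu0) / (1 - e)
    return psi.mean()

rng = np.random.default_rng(0)
n = 200_000
x = rng.normal(size=n)
e_true = 1.0 / (1.0 + np.exp(-x))            # true propensity score
t = rng.binomial(1, e_true)
y = 2.0 * t + 3.0 * x + rng.normal(size=n)   # true ATE = 2, confounded by x

# Naive difference in means is biased because x drives both t and y.
naive = y[t == 1].mean() - y[t == 0].mean()

# Case A: correct outcome model, wrong propensity model (constant 0.5).
ate_a = aipw_ate(y, t, mu1=2.0 + 3.0 * x, mu0=3.0 * x, e=np.full(n, 0.5))

# Case B: wrong outcome model (arm means that ignore x), correct propensity.
mu1_bad = np.full(n, y[t == 1].mean())
mu0_bad = np.full(n, y[t == 0].mean())
ate_b = aipw_ate(y, t, mu1_bad, mu0_bad, e_true)

print(f"naive: {naive:.2f}  case A: {ate_a:.2f}  case B: {ate_b:.2f}")
```

In both cases the AIPW estimate should land close to the true effect while the naive contrast does not. If both nuisance models were misspecified at once, no such guarantee would hold.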

2025-01-28 #causal-inference#doubly-robust#potential-outcomes
DR-Learner

The DR-Learner is a two-stage doubly robust estimator for CATE that regresses a pseudo-outcome on the covariates.

2025-01-28 #causal-inference#hte#cate
Endogeneity

Endogeneity is the problem that arises when an explanatory variable is correlated with the error term.

2025-01-28 #foundations#causal-inference#potential-outcomes
Fundamental Problem of Causal Inference

The problem that, for the same individual, the outcomes under treatment (W=1) and control (W=0) cannot be observed simultaneously

2025-01-28 #causal-inference#potential-outcomes
HTE (Heterogeneous Treatment Effects)

The phenomenon in which the treatment effect varies with an individual's characteristics

2025-01-28 #causal-inference#hte
Instrumental Variables

Instrumental variables (IV) are exogenous variables used to address the problem of endogeneity.

2025-01-28 #foundations#causal-inference#potential-outcomes
IPW (Inverse Propensity Weighting)

Estimating treatment effects by using the inverse of the propensity score as weights

2025-01-28 #causal-inference#reweighting#ipw
ITE (Individual Treatment Effect)

The treatment effect for individual $i$

2025-01-28 #causal-inference#hte
Mediator

A mediator is an intermediate variable lying on the causal pathway through which a treatment (X) affects an outcome (Y). In the structure X → M → Y, M is the mediator.

2025-01-28 #causal-inference#scm#mediation
Meta-learners

"Meta-learners" is a general term for algorithms that estimate the CATE by leveraging existing supervised learning methods (base learners).

2025-01-28 #causal-inference#hte#cate
Positivity (Overlap)

The probability of receiving treatment lies strictly between 0 and 1 for every covariate value

2025-01-28 #causal-inference#potential-outcomes
Propensity Score Matching (PSM)

Matching treated and control individuals with similar propensity scores

2025-01-28 #causal-inference#matching#psm
R-Learner

R-Learner (Residualized Learner) is a meta-learner that estimates the CATE using residualized outcomes and residualized treatments based on the Robinson Transformation.
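
The Robinson transformation and the resulting loss can be sketched as follows. The notation is assumed rather than taken from the note: $m(x) = \mathbb{E}[Y \mid X = x]$ is the conditional mean outcome, $e(x) = \mathbb{E}[T \mid X = x]$ is the propensity score, and hats denote (typically cross-fitted) estimates. A regularization term on $\tau$ is often added and is omitted here.

```latex
Y - m(X) = \bigl(T - e(X)\bigr)\,\tau(X) + \varepsilon,
\qquad \mathbb{E}[\varepsilon \mid X, T] = 0,
\\[4pt]
\hat\tau = \arg\min_{\tau} \; \frac{1}{n} \sum_{i=1}^{n}
  \Bigl[\bigl(Y_i - \hat m(X_i)\bigr) - \bigl(T_i - \hat e(X_i)\bigr)\,\tau(X_i)\Bigr]^2
```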

2025-01-28 #causal-inference#hte#cate
Representation Learning Overview

Methods for learning representations that are independent of treatment while remaining useful for outcome prediction.

2025-01-28 #causal-inference#representation-learning
S-Learner

The S-Learner (Single Learner) is a meta-learner that estimates the response function with a single model including the treatment indicator as a feature, then computes the CATE.

2025-01-28 #causal-inference#hte#meta-learner
SCM (Structural Causal Model)

An SCM (Structural Causal Model) is a framework for mathematically expressing the causal relationships among variables. It is the core of Pearl's causal inference framework.

2025-01-28 #causal-inference#scm
Score-Based Methods Overview

Score-based methods assign a score function to each graph and search for the graph that best fits the data. Unlike constraint-based methods, they optimize model fit without CI tests.

2025-01-28 #causal-inference#causal-discovery#score-based
Strong Ignorability

An assumption combining Ignorability and Positivity

2025-01-28 #causal-inference#potential-outcomes
SUTVA (Stable Unit Treatment Value Assumption)

The potential outcome of one unit is not affected by the treatment assignment of other units, and only a single version exists for each treatment level.

2025-01-28 #causal-inference#potential-outcomes
T-Learner

The T-Learner (Two Learner) is a meta-learner that estimates the CATE by training separate models for the treatment group and the control group.
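
A minimal sketch of the two-model idea, using ordinary least squares as the base learner so that it runs with NumPy alone. The helper names (`fit_ols`, `predict_ols`, `t_learner_cate`) and the simulated data with true CATE $1 + 2x$ are assumptions for illustration; any supervised learner could replace OLS.

```python
import numpy as np

def fit_ols(X, y):
    """Least-squares fit with an intercept; returns the coefficient vector."""
    design = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef

def predict_ols(coef, X):
    return np.column_stack([np.ones(len(X)), X]) @ coef

def t_learner_cate(X, t, y, X_new):
    """Fit one outcome model per arm and return their difference at X_new."""
    coef1 = fit_ols(X[t == 1], y[t == 1])   # model for the treated group
    coef0 = fit_ols(X[t == 0], y[t == 0])   # model for the control group
    return predict_ols(coef1, X_new) - predict_ols(coef0, X_new)

# Simulated randomized experiment with heterogeneous effect tau(x) = 1 + 2x.
rng = np.random.default_rng(0)
n = 20_000
X = rng.normal(size=(n, 1))
t = rng.binomial(1, 0.5, size=n)
y = X[:, 0] + t * (1.0 + 2.0 * X[:, 0]) + rng.normal(size=n)

X_new = np.array([[-1.0], [0.0], [1.0]])
cate_hat = t_learner_cate(X, t, y, X_new)
print(cate_hat)   # should be close to the true values -1, 1, 3
```

Because each arm gets its own model, the approach can be noisy when one group is much smaller than the other, since that arm's model is fit on few observations.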

2025-01-28 #causal-inference#hte#meta-learner
Treatment Effects Overview

A systematic overview of the treatment effects that serve as the estimands in the Potential Outcome Framework.

2025-01-28 #causal-inference#potential-outcomes
X-Learner

The X-Learner is a three-stage meta-learner that uses imputed treatment effects to exploit treatment-group imbalance and structural properties of the CATE.

2025-01-28 #causal-inference#hte#meta-learner
