BART (Bayesian Additive Regression Trees)
Definition
A Bayesian ensemble method that models the outcome as a sum of many trees
where:
- : the -th tree
- : tree structure
- : leaf node values
- Prior on
Application to Causal Inference
Potential Outcome Modeling
CATE Estimation
Estimated by sampling from the posterior:
Model Structure
Prior Specification
Tree structure prior:
- Node split probability:
- Typically
Leaf value prior:
Variance prior:
MCMC Sampling
Posterior estimation via Gibbs Sampling:
- : MH step
- : Conjugate update
- : Conjugate update
Advantages and Disadvantages
Advantages
| Advantage | Description |
|---|---|
| Uncertainty quantification | Posterior → confidence/credible intervals |
| Flexible nonlinearity | Captures complex interactions |
| Regularization | Prior prevents overfitting |
| Continuous/binary treatment | Both can be handled |
| Automatic variable selection | Discovers important variables |
Disadvantages
| Disadvantage | Description |
|---|---|
| Computational cost | Slow due to MCMC |
| Convergence diagnostics | Requires checking MCMC convergence |
| Hyperparameters | Sensitive to prior specification |
| Large-scale data | Hard to scale |
Causal BART Variants
BCF (Bayesian Causal Forests)
Hahn et al. (2020): separates the treatment effect
- : Prognostic function (BART)
- : Treatment effect function (BART)
ps-BART
Includes the propensity score as a covariate:
Implementation
R (dbarts, bartCause)
library(dbarts)
# Basic BART
bart_fit <- bart(x.train = cbind(X, W),
y.train = Y,
ntree = 200)
# CATE estimation
X1 <- cbind(X, W = 1)
X0 <- cbind(X, W = 0)
pred1 <- predict(bart_fit, X1)
pred0 <- predict(bart_fit, X0)
cate <- colMeans(pred1 - pred0)
# Credible interval
cate_samples <- pred1 - pred0
ci <- apply(cate_samples, 2, quantile, c(0.025, 0.975))
R (bartCause)
library(bartCause)
# Causal BART
fit <- bartc(y = Y, z = W, x = X,
method.rsp = "bart",
method.trt = "bart")
# CATE
cate <- predict(fit)
Comparison with Causal Forest
| Property | BART | Causal Forest |
|---|---|---|
| Inference | Bayesian | Frequentist |
| Uncertainty | Posterior | Bootstrap/Asymptotic |
| Honest | No | Yes |
| Speed | Slow (MCMC) | Fast |
| Theory | Posterior consistency | -normality |
Related Concepts
- Tree-based Methods Overview - integration of tree-based methods
- Causal Forest - Frequentist alternative
- Honest Estimation - overfitting prevention
- HTE - estimation target
Key Papers
- Hill, J. L. (2011). Bayesian nonparametric modeling for causal inference. JCGS
- Chipman, H. A., George, E. I., & McCulloch, R. E. (2010). BART: Bayesian additive regression trees. Annals of Applied Statistics
- Hahn, P. R., Murray, J. S., & Carvalho, C. M. (2020). Bayesian regression tree models for causal inference. Bayesian Analysis
- yaoSurveyCausalInference2021 - Section 3.4.3