Tae Hyun Kim (Lowell)

SCM (Structural Causal Model)

4 min read #causal-inference#scm

Definition

An SCM (Structural Causal Model) is a framework for mathematically expressing the causal relationships among variables. It is the core of Pearl’s causal inference framework.

Formal Definition:

M=(V,U,F,P)M = (V, U, F, P)

ComponentMeaning
VVEndogenous variables (observable variables)
UUExogenous variables (unobserved, VU=V \cap U = \emptyset)
FFStructural equations: fi:(VU)pVf_i: (V \cup U)^p \rightarrow V
PPDistribution over exogenous: P(U)=iP(Ui)P(U) = \prod_i P(U_i)

Structural Equation: Xi:=fi(Pa(Xi),Ui)X_i := f_i(Pa(X_i), U_i)

Each variable is determined by its parent variables and exogenous noise.

Example

Causal Graph G:           SCM M:
    X                     V = {X, Y, Z}
    ↓                     U = {U_X, U_Y, U_Z}
    Y
    ↓                     F = { X := U_X,
    Z                           Y := f(X) + U_Y,
                                Z := g(Y) + U_Z }

                          P = { U_X ~ N(0,1),
                                U_Y ~ N(0,1),
                                U_Z ~ N(0,1) }

Joint Distribution (Markov factorization): P(X,Y,Z)=P(X)P(YX)P(ZY)P(X, Y, Z) = P(X) \cdot P(Y|X) \cdot P(Z|Y)

SCM vs DAG

AspectDAGSCM
RepresentationQualitative (presence of arrows)Quantitative (functional form)
Information contentCausal direction onlyCausal direction + functional relationships
UseIdentificationIdentification + Estimation
InterventionConceptual representationMathematically manipulable

Causal Edge Assumption

The value of each variable is fully determined by its parent variables: Xi:=f(Pa(Xi),Ui)XiVX_i := f(Pa(X_i), U_i) \quad \forall X_i \in V

Meaning:

  • Causal sufficiency: all common causes are observed
  • No unmeasured confounders (in ideal case)

Intervention (do-operator)

Definition

do(X=x)do(X = x): forcibly set the variable XX to the value xx

Mathematical manipulation:

  1. Replace the structural equation of XX with X:=xX := x
  2. Keep the other equations

Example

Original SCM:

X := U_X
Y := 2X + U_Y
Z := Y + U_Z

After do(X=3)do(X = 3):

X := 3          ← Changed!
Y := 2X + U_Y   ← Uses X = 3
Z := Y + U_Z

Interventional vs Observational

P(YX=x)P(Ydo(X=x))P(Y | X = x) \neq P(Y | do(X = x))

  • Observational: observational conditional (includes confounding)
  • Interventional: causal intervention (removes confounding)

Causal Effect Identification

From SCM to DAG

Extract a DAG GG from an SCM MM:

  • Each variable in VV is a node
  • Pa(Xi)XiPa(X_i) \rightarrow X_i edge

do-calculus

Pearl’s three rules convert an interventional distribution into an observational one:

Rule 1 (Insertion/deletion of observations): P(ydo(x),z,w)=P(ydo(x),w)P(y|do(x), z, w) = P(y|do(x), w) if (YZX,W)GX(Y \perp Z | X, W)_{G_{\overline{X}}}

Rule 2 (Action/observation exchange): P(ydo(x),do(z),w)=P(ydo(x),z,w)P(y|do(x), do(z), w) = P(y|do(x), z, w) if (YZX,W)GXZ(Y \perp Z | X, W)_{G_{\overline{X}\underline{Z}}}

Rule 3 (Insertion/deletion of actions): P(ydo(x),do(z),w)=P(ydo(x),w)P(y|do(x), do(z), w) = P(y|do(x), w) if (YZX,W)GXZ(W)(Y \perp Z | X, W)_{G_{\overline{X}\overline{Z(W)}}}

Counterfactuals

An SCM enables counterfactual reasoning:

Counterfactual query: “If X had been x’, what would Y have been?”

YX=x=Y(Mdo(X=x))Y_{X=x'} = Y(M_{do(X=x')})

Three-step procedure:

  1. Abduction: Observe evidence, infer UU
  2. Action: Modify SCM with do(X=x)do(X=x')
  3. Prediction: Compute YY in modified model

Linear SCM

Linear Gaussian SCM: Xi=jPa(i)βjiXj+Ui,UiN(0,σi2)X_i = \sum_{j \in Pa(i)} \beta_{ji} X_j + U_i, \quad U_i \sim N(0, \sigma_i^2)

Matrix form: X=BX+U\mathbf{X} = B\mathbf{X} + \mathbf{U} X=(IB)1U\mathbf{X} = (I - B)^{-1}\mathbf{U}

Characteristics:

  • A closed-form solution exists
  • LiNGAM: identifiable with non-Gaussian noise

Causal Discovery from SCM

Goal

Recover the underlying SCM (or DAG) from data DD

Approaches

  1. Constraint-based (PC, FCI): Conditional independence tests
  2. Score-based (GES, FGES): Score optimization
  3. Asymmetry-based (LiNGAM): Distributional asymmetries

Identifiability

  • Markov Equivalence: same conditional independencies → same Markov Equivalence Class
  • Non-Gaussian: a unique DAG can be identified with LiNGAM
  • DAG - Graphical representation of an SCM
  • Confounder - Unmeasured common cause
  • d-separation - Graphical conditional independence
  • Markov Equivalence Class - Observationally equivalent graphs
  • Back-door Criterion - Causal effect identification
  • Potential Outcomes - Alternative causal framework
  • Markov Property - Graph-distribution relationship
  • Graph Foundations Overview - Complete overview of graphical representations

References

  • Pearl, J. (2009). Causality: Models, Reasoning, and Inference
  • yaoSurveyCausalInference2021 - SCM in causal discovery context

Local graph