Tae Hyun Kim (Lowell)

SCM (Structural Causal Model)

4분 읽기 #causal-inference#scm

Definition

SCM (Structural Causal Model)은 변수들 간의 인과 관계를 수학적으로 표현하는 framework. Pearl의 causal inference framework의 핵심.

Formal Definition:

M=(V,U,F,P)M = (V, U, F, P)

ComponentMeaning
VVEndogenous variables (관측 가능한 변수들)
UUExogenous variables (관측 불가, VU=V \cap U = \emptyset)
FFStructural equations: fi:(VU)pVf_i: (V \cup U)^p \rightarrow V
PPDistribution over exogenous: P(U)=iP(Ui)P(U) = \prod_i P(U_i)

Structural Equation: Xi:=fi(Pa(Xi),Ui)X_i := f_i(Pa(X_i), U_i)

각 변수는 부모 변수들과 외생적 noise에 의해 결정됨.

Example

Causal Graph G:           SCM M:
    X                     V = {X, Y, Z}
    ↓                     U = {U_X, U_Y, U_Z}
    Y
    ↓                     F = { X := U_X,
    Z                           Y := f(X) + U_Y,
                                Z := g(Y) + U_Z }

                          P = { U_X ~ N(0,1),
                                U_Y ~ N(0,1),
                                U_Z ~ N(0,1) }

Joint Distribution (Markov factorization): P(X,Y,Z)=P(X)P(YX)P(ZY)P(X, Y, Z) = P(X) \cdot P(Y|X) \cdot P(Z|Y)

SCM vs DAG

AspectDAGSCM
표현Qualitative (화살표 유무)Quantitative (함수 형태)
정보량인과 방향만인과 방향 + 함수 관계
용도IdentificationIdentification + Estimation
Intervention개념적 표현수학적 조작 가능

Causal Edge Assumption

각 변수의 값은 부모 변수들에 의해 완전히 결정: Xi:=f(Pa(Xi),Ui)XiVX_i := f(Pa(X_i), U_i) \quad \forall X_i \in V

의미:

  • Causal sufficiency: 모든 공통 원인이 관측됨
  • No unmeasured confounders (in ideal case)

Intervention (do-operator)

Definition

do(X=x)do(X = x): 변수 XX를 값 xx로 강제 설정

수학적 조작:

  1. XX의 structural equation을 X:=xX := x로 대체
  2. 다른 equations는 유지

Example

Original SCM:

X := U_X
Y := 2X + U_Y
Z := Y + U_Z

After do(X=3)do(X = 3):

X := 3          ← Changed!
Y := 2X + U_Y   ← Uses X = 3
Z := Y + U_Z

Interventional vs Observational

P(YX=x)P(Ydo(X=x))P(Y | X = x) \neq P(Y | do(X = x))

  • Observational: 관측 조건부 (confounding 포함)
  • Interventional: 인과적 개입 (confounding 제거)

Causal Effect Identification

From SCM to DAG

SCM MM에서 DAG GG 추출:

  • VV의 각 변수가 node
  • Pa(Xi)XiPa(X_i) \rightarrow X_i edge

do-calculus

Pearl의 세 가지 규칙으로 interventional distribution을 observational로 변환:

Rule 1 (Insertion/deletion of observations): P(ydo(x),z,w)=P(ydo(x),w)P(y|do(x), z, w) = P(y|do(x), w) if (YZX,W)GX(Y \perp Z | X, W)_{G_{\overline{X}}}

Rule 2 (Action/observation exchange): P(ydo(x),do(z),w)=P(ydo(x),z,w)P(y|do(x), do(z), w) = P(y|do(x), z, w) if (YZX,W)GXZ(Y \perp Z | X, W)_{G_{\overline{X}\underline{Z}}}

Rule 3 (Insertion/deletion of actions): P(ydo(x),do(z),w)=P(ydo(x),w)P(y|do(x), do(z), w) = P(y|do(x), w) if (YZX,W)GXZ(W)(Y \perp Z | X, W)_{G_{\overline{X}\overline{Z(W)}}}

Counterfactuals

SCM은 counterfactual reasoning 가능:

Counterfactual query: “If X had been x’, what would Y have been?”

YX=x=Y(Mdo(X=x))Y_{X=x'} = Y(M_{do(X=x')})

Three-step procedure:

  1. Abduction: Observe evidence, infer UU
  2. Action: Modify SCM with do(X=x)do(X=x')
  3. Prediction: Compute YY in modified model

Linear SCM

Linear Gaussian SCM: Xi=jPa(i)βjiXj+Ui,UiN(0,σi2)X_i = \sum_{j \in Pa(i)} \beta_{ji} X_j + U_i, \quad U_i \sim N(0, \sigma_i^2)

Matrix form: X=BX+U\mathbf{X} = B\mathbf{X} + \mathbf{U} X=(IB)1U\mathbf{X} = (I - B)^{-1}\mathbf{U}

특징:

  • Closed-form solution 존재
  • LiNGAM: Non-Gaussian noise로 identifiable

Causal Discovery from SCM

Goal

데이터 DD로부터 underlying SCM (또는 DAG)을 복원

Approaches

  1. Constraint-based (PC, FCI): Conditional independence tests
  2. Score-based (GES, FGES): Score optimization
  3. Asymmetry-based (LiNGAM): Distributional asymmetries

Identifiability

  • Markov Equivalence: 같은 conditional independencies → 같은 Markov Equivalence Class
  • Non-Gaussian: LiNGAM으로 unique DAG 식별 가능
  • DAG - SCM의 graphical representation
  • Confounder - Unmeasured common cause
  • d-separation - Graphical conditional independence
  • Markov Equivalence Class - Observationally equivalent graphs
  • Back-door Criterion - Causal effect identification
  • Potential Outcomes - Alternative causal framework
  • Markov Property - 그래프-분포 관계
  • Graph Foundations Overview - 그래프 표현 전체 정리

References

  • Pearl, J. (2009). Causality: Models, Reasoning, and Inference
  • yaoSurveyCausalInference2021 - SCM in causal discovery context

연결 그래프