SCM (Structural Causal Model)
Definition
SCM (Structural Causal Model)은 변수들 간의 인과 관계를 수학적으로 표현하는 framework. Pearl의 causal inference framework의 핵심.
Formal Definition:
| Component | Meaning |
|---|---|
| Endogenous variables (관측 가능한 변수들) | |
| Exogenous variables (관측 불가, ) | |
| Structural equations: | |
| Distribution over exogenous: |
Structural Equation:
각 변수는 부모 변수들과 외생적 noise에 의해 결정됨.
Example
Causal Graph G: SCM M:
X V = {X, Y, Z}
↓ U = {U_X, U_Y, U_Z}
Y
↓ F = { X := U_X,
Z Y := f(X) + U_Y,
Z := g(Y) + U_Z }
P = { U_X ~ N(0,1),
U_Y ~ N(0,1),
U_Z ~ N(0,1) }
Joint Distribution (Markov factorization):
SCM vs DAG
| Aspect | DAG | SCM |
|---|---|---|
| 표현 | Qualitative (화살표 유무) | Quantitative (함수 형태) |
| 정보량 | 인과 방향만 | 인과 방향 + 함수 관계 |
| 용도 | Identification | Identification + Estimation |
| Intervention | 개념적 표현 | 수학적 조작 가능 |
Causal Edge Assumption
각 변수의 값은 부모 변수들에 의해 완전히 결정:
의미:
- Causal sufficiency: 모든 공통 원인이 관측됨
- No unmeasured confounders (in ideal case)
Intervention (do-operator)
Definition
: 변수 를 값 로 강제 설정
수학적 조작:
- 의 structural equation을 로 대체
- 다른 equations는 유지
Example
Original SCM:
X := U_X
Y := 2X + U_Y
Z := Y + U_Z
After :
X := 3 ← Changed!
Y := 2X + U_Y ← Uses X = 3
Z := Y + U_Z
Interventional vs Observational
- Observational: 관측 조건부 (confounding 포함)
- Interventional: 인과적 개입 (confounding 제거)
Causal Effect Identification
From SCM to DAG
SCM 에서 DAG 추출:
- 의 각 변수가 node
- edge
do-calculus
Pearl의 세 가지 규칙으로 interventional distribution을 observational로 변환:
Rule 1 (Insertion/deletion of observations): if
Rule 2 (Action/observation exchange): if
Rule 3 (Insertion/deletion of actions): if
Counterfactuals
SCM은 counterfactual reasoning 가능:
Counterfactual query: “If X had been x’, what would Y have been?”
Three-step procedure:
- Abduction: Observe evidence, infer
- Action: Modify SCM with
- Prediction: Compute in modified model
Linear SCM
Linear Gaussian SCM:
Matrix form:
특징:
- Closed-form solution 존재
- LiNGAM: Non-Gaussian noise로 identifiable
Causal Discovery from SCM
Goal
데이터 로부터 underlying SCM (또는 DAG)을 복원
Approaches
- Constraint-based (PC, FCI): Conditional independence tests
- Score-based (GES, FGES): Score optimization
- Asymmetry-based (LiNGAM): Distributional asymmetries
Identifiability
- Markov Equivalence: 같은 conditional independencies → 같은 Markov Equivalence Class
- Non-Gaussian: LiNGAM으로 unique DAG 식별 가능
Related Concepts
- DAG - SCM의 graphical representation
- Confounder - Unmeasured common cause
- d-separation - Graphical conditional independence
- Markov Equivalence Class - Observationally equivalent graphs
- Back-door Criterion - Causal effect identification
- Potential Outcomes - Alternative causal framework
- Markov Property - 그래프-분포 관계
- Graph Foundations Overview - 그래프 표현 전체 정리
References
- Pearl, J. (2009). Causality: Models, Reasoning, and Inference
- yaoSurveyCausalInference2021 - SCM in causal discovery context