Tae Hyun Kim (Lowell)

Influence Function

Definition

If an estimator ψ^\hat\psi of a functional parameter ψ:PR\psi:\mathcal{P}\to\mathbb{R} is asymptotically linear, then an influence function (IF) ϕ\phi exists such that n(ψ^ψ(P))=1ni=1nϕ(Oi;P)+op(1),EP[ϕ]=0.\sqrt{n}\big(\hat\psi-\psi(P)\big)=\frac{1}{\sqrt{n}}\sum_{i=1}^n \phi(O_i;P)+o_p(1),\quad E_P[\phi]=0. The asymptotic variance is Var=EP[ϕ2]\mathrm{Var}=E_P[\phi^2]. Equivalently, the IF is the Gateaux derivative along the path Pε=(1ε)P+εδoP_\varepsilon=(1-\varepsilon)P+\varepsilon\delta_o: ϕ(o)=ddεψ(Pε)ε=0\phi(o)=\frac{d}{d\varepsilon}\psi(P_\varepsilon)\big|_{\varepsilon=0}. von Mises expansion: ψ(Pˉ)ψ(P)=ϕ(o;P)d(PˉP)(o)+R2\psi(\bar P)-\psi(P)=\int\phi(o;P)\,d(\bar P-P)(o)+R_2.

Intuitive Understanding

How much does a single observation perturb the estimate — the “leverage” of each data point. Because the remainder R2R_2 is a product (second-order) of nuisance estimation errors, the first-order bias of the plug-in estimator can be corrected by the mean of the IF (One-step Estimator · AIPW).

Key Papers

  • Hines, Dukes, Diaz-Ordaz & Vansteelandt, “Demystifying Statistical Learning Based on Efficient Influence Functions”, The American Statistician 76(3):292–304, 2022 — introduction to the EIF, start here
  • Kennedy, “Semiparametric doubly robust targeted DML: a review”, arXiv:2203.06469, 2022
  • van der Vaart, Asymptotic Statistics, 1998, Ch.25

Local graph