Tae Hyun Kim (Lowell)

← 모든 기둥

Personalization

Personalization

HTE · targeting · recommendation · pricing

through-line — 임상의 individualized treatment decisions와 산업의 targeting·recommendation·pricing은 동일한 methodological core의 양면이다.

노트 20개

Dunnhumby — Track 1: Latent-Factor Customer Segmentation

NMF latent factors (92.44% explained variance) + K-Means yield 7 stable behavioral segments (Bootstrap ARI 0.77) with per-segment marketing actions. Illustrative case study on the public Dunnhumby retail dataset.

2026-06-13 #targeting#segmentation
Dunnhumby — Track 2: Causal Targeting via Heterogeneous Treatment Effects

Meta-learner / Causal Forest CATE under severe positivity violation (PS AUC 0.989); an OPE-validated policy targets ~31% of customers and surfaces counter-intuitive negative-CATE segments. Hypothesis-generating on public data.

2026-06-13 #causal-inference#targeting#uplift
Applied Causal Inference for Pricing — CATE & SCM Across Public Datasets

An applied case study using only public datasets (LendingClub, iPinYou) that combines CATE estimation for price-sensitivity heterogeneity with SCM-based moderator analysis to design individual-level, risk-based pricing and RTB bidding policies — all findings illustrative and projected, not proprietary.

2026-06-12 #pricing#causal-inference#cate
Customer Segmentation

Customer Segmentation은 고객을 행동·가치·선호의 유사성으로 유한개 세그먼트로 분할하는 비지도(unsupervised) 과제다. 흔히 잠재요인 분해 후 군집화: 행동 피처 → NMF(비음수 parts-based 분해) → factor score → K-Means → 세그먼트.

2026-06-12 #targeting#segmentation
Customer Segmentation & Causal Targeting — An Applied Case Study

An end-to-end applied case study on the public Dunnhumby dataset — NMF latent factors and K-Means segmentation feeding meta-learner / Causal Forest HTE and an OPE-validated optimal targeting policy, with a candid look at positivity violation and counter-intuitive "sleeping dog" segments.

2026-06-12 #targeting#segmentation#uplift
From Estimation to Action — How HTE Drives Personalized Policy Across Domains

One methodological spine — estimate heterogeneous treatment effects and turn them into individual-level policies — powers both clinical sequential treatment decisions and industrial targeting, pricing, and recommendation.

2026-06-12 #personalization#causal-inference#decision-making
LLM Multi-Layer Attribute Extraction for Cross-Domain Recommendation

A case study on extracting a 3-layer attribute taxonomy (product / perceptual / theory-grounded) with LLM/VLM pipelines, turning it into user profiles and a mixture-of-experts adaptor, and plugging it into standard recommenders across two public domains (fashion + music).

2026-06-12 #recsys#llm4rec#cross-domain
Optimal Targeting Policy

Optimal Targeting Policy는 covariate $x$를 처치 결정 $\pi(x)\in\{0,1\}$로 사상해 정책 가치를 극대화하는 규칙이다:

2026-06-12 #targeting#policy-targeting#doubly-robust
RTB Bidding Strategy via Causal ML — From Prediction to Optimization

A five-stage case study on the public iPinYou RTB dataset that moves from pCTR/pCVR prediction through causal effect estimation (CATE, SCM) to budget-constrained optimal bidding and off-policy policy evaluation.

2026-06-12 #decision-making#targeting#ope
Targeting & Profiling Overview

Targeting & Profiling = personalization의 산업 얼굴. 같은 방법론 코어(이질적 효과 추정 → 개인 수준 최적 정책; MOC-Personalization)가 임상에서는 "환자별 최적 치료 배정"으로, 산업에서는 "고객별 최적 캠페인/노출 배정"으로 나타난다. 이 도메인은 누구에게…

2026-06-12 #targeting#policy-targeting
Uplift Modeling

Uplift(증분효과)는 처치(캠페인 노출·쿠폰·추천)가 한 개인의 결과(구매·전환)에 미치는 인과적 증분이다. 이진 처치 $W\in\{0,1\}$, 결과 $Y$, covariate $X$에 대해

2026-06-12 #targeting#uplift#meta-learner
User Profiling

User Profiling은 고객의 행동 이력으로부터 개인 선호 프로파일(취향·맥락·잠재 패턴)을 추론해 벡터로 표현하는 과제다. 타겟팅·세그멘테이션·추천의 공통 입력층 — 임상에서 환자 covariate/multimodal 표현(Multimodal Clinical Data)에 대응하는 산업 측 표현.

2026-06-12 #targeting#profiling
Dynamic Treatment Regimes (DTR / OTR)

DTR은 누적 이력 $Ht$(공변량·이전 처치·중간결과)를 처치로 사상하는 결정규칙 열 $\{dt(Ht)\}{t=1}^T$. optimal treatment regime(OTR) 은 기대 장기결과 $E[Y^{d}]$를 최대화. 추정:

2026-06-11 #decision-making#clinical-decision-making#dtr
ESCM² (Entire Space Counterfactual Multi-Task Model)

ESMM의 두 가지 이론적 한계 — Inherent Estimation Bias (IEB)와 Potential Independence Priority (PIP) — 를 해결하기 위해, Inverse Propensity Score (IPS) 및 Doubly Robust Estimator 기반…

2026-03-25 #recsys#causal-inference#doubly-robust
ESMM (Entire Space Multi-Task Model)

CVR의 Sample Selection Bias와 Data Sparsity 문제를 동시에 해결하기 위해, $\text{impression} \to \text{click} \to \text{conversion}$의 순차적 사용자 행동을 활용하여 전체 impression space에서 CVR을 간접 학습하는…

2026-03-25 #recsys#representation-learning
DeepFM

DeepFM (Guo et al., 2017)은 FM component와 Deep component를 병렬로 결합하여, low-order feature interaction(explicit)과 high-order feature interaction(implicit)을 동시에 학습하는 CTR 예측 모델이다.

2026-02-13 #recsys#factor-models#factorization-machine
Factorization Machine

Factorization Machine (FM)은 Rendle (2010)이 제안한 범용 예측 모델로, 모든 feature 쌍 간의 상호작용을 latent factor vector의 내적으로 모델링한다.

2026-02-13 #recsys#factor-models#factorization-machine
PNN

PNN (Qu et al., 2016)은 embedding layer와 DNN hidden layer 사이에 product layer를 도입하여, feature embedding 간의 interaction을 명시적으로 포착한 후 DNN으로 전달하는 CTR 예측 모델이다.

2026-02-13 #recsys#factor-models#product-network
Wide and Deep

Wide & Deep (Cheng et al., 2016)은 Linear wide component(memorization)와 DNN deep component(generalization)를 결합한 CTR 예측 모델이다. Google Play 앱 추천에서 처음 적용되었다.

2026-02-13 #recsys#factor-models
Multi-Task Learning

여러 관련 태스크를 동시에 학습하여 공유 표현(shared representation)을 통해 일반화 성능을 향상시키는 학습 패러다임

2026-01-29 #recsys#multi-task-learning