Tae Hyun Kim (Lowell)
← 모든 기둥
Decision-Making under Uncertainty

Decision-Making under Uncertainty

bandits · RL · OPE · DTR/OTR · policy learning

추정된 효과를 결정으로 — optimal policy learning, bandits·reinforcement learning, off-policy evaluation, dynamic/optimal treatment regimes.

노트 20개