#contextual-bandits

2 notes

Contextual Bandits Contextual Bandits are a multi-armed bandit problem in which the optimal action (arm) varies depending on the context.
RTB Bidding Strategy via Causal ML — From Prediction to Optimization A five-stage case study on the public iPinYou RTB dataset that moves from pCTR/pCVR prediction through causal effect estimation (CATE, SCM) to budget-constrained optimal bidding and off-policy policy evaluation.