contextual-bandit
Expedia
Tue Nov 05 2024
Identifying Top-Scoring Arms in Ranking Bandits With Linear Payoffs in Real-Time
reward-engineering
Netflix
Thu Aug 29 2024
Recommending for Long-Term Member Satisfaction at Netflix
mlops
Lyft
Tue Mar 12 2024
Lyft’s Reinforcement Learning Platform