2024 Optimization Days, (algorithmic) collusions in games

DeepMind’s AI Master Gamer: Learns 26 Games in 2 Hours

An End-to-End Guide on Reinforcement Learning with Human Feedback