technique
self-play reinforcement learning
techniqueactive
self-play-reinforcement-learning-10ea19d7·1 events·first seen 28d agoAliases: self-play reinforcement learning
Co-occurring entities
More like this (12)
self-playCompetitive Self-PlayReinforcement LearningConstrained Reinforcement LearningImitation LearningGoal-Conditioned Reinforcement LearningQ-learningReinforcement Learning from Human FeedbackHierarchical Reinforcement Learningshielded reinforcement learningSoft Q-LearningGeneral Preference Reinforcement Learning
Recent events (1)
OpenAI Bot Defeats Top Dota 2 Professionals at 1v1
OpenAI developed a bot that defeats world-class professional players in 1v1 Dota 2 matches under standard tournament rules. The system learned entirely through self-play without imitation learning or tree search. This was presented as a milestone toward AI systems that can achieve well-defined goals in complex, real-world environments involving humans.