benchmark
Sonic the Hedgehog (RL benchmark)
benchmarkactive
sonic-the-hedgehog-rl-benchmark--cba89552·1 events·first seen 28d agoAliases: Sonic the Hedgehog (RL benchmark)
Co-occurring entities
More like this (12)
Recent events (1)
OpenAI Releases CoinRun Environment for Measuring RL Generalization
OpenAI released CoinRun, a procedurally generated platformer training environment designed to measure reinforcement learning agents' ability to generalize to novel situations. The environment is positioned as simpler than Sonic the Hedgehog benchmarks but still challenging enough to expose generalization failures in state-of-the-art RL algorithms. It addresses a longstanding puzzle in RL research around overfitting to training environments versus true generalization.