Sonic the Hedgehog (RL benchmark)

benchmark · active · sonic-the-hedgehog-rl-benchmark--cba89552 · 1 event · first seen 28d ago

Aliases: Sonic the Hedgehog (RL benchmark)

Recent events (1)

OpenAI Blog · 28d ago

OpenAI Releases CoinRun Environment for Measuring RL Generalization

OpenAI released CoinRun, a procedurally generated platformer training environment designed to measure how well reinforcement learning agents generalize to novel situations. The environment is positioned as simpler than the Sonic the Hedgehog benchmark but still challenging enough to expose generalization failures in state-of-the-art RL algorithms. It targets a longstanding problem in RL research: distinguishing agents that overfit to their training environments from agents that truly generalize.