RL²

Aliases: RL²

Recent events (1)

OpenAI Blog · 28d ago

RL²: Fast Reinforcement Learning via Slow Reinforcement Learning

OpenAI published RL², a meta-reinforcement learning approach in which a slow, general-purpose outer RL algorithm trains the weights of a recurrent neural network so that the network itself implements a fast inner learning algorithm. At each step the network receives observations, actions, rewards, and termination flags, and its hidden state, which persists across episodes of the same task, stores what the fast learner has inferred about that task. This lets agents adapt to a new task within a handful of episodes by drawing on experience accumulated across many training tasks. The work is an early, foundational contribution to meta-learning for RL. It predates the modern agent and LLM era but remains relevant to the intellectual lineage of in-context and few-shot learning.
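
The inner "fast" loop described above can be illustrated with a minimal sketch. It assumes a Bernoulli multi-armed bandit as the task family and a plain tanh RNN with random, untrained weights; the names `make_input`, `RNNPolicy`, and `run_trial` are hypothetical and not taken from the paper or any library. The slow outer RL algorithm that would train the weights across many sampled tasks is deliberately omitted, so this shows only the interface: what the network sees at each step and when its hidden state is reset.

```python
import math
import random

def make_input(prev_action, prev_reward, done, n_actions):
    """Per-step input: one-hot previous action, previous reward, done flag.
    A bandit has no observation; a full MDP would prepend one here."""
    one_hot = [0.0] * n_actions
    if prev_action is not None:
        one_hot[prev_action] = 1.0
    return one_hot + [prev_reward, 1.0 if done else 0.0]

def matvec(W, v):
    return [sum(w * x for w, x in zip(row, v)) for row in W]

class RNNPolicy:
    """Plain tanh RNN (an assumption for brevity). In RL² the weights
    would be learned by the slow outer RL loop; here they are random."""
    def __init__(self, n_actions, hid_dim, rng):
        in_dim = n_actions + 2
        def rand(rows, cols):
            return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]
        self.W_in = rand(hid_dim, in_dim)
        self.W_h = rand(hid_dim, hid_dim)
        self.W_out = rand(n_actions, hid_dim)
        self.hid_dim = hid_dim

    def initial_state(self):
        return [0.0] * self.hid_dim

    def step(self, x, h):
        pre = [a + b for a, b in zip(matvec(self.W_in, x), matvec(self.W_h, h))]
        h = [math.tanh(z) for z in pre]
        logits = matvec(self.W_out, h)
        m = max(logits)
        exps = [math.exp(l - m) for l in logits]
        total = sum(exps)
        return [e / total for e in exps], h  # softmax action probabilities, new state

def run_trial(policy, arm_probs, episodes, steps_per_episode, rng):
    """One trial = several episodes on the same task (same arm_probs)."""
    n_actions = len(arm_probs)
    h = policy.initial_state()          # reset once per trial, i.e. per new task
    prev_a, prev_r, done = None, 0.0, False
    rewards = []
    for _ in range(episodes):
        for t in range(steps_per_episode):
            x = make_input(prev_a, prev_r, done, n_actions)
            probs, h = policy.step(x, h)  # h is NOT reset between episodes
            a = rng.choices(range(n_actions), weights=probs)[0]
            r = 1.0 if rng.random() < arm_probs[a] else 0.0
            done = t == steps_per_episode - 1
            prev_a, prev_r = a, r
            rewards.append(r)
    return rewards, h

rng = random.Random(0)
policy = RNNPolicy(n_actions=3, hid_dim=8, rng=rng)
rewards, final_state = run_trial(policy, [0.1, 0.5, 0.9], episodes=2,
                                 steps_per_episode=5, rng=rng)
```

With untrained weights the policy behaves close to uniformly at random. The point of the sketch is the data flow: the hidden state is the only memory the policy has, so any improvement between the first and last episode of a trial has to come from what accumulates in it.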