Learning with Opponent-Learning Awareness (LOLA)

technique · active · learning-with-opponent-learning-awareness-lola--c06d34c6 · 1 event · first seen 28d ago

Aliases: Learning with Opponent-Learning Awareness (LOLA)

Recent events (1)

4 · OpenAI Blog · 28d ago

Learning to Model Other Minds: OpenAI Releases LOLA Algorithm

OpenAI has released Learning with Opponent-Learning Awareness (LOLA), an algorithm for multi-agent settings in which each agent, when updating its own policy, accounts for the fact that the other agents are also learning. In the iterated prisoner's dilemma, LOLA agents discover self-interested yet collaborative strategies such as tit-for-tat. The work is an early step toward agents that can model other minds and reason about opponent behavior.
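
To make the idea of "accounting for the other agent's learning" concrete, here is a minimal sketch of one opponent-aware update in the iterated prisoner's dilemma. It is not the released implementation. Everything in it is an assumption made for illustration: the payoff values, the discount factor, the truncated horizon, the memory-one policy parameterization, and the helper names (`values`, `naive_grad`, `lola_grad`). For simplicity it differentiates numerically through the opponent's full anticipated step, whereas the original paper uses a first-order Taylor expansion with analytic or sampled gradients.

```python
import math

# Assumed per-step payoffs in joint states (CC, CD, DC, DD); first letter is agent 1's move.
R1 = [-1.0, -3.0, 0.0, -2.0]
R2 = [-1.0, 0.0, -3.0, -2.0]
GAMMA = 0.96   # assumed discount factor
HORIZON = 200  # assumed truncation of the discounted sum


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def joint(a, b):
    """Distribution over (CC, CD, DC, DD) when the agents cooperate w.p. a and b."""
    return [a * b, a * (1 - b), (1 - a) * b, (1 - a) * (1 - b)]


def values(theta1, theta2):
    """Truncated discounted returns (V1, V2) of two memory-one policies.

    theta[0] is the logit of cooperating on the first move; theta[1:5] are the
    logits of cooperating after the joint outcomes CC, CD, DC, DD.
    """
    p1 = [sigmoid(t) for t in theta1]
    p2 = [sigmoid(t) for t in theta2]
    state = joint(p1[0], p2[0])
    v1 = v2 = 0.0
    discount = 1.0
    for _ in range(HORIZON):
        v1 += discount * sum(s * r for s, r in zip(state, R1))
        v2 += discount * sum(s * r for s, r in zip(state, R2))
        nxt = [0.0] * 4
        for k in range(4):
            for j, q in enumerate(joint(p1[k + 1], p2[k + 1])):
                nxt[j] += state[k] * q
        state = nxt
        discount *= GAMMA
    return v1, v2


def grad(f, x, h=1e-4):
    """Central finite-difference gradient of a scalar function f at list x."""
    g = []
    for i in range(len(x)):
        up = x[:i] + [x[i] + h] + x[i + 1:]
        dn = x[:i] + [x[i] - h] + x[i + 1:]
        g.append((f(up) - f(dn)) / (2 * h))
    return g


def naive_grad(theta1, theta2):
    """Agent 1's gradient when it treats agent 2's policy as fixed."""
    return grad(lambda t1: values(t1, theta2)[0], theta1)


def lola_grad(theta1, theta2, eta=1.0):
    """Agent 1's gradient after anticipating one naive learning step by agent 2."""
    def lookahead_value(t1):
        # Agent 2's anticipated step depends on t1, so the outer gradient sees it.
        g2 = grad(lambda t2: values(t1, t2)[1], theta2)
        theta2_next = [t + eta * g for t, g in zip(theta2, g2)]
        return values(t1, theta2_next)[0]
    return grad(lookahead_value, theta1)


# One opponent-aware update for agent 1, starting from uniformly random policies.
theta1 = [0.0] * 5
theta2 = [0.0] * 5
step = 0.5  # assumed learning rate
g = lola_grad(theta1, theta2)
theta1_new = [t + step * gi for t, gi in zip(theta1, g)]
```

With `eta=0` the anticipated opponent step vanishes and `lola_grad` reduces to `naive_grad`, which is a quick way to see what the look-ahead term adds. A full learning loop would apply the symmetric update to both agents repeatedly. Whether that loop settles on tit-for-tat in this simplified sketch depends on the step sizes and initialization, so no outcome is claimed here.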