Almanac
Paul Christiano

person·active·paul-christiano-9c27ce25·1 event·first seen 28d ago

Aliases: Paul Christiano

Recent events (1)

6·OpenAI Blog·28d ago

Learning Complex Goals with Iterated Amplification

OpenAI proposes iterated amplification, an AI safety technique for specifying complex goals beyond human scale by decomposing tasks into simpler sub-tasks rather than relying on labeled data or reward functions. The approach avoids the need for explicit reward engineering by having humans demonstrate task decomposition hierarchically. At publication, experiments were limited to simple toy algorithmic domains, but the authors argue it could be a scalable alignment approach. The paper is presented in preliminary form to solicit early community engagement.
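
The decomposition idea in the summary can be illustrated with a toy sketch. This is not OpenAI's training procedure: it only shows a large task being split recursively until each piece is small enough for a limited "human-scale" step to answer, with the sub-answers then combined. The function names (`human_base`, `human_combine`, `amplify`) and the list-summing task are hypothetical, chosen only for illustration.

```python
from typing import Callable, List


def human_base(xs: List[int]) -> int:
    """Hypothetical human-scale step: answer a task with at most one item."""
    return xs[0] if xs else 0


def human_combine(left: int, right: int) -> int:
    """Hypothetical human-scale step: merge two sub-answers into one."""
    return left + right


def amplify(
    xs: List[int],
    solve_small: Callable[[List[int]], int],
    combine: Callable[[int, int], int],
) -> int:
    """Solve a task too large to answer directly by recursive decomposition."""
    # Tasks with zero or one item are small enough to answer directly.
    if len(xs) <= 1:
        return solve_small(xs)
    # Otherwise split the task in half, solve each half, and combine.
    mid = len(xs) // 2
    left = amplify(xs[:mid], solve_small, combine)
    right = amplify(xs[mid:], solve_small, combine)
    return combine(left, right)


if __name__ == "__main__":
    print(amplify([3, 1, 4, 1, 5, 9, 2, 6], human_base, human_combine))
```

In the approach described above, a learned model would be trained to imitate this kind of decomposition; the sketch omits that step and shows only the hierarchical structure.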