Almanac
product

RL-Teacher

product·active·rl-teacher-8f583b9c·1 event·first seen 28d ago

Aliases: RL-Teacher

Recent events (1)

5·OpenAI Blog·28d ago

OpenAI Releases RL-Teacher: Open-Source Human Feedback Interface for RL

OpenAI released RL-Teacher, an open-source implementation of an interface for training AI systems from occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step toward safer AI systems, and it also applies to reinforcement learning problems where the reward is difficult to specify. The release is an early public example of human-in-the-loop RL tooling from OpenAI.
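
The summary above does not say how human feedback replaces a reward function, so the following is only a minimal sketch of one common approach. It assumes that feedback arrives as pairwise comparisons between short trajectory segments and that the reward is linear in hand-picked features. Every name here (`segment_features`, `train_reward_model`, `run_demo`, the hidden weights standing in for a human) is hypothetical and is not part of RL-Teacher's API.

```python
import math
import random


def segment_features(segment):
    """Sum the per-step feature vectors of one trajectory segment."""
    dim = len(segment[0])
    return [sum(step[i] for step in segment) for i in range(dim)]


def predicted_return(weights, segment):
    """Learned reward summed over the segment (linear in the features)."""
    return sum(w * f for w, f in zip(weights, segment_features(segment)))


def train_reward_model(comparisons, dim, lr=0.05, epochs=200):
    """Fit reward weights to (segment_a, segment_b, label) triples.

    label is 1.0 if the human preferred segment_a and 0.0 if segment_b.
    """
    weights = [0.0] * dim
    for _ in range(epochs):
        for seg_a, seg_b, label in comparisons:
            fa, fb = segment_features(seg_a), segment_features(seg_b)
            diff = [a - b for a, b in zip(fa, fb)]
            logit = sum(w * d for w, d in zip(weights, diff))
            logit = max(-30.0, min(30.0, logit))  # keep exp() in range
            # Modelled probability that the human prefers segment_a.
            p_a = 1.0 / (1.0 + math.exp(-logit))
            # Gradient step on the cross-entropy between p_a and the label.
            for i in range(dim):
                weights[i] -= lr * (p_a - label) * diff[i]
    return weights


def run_demo(seed=0):
    """Train on simulated comparisons and measure held-out agreement."""
    rng = random.Random(seed)
    hidden = [1.0, -0.5]  # assumption: stands in for a human's unstated preferences

    def random_segment():
        return [[rng.uniform(-1, 1) for _ in hidden] for _ in range(5)]

    def simulated_human(seg_a, seg_b):
        better = predicted_return(hidden, seg_a) > predicted_return(hidden, seg_b)
        return 1.0 if better else 0.0

    def make_pairs(n):
        pairs = []
        for _ in range(n):
            a, b = random_segment(), random_segment()
            pairs.append((a, b, simulated_human(a, b)))
        return pairs

    weights = train_reward_model(make_pairs(100), dim=len(hidden))
    held_out = make_pairs(200)
    agree = sum(
        1 for a, b, label in held_out
        if (predicted_return(weights, a) > predicted_return(weights, b)) == (label == 1.0)
    )
    return weights, agree / len(held_out)


if __name__ == "__main__":
    learned, agreement = run_demo()
    print("learned weights:", learned)
    print("held-out agreement:", agreement)
```

In a full system the learned reward would then be handed to a standard RL algorithm in place of a hand-written reward function; that step is omitted here, and a real setup would likely use a neural network rather than a linear model.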