Random Network Distillation

technique · active · random-network-distillation-5d746235 · 1 event · first seen 28d ago

Aliases: Random Network Distillation

Recent events (1)

6 · OpenAI Blog · 28d ago

Reinforcement Learning with Prediction-Based Rewards (Random Network Distillation)

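OpenAI introduces Random Network Distillation (RND), a curiosity-driven exploration method for reinforcement learning. A predictor network is trained to match the output of a fixed, randomly initialized target network on the agent's observations, and the prediction error serves as an intrinsic reward: it is high for unfamiliar observations and shrinks as states are revisited. According to the post, RND is the first method to exceed average human performance on Montezuma's Revenge, a notoriously hard-exploration Atari game, without using demonstrations or access to the game's underlying state. The approach is simple to implement and compatible with standard RL algorithms, offering a scalable alternative to count-based or dynamics-model exploration bonuses.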
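The prediction-error bonus summarized above can be illustrated with a minimal sketch in plain Python. This is not OpenAI's implementation: the class name `RNDBonus`, the single-layer networks, the learning rate, and the toy observations are all assumptions chosen for illustration, and the sketch omits the reward and observation normalization a practical agent would likely need.

```python
import math
import random


def random_weights(rng, n_in, n_out, scale):
    """Return an n_out x n_in matrix of Gaussian weights as nested lists."""
    return [[rng.gauss(0.0, scale) for _ in range(n_in)] for _ in range(n_out)]


def linear(weights, obs):
    """Matrix-vector product: one output per weight row."""
    return [sum(w * x for w, x in zip(row, obs)) for row in weights]


class RNDBonus:
    """Hypothetical helper: prediction error against a fixed random target."""

    def __init__(self, obs_dim, feat_dim=8, lr=0.1, seed=0):
        rng = random.Random(seed)
        # Target network: randomly initialized, then frozen (never trained).
        self.target_w = random_weights(rng, obs_dim, feat_dim, scale=1.0)
        # Predictor network: trained to imitate the target's outputs.
        self.predictor_w = random_weights(rng, obs_dim, feat_dim, scale=0.1)
        self.lr = lr

    def _target(self, obs):
        return [math.tanh(v) for v in linear(self.target_w, obs)]

    def intrinsic_reward(self, obs):
        """Mean squared error between predictor and target features."""
        target = self._target(obs)
        pred = linear(self.predictor_w, obs)
        return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(target)

    def update(self, obs):
        """One SGD step on the predictor only; the target stays fixed."""
        target = self._target(obs)
        pred = linear(self.predictor_w, obs)
        n = len(target)
        for row, p, t in zip(self.predictor_w, pred, target):
            grad_scale = 2.0 * (p - t) / n  # d(MSE)/d(prediction)
            for i, x in enumerate(obs):
                row[i] -= self.lr * grad_scale * x


bonus = RNDBonus(obs_dim=4)
seen = [1.0, 0.0, -1.0, 0.5]     # toy observation the agent keeps visiting
novel = [-0.5, 2.0, 0.3, -1.0]   # toy observation it has never trained on

before = bonus.intrinsic_reward(seen)
for _ in range(200):
    bonus.update(seen)
after = bonus.intrinsic_reward(seen)

print(f"seen state:  {before:.4f} -> {after:.6f}")
print(f"novel state: {bonus.intrinsic_reward(novel):.4f}")
```

In an RL loop, a bonus of this kind would typically be added to the environment reward, scaled by a coefficient, before the policy update. Because only the predictor is trained, the bonus for a state decays as that state is revisited, while states the predictor has not been trained on keep a larger error.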