Entity · technique

Random Network Distillation

technique · active · random-network-distillation-5d746235 · 1 event · first seen May 20, 2026

Aliases: Random Network Distillation

Co-occurring entities

OpenAI · Yuri Burda · Montezuma's Revenge · Harrison Edwards

More like this (12)

Model Distillation · Automatic Domain Randomization · Selective Proxy Distillation · dynamics randomization · Routing-based On-Policy Distillation · Generalized Distillation · distributed training · On-Policy Distillation (OPD) · Denoising Diffusion Policy Optimization · on-policy self-distillation · Parallel Decoding Distillation · Self-Distillation

Recent events (1)

6 · OpenAI Blog · May 20, 2026

Reinforcement Learning with Prediction-Based Rewards (Random Network Distillation)

OpenAI introduces Random Network Distillation (RND), a curiosity-driven exploration method for reinforcement learning. A predictor network is trained to match the output of a fixed, randomly initialized target network, and its prediction error on each observation serves as an intrinsic reward, so rarely seen observations earn a larger bonus. RND is the first method to exceed average human performance on Montezuma's Revenge, a notoriously hard-exploration Atari game, without using demonstrations or access to the game's underlying state. The approach is simple to implement and compatible with standard RL algorithms, offering a scalable alternative to count-based or dynamics-model exploration bonuses.
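The prediction-error bonus described in this summary can be illustrated with a toy sketch. It is not OpenAI's implementation: the class name `ToyRND`, the one-layer tanh target, the linear predictor, the learning rate, and the hand-picked observations are all assumptions chosen to keep the example dependency-free. It only shows the core idea: the target stays frozen, the predictor is trained on visited observations, and the squared error acts as the intrinsic reward.

```python
import math
import random


class ToyRND:
    """Toy RND bonus (hypothetical): a linear predictor chases a fixed random target."""

    def __init__(self, obs_dim, feat_dim, lr=0.1, seed=0):
        rng = random.Random(seed)
        # Target network: randomly initialized once, then never updated.
        self.target_w = [[rng.gauss(0.0, 1.0) for _ in range(obs_dim)]
                         for _ in range(feat_dim)]
        # Predictor network: starts at zero and is trained online.
        self.pred_w = [[0.0] * obs_dim for _ in range(feat_dim)]
        self.lr = lr

    def _target(self, obs):
        return [math.tanh(sum(w * x for w, x in zip(row, obs)))
                for row in self.target_w]

    def _predict(self, obs):
        return [sum(w * x for w, x in zip(row, obs)) for row in self.pred_w]

    def intrinsic_reward(self, obs):
        """Squared prediction error; large where the predictor has not been fit."""
        return sum((p - t) ** 2
                   for p, t in zip(self._predict(obs), self._target(obs)))

    def update(self, obs):
        """One SGD step on 0.5 * squared error; only the predictor changes."""
        errors = [p - t for p, t in zip(self._predict(obs), self._target(obs))]
        for row, err in zip(self.pred_w, errors):
            for j, x in enumerate(obs):
                row[j] -= self.lr * err * x


rnd = ToyRND(obs_dim=4, feat_dim=8)
familiar = [1.0, 0.0, -1.0, 0.5]   # observation the agent keeps revisiting
novel = [0.0, 1.0, 0.0, 0.0]       # observation the agent has never seen

reward_before = rnd.intrinsic_reward(familiar)
for _ in range(50):                # repeated visits train the predictor
    rnd.update(familiar)
reward_after = rnd.intrinsic_reward(familiar)
reward_novel = rnd.intrinsic_reward(novel)

print(reward_before, reward_after, reward_novel)
```

In this sketch the bonus for the familiar observation shrinks toward zero as the predictor fits it, while the unseen observation keeps a comparatively large bonus. In an RL loop, a bonus of this kind would typically be added to the environment reward before the policy update; how the two are weighted is left out here.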

Evaluation and Benchmarking · AI Safety Research · OpenAI · Random Network Distillation · Yuri Burda · +2 more