person
Harrison Edwards
person · active
harrison-edwards-19c27506 · 1 event · first seen 28d ago
Aliases: Harrison Edwards
Recent events (1)
Reinforcement Learning with Prediction-Based Rewards (Random Network Distillation)
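OpenAI introduces Random Network Distillation (RND), a curiosity-driven exploration method for reinforcement learning. A predictor network is trained to match the output of a fixed, randomly initialized network on the observations the agent visits, and its prediction error serves as the intrinsic reward: the error is high on unfamiliar observations and shrinks on frequently visited ones. RND is the first method to exceed average human performance on Montezuma's Revenge, a notoriously hard-exploration Atari game, without using demonstrations or access to the underlying game state. The approach is simple to implement and compatible with standard RL algorithms, offering a scalable alternative to count-based or dynamics-model exploration bonuses.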
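The prediction-error reward described above can be illustrated with a toy sketch. This is not OpenAI's implementation: the class name `RandomNetworkDistillation`, its parameters, and the use of single linear maps in place of deep networks are all assumptions made for brevity. It only shows the core idea, that a predictor trained to imitate a frozen random network yields a low error on familiar observations and a higher one on novel observations.

```python
import numpy as np


class RandomNetworkDistillation:
    """Toy RND bonus (hypothetical helper, linear maps stand in for deep nets)."""

    def __init__(self, obs_dim, feat_dim=8, lr=0.1, seed=0):
        rng = np.random.default_rng(seed)
        # Target network: randomly initialized and never updated.
        self.w_target = rng.normal(size=(obs_dim, feat_dim))
        # Predictor network: trained to match the target's output.
        self.w_pred = rng.normal(size=(obs_dim, feat_dim))
        self.lr = lr

    def _error(self, obs):
        # Difference between predictor and target features for one observation.
        return obs @ self.w_pred - obs @ self.w_target

    def intrinsic_reward(self, obs):
        # Mean squared prediction error, used as the exploration bonus.
        return float(np.mean(self._error(obs) ** 2))

    def update(self, obs):
        # One gradient-descent step on the mean squared error w.r.t. w_pred.
        err = self._error(obs)
        grad = np.outer(obs, err) * (2.0 / err.size)
        self.w_pred -= self.lr * grad


rnd = RandomNetworkDistillation(obs_dim=4)
familiar = np.array([1.0, 0.0, 0.0, 0.0])  # observation the agent keeps visiting
novel = np.array([0.0, 1.0, 0.0, 0.0])     # observation never trained on

for _ in range(200):
    rnd.update(familiar)

print("familiar:", rnd.intrinsic_reward(familiar))
print("novel:   ", rnd.intrinsic_reward(novel))
```

In a full agent, this bonus would typically be added to the environment reward before the policy update, and the predictor would be updated on the same batch of observations the policy sees. How the two rewards are weighted is a design choice that this sketch does not cover.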