Almanac
person

Harrison Edwards

personactiveharrison-edwards-19c27506·1 events·first seen 28d ago

Aliases: Harrison Edwards

Co-occurring entities

More like this (12)

Recent events (1)

6Openai Blog·28d ago·source ↗

Reinforcement Learning with Prediction-Based Rewards (Random Network Distillation)

OpenAI introduces Random Network Distillation (RND), a curiosity-driven exploration method for reinforcement learning that uses prediction error on a fixed random neural network as an intrinsic reward signal. RND is the first method to exceed average human performance on Montezuma's Revenge, a notoriously hard-exploration Atari game. The approach is simple to implement and compatible with standard RL algorithms, offering a scalable alternative to count-based or dynamics-model exploration bonuses.