Almanac
technique

Variance Reduction

technique · active · variance-reduction-61ff8044 · 1 event · first seen 28d ago

Aliases: Variance Reduction

Recent events (1)

3 · OpenAI Blog · 28d ago

Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines

OpenAI published a research paper on variance reduction for policy gradient methods in reinforcement learning. The work introduces action-dependent factorized baselines, which reduce the variance of policy gradient estimates without introducing bias. Lower-variance gradient estimates generally improve sample efficiency, which makes this a foundational contribution to RL training methodology.
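To make the idea concrete, here is a minimal sketch, not the paper's implementation. It assumes a one-step problem with two independent Bernoulli action dimensions and a hypothetical reward function, and it computes each baseline exactly by enumeration, whereas a practical method would have to learn or approximate it. The names `reward`, `estimator_stats`, and the three baseline functions are illustrative. For a policy that factorizes across action dimensions, the baseline used for dimension i may depend on the other dimensions' sampled actions without biasing the gradient, and the sketch compares that choice against no baseline and an ordinary action-independent baseline.

```python
import itertools
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def reward(a):
    # Hypothetical reward: a large constant offset plus one term per action dimension.
    return 10.0 + 1.0 * a[0] + 2.0 * a[1]

def action_prob(a, p):
    # Probability of the joint action under independent Bernoulli dimensions.
    return float(np.prod(np.where(a == 1, p, 1 - p)))

def no_baseline(i, a, p):
    return 0.0

def state_baseline(i, a, p):
    # Action-independent baseline: expected reward over all joint actions.
    actions = [np.array(x, dtype=float) for x in itertools.product([0, 1], repeat=2)]
    return sum(action_prob(x, p) * reward(x) for x in actions)

def factorized_baseline(i, a, p):
    # Action-dependent baseline for dimension i: expectation over a_i only,
    # holding the other dimension at its sampled value.
    hi, lo = a.copy(), a.copy()
    hi[i], lo[i] = 1.0, 0.0
    return p[i] * reward(hi) + (1 - p[i]) * reward(lo)

def estimator_stats(theta, baseline):
    """Exact mean and total variance of the score-function gradient estimator."""
    p = sigmoid(theta)
    mean = np.zeros(2)
    second_moment = np.zeros(2)
    for x in itertools.product([0, 1], repeat=2):
        a = np.array(x, dtype=float)
        score = a - p  # d/d theta_i of log pi_i(a_i) for a Bernoulli with logit theta_i
        g = np.array([score[i] * (reward(a) - baseline(i, a, p)) for i in range(2)])
        mean += action_prob(a, p) * g
        second_moment += action_prob(a, p) * g**2
    return mean, float(np.sum(second_moment - mean**2))

theta = np.array([0.3, -0.5])  # arbitrary policy logits, chosen for illustration
for name, b in [("none", no_baseline),
                ("state-only", state_baseline),
                ("action-dependent", factorized_baseline)]:
    mean, var = estimator_stats(theta, b)
    print(f"{name:>17}: mean gradient = {mean}, total variance = {var:.4f}")
```

In this toy setting all three estimators have the same mean, which is the true gradient, and the action-dependent baseline gives the lowest variance. That ordering is a property of this particular example and should not be read as a general guarantee for arbitrary rewards or learned baselines.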