Entity · technique

Action-Dependent Factorized Baselines

technique · active · action-dependent-factorized-baselines-ccb6691e · 1 event · first seen May 20, 2026

Aliases: Action-Dependent Factorized Baselines

Co-occurring entities

Policy Gradient Methods · Variance Reduction · OpenAI

More like this (12)

OpenAI Baselines · leave-one-out baseline · Action-BED: Task-Driven Bayesian Experimental Design with Singly Intractable Objectives · code-as-action agents · Advanced AI Scaling Framework · ACTION-BED · FACTOR · Aspect-Based Sentiment Analysis · Adaptive Depth Sparse Framework · BERT-base · humanlayer/12-factor-agents · FutureBench

Recent events (1)

3 · OpenAI Blog · May 20, 2026

Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines

OpenAI published a research paper on variance reduction for policy gradient methods in reinforcement learning. The work introduces action-dependent factorized baselines, which reduce the variance of policy gradient estimates without introducing bias. For a policy that factorizes across action dimensions, the baseline used for each dimension may depend on the state and on the other action dimensions. Lower-variance gradient estimates are relevant to improving sample efficiency in reinforcement learning training.
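The idea can be illustrated with a minimal sketch. It is not taken from the paper or from any OpenAI code. It assumes a stateless toy problem, a factorized Gaussian policy with unit variance, and an additive quadratic reward, so that the baseline for each action dimension can be written in closed form. The names `gradient_samples` and `compare_baselines` are hypothetical. The sketch compares a score-function gradient estimator with no baseline, with a constant baseline, and with a baseline for dimension i that depends on the other action dimensions but not on a_i.

```python
import numpy as np


def gradient_samples(mu, target, n_samples, baseline, rng):
    """Per-sample score-function gradient estimates with respect to mu.

    Toy setting (an assumption for illustration): a stateless problem with a
    factorized policy a_i ~ N(mu_i, 1) and reward
    R(a) = -sum_i (a_i - target_i)^2.
    """
    a = mu + rng.standard_normal((n_samples, mu.size))
    per_dim = -(a - target) ** 2                 # reward term per dimension, shape (n, m)
    reward = per_dim.sum(axis=1, keepdims=True)  # total reward, shape (n, 1)
    score = a - mu                               # d/dmu_i log pi_i(a_i) for unit variance
    expected_own = -((mu - target) ** 2 + 1.0)   # E over a_i of per_dim[:, i], shape (m,)

    if baseline == "none":
        b = 0.0
    elif baseline == "constant":
        b = expected_own.sum()                   # E[R], independent of the action
    elif baseline == "factorized":
        # b_i keeps the sampled reward terms of the other dimensions and
        # averages out dimension i, so b_i never depends on a_i itself.
        b = (reward - per_dim) + expected_own
    else:
        raise ValueError(f"unknown baseline: {baseline}")
    return score * (reward - b)


def compare_baselines(n_samples=200_000, seed=0):
    """Return the exact gradient and (mean estimate, mean variance) per baseline."""
    mu = np.zeros(5)
    target = np.ones(5)
    exact = -2.0 * (mu - target)                 # gradient of E[R] with respect to mu
    results = {}
    for name in ("none", "constant", "factorized"):
        rng = np.random.default_rng(seed)        # same action samples for every baseline
        g = gradient_samples(mu, target, n_samples, name, rng)
        results[name] = (g.mean(axis=0), g.var(axis=0).mean())
    return exact, results


if __name__ == "__main__":
    exact, results = compare_baselines()
    print("exact gradient:", exact)
    for name, (mean, var) in results.items():
        print(f"{name:>10}: mean={np.round(mean, 2)} variance={var:.1f}")
```

In this toy setting all three estimators should agree with the exact gradient on average, while the per-sample variance should be smallest for the factorized baseline. Because b_i does not depend on a_i, its contribution to the gradient for dimension i has zero expectation, which is why the estimator stays unbiased. The closed-form baseline is only available because the toy reward is additive; in practice a baseline of this kind would have to be approximated or learned.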

Alignment and RLHF · Action-Dependent Factorized Baselines · Policy Gradient Methods · Variance Reduction