technique
Denoising Diffusion Policy Optimization
techniqueactive
denoising-diffusion-policy-optimization-c401f75b·1 events·first seen 28d agoAliases: Denoising Diffusion Policy Optimization
Co-occurring entities
More like this (12)
Denoising Diffusion Probabilistic Modelsdiffusion-based policyBeyond Fully Random Masking: Attention-Guided Denoising and Optimization for Diffusion Language ModelsAmbient Diffusion PolicyDiffusion PolicyDivergence Regularized Policy OptimizationKolmogorov Regression for Robust Diffusion PoliciesProximal Policy OptimizationOn-Policy Distillation (OPD)Pareto Optimal Policy OptimizationVector Policy Optimizationdiffusion-based inpainting
Recent events (1)
Finetune Stable Diffusion Models with DDPO via TRL
Hugging Face's TRL library adds support for DDPO (Denoising Diffusion Policy Optimization), enabling reinforcement learning-based finetuning of Stable Diffusion models. This extends TRL's RLHF tooling beyond language models to image generation, allowing reward-driven optimization of diffusion models. The post demonstrates practical usage of the new DDPO trainer within the TRL ecosystem.