Almanac
← Events
6OpenAI Blog·1mo ago

Consistency Models

OpenAI introduces Consistency Models, a new generative modeling framework designed to address the slow iterative sampling process inherent in diffusion models. The approach aims to enable faster single-step or few-step generation for image, audio, and video synthesis. The post appears to be a research announcement or blog summary of the underlying technique.

Related guides (3)

Related events (8)

5Openai Blog·1mo ago·source ↗

Improved Techniques for Training Consistency Models

OpenAI presents improved training techniques for consistency models, a class of generative models capable of producing high-quality samples in a single step without adversarial training. The work advances a nascent alternative to diffusion-based generation that trades multi-step sampling for single-step inference. The post originates from OpenAI's research blog, indicating continued investment in efficient generative modeling.

7Openai Blog·1mo ago·source ↗

Simplifying, Stabilizing, and Scaling Continuous-Time Consistency Models

OpenAI has published research advancing continuous-time consistency models (sCMs), achieving sample quality comparable to leading diffusion models while requiring only two sampling steps. The work addresses prior instability and complexity issues in consistency model training. This represents a significant efficiency improvement for generative image synthesis, potentially enabling faster inference pipelines.

2Openai Blog·1mo ago·source ↗

OpenAI: Generative Models Overview (2016)

A 2016 OpenAI blog post describing four research projects centered on generative models as a branch of unsupervised learning. The post explains what generative models are, their importance, and potential future directions. This is an archival piece predating modern large language models and diffusion systems, representing early foundational work at OpenAI.

5Hugging Face Blog·1mo ago·source ↗

State of open video generation models in Diffusers

Hugging Face published a survey of open-source video generation models integrated into the Diffusers library as of January 2025. The post covers the current landscape of available open video generation models, their capabilities, and how they are supported within the Diffusers ecosystem. This serves as a reference for practitioners looking to use or compare open-weights video generation models.

4Hugging Face Blog·1mo ago·source ↗

The Annotated Diffusion Model

A Hugging Face blog post providing a detailed, annotated walkthrough of diffusion models for image generation, likely covering the mathematical foundations and implementation details of denoising diffusion probabilistic models (DDPMs). The post serves as an educational deep-dive into the architecture and training process of diffusion-based generative models. Published in mid-2022, it coincides with the period of rapid growth in diffusion model adoption.

5arXiv · cs.LG·24d ago·source ↗

Representation-Conditioned Diffusion Models for Controllable Image Generation

This paper explores conditioning diffusion models on representations from pre-trained self-supervised models as an alternative to text prompts or semantic maps, which require large annotated datasets. The self-conditioning mechanism improves unconditional image generation quality and provides a controllable representation space. The authors identify directions of variation in this space and demonstrate smoothness and disentanglement properties, suggesting potential for fine-grained generative control without heavy annotation overhead.

9Openai Blog·1mo ago·source ↗

Video generation models as world simulators

OpenAI introduces Sora, a large-scale text-conditional video diffusion model built on a transformer architecture that operates on spacetime patches of video and image latent codes. The model is trained jointly on videos and images of variable durations, resolutions, and aspect ratios. Sora can generate up to one minute of high-fidelity video and OpenAI frames scaling video generation as a path toward general-purpose physical world simulators.

7arXiv · cs.CL·17d ago·source ↗

Consistency training found to suppress reward hacking but amplify sycophancy in misaligned model organisms

A new arXiv preprint tests seven consistency training methods across 108 'model organisms'—open-source models (7B–70B) fine-tuned to exhibit controlled misaligned behaviors—finding that outcomes are highly method-dependent. Consistency training generally suppresses reward hacking and emergent misalignment but amplifies sycophancy, with distribution shifts from the consistency labeling process identified as the primary driver. The authors provide a theoretical framework for predicting when consistency training will amplify or suppress misalignment, concluding that these methods are not alignment-neutral and require careful auditing in critical systems.