5Hugging Face Blog·1mo ago

Introducing Würstchen: Fast Diffusion for Image Generation

Hugging Face introduces Würstchen, a latent diffusion architecture designed for fast and efficient image generation. The model operates in a highly compressed latent space, reducing computational requirements significantly compared to standard diffusion models. It is being integrated into the Diffusers library, making it accessible for the broader community.

Related events (8)

4Hugging Face Blog·1mo ago

VQ-Diffusion: Vector Quantized Diffusion Models on Hugging Face

This Hugging Face blog post introduces VQ-Diffusion, a text-to-image generation approach that combines vector quantization with diffusion models. The method operates in a discrete latent space defined by a VQ-VAE codebook, applying the diffusion process to token sequences rather than continuous pixel or latent representations. The post likely covers integration into the Hugging Face diffusers ecosystem and demonstrates generation capabilities.
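
The discrete latent space described above can be illustrated with a minimal sketch of codebook lookup: each continuous latent vector is replaced by the index of its nearest codebook entry, and a discrete diffusion process would then operate on those indices. The function name `quantize`, the toy 2-D codebook, and the sample latents are illustrative assumptions, not taken from the post or from the Diffusers API.

```python
def quantize(vectors, codebook):
    """Map each continuous vector to the index of its nearest codebook entry."""
    def sq_dist(a, b):
        # Squared Euclidean distance between two equal-length vectors.
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return [
        min(range(len(codebook)), key=lambda k: sq_dist(v, codebook[k]))
        for v in vectors
    ]

# Hypothetical toy codebook with three 2-D entries (real codebooks are far larger).
codebook = [(0.0, 0.0), (1.0, 1.0), (-1.0, 1.0)]

# Hypothetical continuous latents, e.g. encoder outputs for three image patches.
latents = [(0.9, 1.2), (-0.8, 0.7), (0.1, -0.2)]

# Token sequence that a discrete diffusion model would be trained on.
tokens = quantize(latents, codebook)
print(tokens)
```

The resulting list of integer tokens is the kind of sequence the summary refers to when it says diffusion is applied to tokens rather than to continuous pixel or latent values.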

7Hugging Face Blog·1mo ago

Stable Diffusion with 🧨 Diffusers

Hugging Face published a blog post introducing Stable Diffusion integration with their Diffusers library, covering the model's architecture and how to run it using the open-source tooling. The post appeared at the time of Stable Diffusion's public release in August 2022, marking a significant moment in accessible text-to-image generation. It served as both a technical introduction and a practical guide for the community to adopt the model.

4Hugging Face Blog·1mo ago

The Annotated Diffusion Model

This Hugging Face blog post provides a detailed, annotated walkthrough of diffusion models for image generation, likely covering the mathematical foundations and implementation details of denoising diffusion probabilistic models (DDPMs). The post serves as an educational deep-dive into the architecture and training process of diffusion-based generative models. Published in mid-2022, it coincides with the period of rapid growth in diffusion model adoption.
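
As a rough illustration of the DDPM forward (noising) process that such a walkthrough typically covers, the sketch below samples a noised vector in closed form, x_t = sqrt(abar_t) * x_0 + sqrt(1 - abar_t) * eps, where abar_t is the cumulative product of (1 - beta). The linear beta schedule from 1e-4 to 0.02 over 1000 steps follows the original DDPM paper; the function names and the toy input are assumptions for illustration and are not taken from the post.

```python
import math
import random

def linear_betas(num_steps, start=1e-4, end=0.02):
    """Linear variance schedule, as used in the original DDPM paper."""
    return [start + (end - start) * t / (num_steps - 1) for t in range(num_steps)]

def alpha_bars(betas):
    """Cumulative product of (1 - beta_t): the fraction of signal kept at step t."""
    out, prod = [], 1.0
    for beta in betas:
        prod *= 1.0 - beta
        out.append(prod)
    return out

def q_sample(x0, t, abar, rng):
    """Sample x_t directly from x_0 without iterating over intermediate steps."""
    signal = math.sqrt(abar[t])
    noise = math.sqrt(1.0 - abar[t])
    return [signal * x + noise * rng.gauss(0.0, 1.0) for x in x0]

betas = linear_betas(1000)
abar = alpha_bars(betas)
rng = random.Random(0)  # fixed seed so the sketch is reproducible

# Hypothetical 3-element "image"; real inputs are image tensors.
x_t = q_sample([1.0, -1.0, 0.5], 500, abar, rng)
print(abar[0], abar[-1])
```

By the last step almost no signal remains, which is why sampling can start from pure Gaussian noise.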

5Hugging Face Blog·1mo ago

Diffusers welcomes Stable Diffusion 3.5 Large

Hugging Face's Diffusers library has added support for Stable Diffusion 3.5 Large, Stability AI's latest image generation model. The blog post covers integration details, usage patterns, and how to run the model within the Diffusers ecosystem. This represents a standard tooling integration announcement for a recently released frontier image generation model.

5Hugging Face Blog·1mo ago

Exploring Quantization Backends in Diffusers

Hugging Face published a technical overview of quantization backends available in the Diffusers library for image and video generation models. The post covers integration with multiple quantization frameworks (likely bitsandbytes, GGUF, torchao, and similar) and their trade-offs for diffusion model inference. It targets practitioners seeking to reduce memory footprint and improve throughput when deploying diffusion models.
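
To make the memory trade-off concrete, here is a back-of-the-envelope sketch of weight storage at different bit widths. The 12-billion parameter count is a hypothetical figure chosen for illustration, not one cited in the post, and the estimate ignores quantization overhead (scales, zero points), activations, and any layers kept in higher precision.

```python
def weight_memory_gb(num_params, bits_per_weight):
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9

# Hypothetical model size, for illustration only.
HYPOTHETICAL_PARAMS = 12_000_000_000

# Footprint at 16-bit (unquantized half precision), 8-bit, and 4-bit weights.
footprints = {
    bits: weight_memory_gb(HYPOTHETICAL_PARAMS, bits) for bits in (16, 8, 4)
}

for bits, gb in footprints.items():
    print(f"{bits}-bit weights: {gb:.1f} GB")
```

Under these assumptions, halving the bit width halves the weight footprint; actual savings depend on the backend and on which layers are quantized.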

6Hugging Face Blog·1mo ago

Diffusers welcomes FLUX-2

Hugging Face's Diffusers library has added support for FLUX-2, the successor to Black Forest Labs' FLUX image generation model. The blog post announces integration of the new model into the Diffusers ecosystem, enabling developers to use FLUX-2 through the standard Diffusers API. This represents a tooling and ecosystem update for one of the leading open-weights image generation model families.

5Hugging Face Blog·1mo ago

State of open video generation models in Diffusers

Hugging Face published a survey of open-source video generation models integrated into the Diffusers library as of January 2025. The post covers the current landscape of available open video generation models, their capabilities, and how they are supported within the Diffusers ecosystem. This serves as a reference for practitioners looking to use or compare open-weights video generation models.

4Hugging Face Blog·1mo ago

Welcome aMUSEd: Efficient Text-to-Image Generation

Hugging Face introduces aMUSEd, a text-to-image model based on the MUSE architecture that prioritizes efficiency over raw quality. The model is designed to be smaller and faster than diffusion-based alternatives, making it more accessible for deployment. It is released with integration into the Diffusers library.