
Diffusers
diffusers-60b85a07·22 events·first seen 1mo agoAliases: Diffusers
Co-occurring entities
More like this (12)
Recent events (22)
Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines
Hugging Face introduces Modular Diffusers, a new framework design that breaks diffusion pipelines into composable, interchangeable building blocks. The approach aims to make it easier to mix and match components such as encoders, denoisers, and decoders across different diffusion model architectures. This represents a significant refactor of the Diffusers library's pipeline abstraction, targeting researchers and developers who need flexible pipeline construction without rewriting boilerplate code.
State of open video generation models in Diffusers
Hugging Face published a survey of open-source video generation models integrated into the Diffusers library as of January 2025. The post covers the current landscape of available open video generation models, their capabilities, and how they are supported within the Diffusers ecosystem. This serves as a reference for practitioners looking to use or compare open-weights video generation models.
Memory-efficient Diffusion Transformers with Quanto and Diffusers
This Hugging Face blog post describes integrating the Quanto quantization library with the Diffusers framework to reduce memory requirements for diffusion transformer models. The approach enables running large image/video generation models on consumer-grade hardware by applying int8 and int4 quantization to model weights. The post covers practical implementation details and benchmarks showing memory savings for models like Flux and others in the diffusion transformer family.
Exploring Quantization Backends in Diffusers
Hugging Face published a technical overview of quantization backends available in the Diffusers library for image and video generation models. The post covers integration with multiple quantization frameworks (likely bitsandbytes, GGUF, torchao, and similar) and their trade-offs for diffusion model inference. It targets practitioners seeking to reduce memory footprint and improve throughput when deploying diffusion models.
Train your ControlNet with diffusers
This Hugging Face blog post provides a technical guide for training ControlNet models using the diffusers library. It covers the process of conditioning diffusion models on additional inputs such as edge maps, depth maps, or other spatial signals to enable fine-grained image generation control. The post targets practitioners looking to implement custom ControlNet pipelines on their own datasets.
Ethical Guidelines for Developing the Diffusers Library
Hugging Face published a set of ethical guidelines governing the development of its Diffusers library, a widely-used open-source toolkit for diffusion-based generative models. The guidelines address responsible development practices, content moderation considerations, and the handling of potentially harmful use cases. This represents an early attempt by a major ML tooling provider to formalize ethics policies at the library/infrastructure level rather than solely at the model or application layer.
What's new in Diffusers? — Hugging Face Diffusers Library Second Month Update
Hugging Face published a blog post summarizing new features and updates added to the Diffusers library in its second month of development. The post covers new pipelines, model integrations, and tooling improvements for diffusion-based generative image models. This represents an early-stage ecosystem update for one of the primary open-source libraries supporting text-to-image and related diffusion model workflows.
Diffusers welcomes Stable Diffusion 3.5 Large
Hugging Face's Diffusers library has added support for Stable Diffusion 3.5 Large, Stability AI's latest image generation model. The blog post covers integration details, usage patterns, and how to run the model within the Diffusers ecosystem. This represents a standard tooling integration announcement for a recently released frontier image generation model.
Fast LoRA inference for Flux with Diffusers and PEFT
Hugging Face published a technical blog post detailing optimizations for LoRA inference speed with the Flux image generation model using the Diffusers and PEFT libraries. The post covers techniques to accelerate adapter loading and inference throughput for diffusion models. This is relevant to practitioners deploying fine-tuned image generation models in production or research settings.
Diffusers welcomes Stable Diffusion 3
Hugging Face's Diffusers library adds support for Stable Diffusion 3, enabling users to run Stability AI's latest text-to-image model through the standard Diffusers API. The post covers integration details, usage patterns, and memory optimization techniques for running SD3 locally. This marks the open-weights availability of SD3 through a major ML tooling ecosystem.
Stable Diffusion with 🧨 Diffusers
Hugging Face published a blog post introducing Stable Diffusion integration with their Diffusers library, covering the model's architecture and how to run it using the open-source tooling. The post appeared at the time of Stable Diffusion's public release in August 2022, marking a significant moment in accessible text-to-image generation. It served as both a technical introduction and a practical guide for the community to adopt the model.
LoRA Training Scripts of the World, Unite!
Hugging Face published a blog post consolidating and comparing advanced LoRA fine-tuning scripts for Stable Diffusion XL, covering techniques such as pivotal tuning, custom captions, and various regularization strategies. The post aims to unify fragmented community training approaches into a more coherent set of best practices. It serves as a practical guide for practitioners fine-tuning SDXL models with LoRA adapters.
Efficient Controllable Generation for SDXL with T2I-Adapters
Hugging Face published a blog post detailing T2I-Adapters for Stable Diffusion XL (SDXL), a lightweight conditioning mechanism that enables controllable image generation without full fine-tuning. The approach allows users to guide SDXL outputs using structural signals such as depth maps, edge detection, and pose estimation. T2I-Adapters offer a parameter-efficient alternative to ControlNet for the SDXL architecture, with integration into the Diffusers library.
AudioLDM 2, but faster ⚡️
Hugging Face published a blog post on AudioLDM 2, a latent diffusion model for audio generation, with a focus on inference speed improvements. The post likely covers integration into the Diffusers library and optimization techniques for faster audio synthesis. AudioLDM 2 supports text-to-audio, text-to-music, and text-to-speech generation tasks.
Welcome aMUSEd: Efficient Text-to-Image Generation
Hugging Face introduces aMUSEd, a text-to-image model based on the MUSE architecture that prioritizes efficiency over raw quality. The model is designed to be smaller and faster than diffusion-based alternatives, making it more accessible for deployment. It is released with integration into the Diffusers library.
SDXL in 4 Steps with Latent Consistency LoRAs
Hugging Face demonstrates combining Latent Consistency Models (LCMs) with LoRA adapters to enable high-quality image generation with Stable Diffusion XL in as few as 4 inference steps. This approach dramatically reduces the number of diffusion steps required compared to standard SDXL, lowering inference latency and compute cost. The technique leverages consistency distillation applied via lightweight LoRA weights, making it accessible without full model retraining.
Exploring Simple Optimizations for SDXL
This Hugging Face blog post explores practical optimization techniques for Stable Diffusion XL (SDXL) inference. It covers methods to improve throughput and reduce memory usage when running SDXL, targeting practitioners deploying the model. The content is oriented toward applied inference efficiency rather than novel research.
Introducing Würstchen: Fast Diffusion for Image Generation
Hugging Face introduces Würstchen, a latent diffusion architecture designed for fast and efficient image generation. The model operates in a highly compressed latent space, reducing computational requirements significantly compared to standard diffusion models. It is being integrated into the Diffusers library, making it accessible for the broader community.
Using LoRA for Efficient Stable Diffusion Fine-Tuning
This Hugging Face blog post explains how Low-Rank Adaptation (LoRA) can be applied to fine-tune Stable Diffusion models efficiently. LoRA reduces the number of trainable parameters by decomposing weight updates into low-rank matrices, enabling fine-tuning on consumer hardware with significantly less memory. The post covers practical implementation details using the diffusers library.
VQ-Diffusion: Vector Quantized Diffusion Models on Hugging Face
This Hugging Face blog post introduces VQ-Diffusion, a text-to-image generation approach that combines vector quantization with diffusion models. The method operates in a discrete latent space defined by a VQ-VAE codebook, applying the diffusion process to token sequences rather than continuous pixel or latent representations. The post likely covers integration into the Hugging Face diffusers ecosystem and demonstrates generation capabilities.
Stable Diffusion in JAX / Flax
Hugging Face published a blog post demonstrating Stable Diffusion running in JAX/Flax, enabling efficient inference on TPU hardware. The post covers the technical implementation of diffusion pipelines using Flax's functional programming model. This represents an early effort to bring high-performance image generation to Google's TPU ecosystem via the Diffusers library.
Accelerating Stable Diffusion XL Inference with JAX on Cloud TPU v5e
Hugging Face published a technical blog post detailing how to accelerate Stable Diffusion XL inference using JAX on Google Cloud TPU v5e hardware. The post covers the integration of JAX-based diffusion pipelines with TPU v5e, demonstrating performance gains from hardware-software co-optimization. This represents a practical deployment pattern for large image generation models on non-GPU accelerators.