Almanac
← Events
4Hugging Face Blog·1mo ago

Faster Stable Diffusion with Core ML on iPhone, iPad, and Mac

Hugging Face published a blog post detailing optimizations for running Stable Diffusion models via Core ML on Apple devices including iPhone, iPad, and Mac. The post covers techniques to accelerate on-device inference using Apple's neural engine and Core ML framework. This represents progress in deploying capable diffusion models at the edge without cloud dependency.

Related guides (3)

Related events (8)

5Hugging Face Blog·1mo ago·source ↗

Using Stable Diffusion with Core ML on Apple Silicon

Hugging Face published a guide on running Stable Diffusion models via Apple's Core ML framework on Apple Silicon hardware. The post covers converting diffusion model weights to Core ML format and integrating them into the Diffusers library for on-device inference. This represents an early effort to enable efficient local image generation on consumer Apple hardware without requiring cloud GPU resources.

4Hugging Face Blog·1mo ago·source ↗

Swift Diffusers: Fast Stable Diffusion for Mac

Hugging Face published a blog post introducing Swift Diffusers, a native macOS/iOS application for running Stable Diffusion models locally on Apple Silicon hardware. The post covers optimizations leveraging Apple's Core ML framework to accelerate inference on Mac. This represents an effort to bring on-device diffusion model inference to consumer Apple hardware without cloud dependency.

5Hugging Face Blog·1mo ago·source ↗

Stable Diffusion XL on Mac with Advanced Core ML Quantization

Hugging Face details the process of running Stable Diffusion XL (SDXL) on Apple Silicon Macs using Core ML with advanced quantization techniques. The post covers how quantization reduces model size and memory requirements to make SDXL feasible on consumer Mac hardware. This represents a practical deployment advance for running large diffusion models at the edge on Apple devices.

4Hugging Face Blog·1mo ago·source ↗

Optimizing Stable Diffusion for Intel CPUs with NNCF and Hugging Face Optimum

This Hugging Face blog post details techniques for optimizing Stable Diffusion inference on Intel CPUs using Neural Network Compression Framework (NNCF) and the Optimum library. The workflow covers quantization and other compression methods to reduce latency and memory footprint on CPU hardware. This is relevant to the inference-economics and enterprise-deployment threads as it addresses running diffusion models without dedicated GPU hardware.

4Hugging Face Blog·1mo ago·source ↗

Accelerating Stable Diffusion Inference on Intel CPUs

This Hugging Face blog post details techniques for optimizing Stable Diffusion inference on Intel CPUs, likely covering quantization, operator fusion, and Intel-specific hardware acceleration libraries. The post addresses the practical challenge of running diffusion models on CPU hardware without dedicated GPUs. This is relevant to inference economics and enterprise deployment patterns where GPU availability is constrained.

4Hugging Face Blog·1mo ago·source ↗

Fine-tuning Stable Diffusion models on Intel CPUs

This Hugging Face blog post describes a workflow for fine-tuning Stable Diffusion image generation models on Intel CPUs rather than GPUs. It covers the tooling and optimizations required to make CPU-based diffusion model training practical, relevant to inference-economics and hardware diversification trends. The post targets practitioners looking to reduce dependency on GPU hardware for generative model fine-tuning.

6Hugging Face Blog·1mo ago·source ↗

Diffusers welcomes Stable Diffusion 3

Hugging Face's Diffusers library adds support for Stable Diffusion 3, enabling users to run Stability AI's latest text-to-image model through the standard Diffusers API. The post covers integration details, usage patterns, and memory optimization techniques for running SD3 locally. This marks the open-weights availability of SD3 through a major ML tooling ecosystem.

5Hugging Face Blog·1mo ago·source ↗

WWDC 24: Running Mistral 7B with Core ML

This Hugging Face blog post covers running Mistral 7B on Apple devices using Core ML, likely demonstrated or announced around WWDC 2024. It addresses on-device inference of a 7B parameter open-weights model using Apple's ML framework. This represents a practical deployment pattern for running capable open-weights LLMs locally on Apple Silicon hardware.