Step 10 of 10 in Long Context EvolutionFinish path →

Guide · Beginner

Hugging Face: The Home of Open-Source AI

Hugging FaceBeginneractive·v3 · live·generated 6d ago

Part of these paths

Agent and Tool Ecosystem · Step 2 of 9
Enterprise Deployment Patterns · Step 7 of 12
Evaluation and Benchmarking · Step 2 of 10
Frontier Model Releases · Step 10 of 10
Inference Economics · Step 3 of 9
Long Context Evolution · Step 10 of 10
Multimodal Progress · Step 2 of 7
Open Weights Progress · Step 1 of 7
Open weights vs. the closed frontier · Step 2 of 7
Regulatory Developments · Step 8 of 9
Training Infrastructure · Step 4 of 8

TL;DRHugging Face is the platform where the open-source AI world meets — a hub where researchers, companies, and hobbyists share models, datasets, and tools freely. It has grown from a model-hosting service into the de facto distribution layer for open-weights AI, and is now pushing that mission into robotics and local inference infrastructure.

Key takeaways

Hugging Face hosts landmark open-weights releases from Meta (Llama 2, 3, 3.1, 3.2, 4), Google (Gemma 3, 4), Alibaba (Qwen family), DeepSeek, Mistral, NVIDIA, and OpenAI's GPT OSS — making it the broadest single distribution point for frontier open models.
It acquired Pollen Robotics in April 2025 to extend its open-source mission into physical robots.
In February 2026, it brought llama.cpp and GGML — the libraries that power most local AI inference — under its umbrella to secure their long-term development.
Its own Transformers library reached version 5, a major update focused on simplified model definitions that underpin much of the ML ecosystem.
It launched Open-R1 in January 2025, a fully open community effort to reproduce DeepSeek-R1's reasoning training pipeline.
Stanford's 28-trillion-pixel GPIC image corpus — one of the largest permissively licensed visual datasets — is hosted on Hugging Face, illustrating its role as a dataset home too.

What Hugging Face is

Hugging Face is an open-source AI platform — think of it as a combination of GitHub and an app store, but specifically for AI models, datasets, and tools. Anyone can upload a model, anyone can download it, and the whole thing is searchable and free to browse. That openness has made it the default distribution point for the open-weights AI world: when a lab releases a model they want the public to use, Hugging Face is almost always where it lands first.

Why it matters

Most of the biggest names in AI — Meta, Google, Alibaba, Mistral, DeepSeek, NVIDIA, and even OpenAI — publish their open models on Hugging Face. That means if you want to run, study, or build on top of a frontier AI model without paying a subscription, Hugging Face is your starting point. It's also where the research community shares datasets: Stanford's GPIC image corpus, for example — roughly 28 trillion pixels of permissively licensed images — is hosted there.

Beyond hosting, Hugging Face builds and maintains the Transformers library, one of the most widely used software packages in machine learning. Version 5, released in late 2025, focused on making model definitions simpler and cleaner — a change that ripples out to every researcher and developer who builds on top of it.

A tour of what lives there

The breadth of what Hugging Face hosts is striking. A partial list from recent events alone:

Meta's Llama family — Llama 2, 3, 3.1 (up to 405B parameters), 3.2 (with vision and edge variants), and Llama 4 (Maverick and Scout, both multimodal mixture-of-experts models)
Google's Gemma — Gemma 3 and Gemma 4, both multimodal and on-device capable
Alibaba's Qwen series — Qwen2.5, Qwen2.5-VL (vision-language), Qwen2.5-Omni (text + image + audio + video), QwQ-32B (reasoning), Qwen3, and Qwen3 Embedding models
DeepSeek's V-series — V3.1, V3.2, V4-Flash, V4-Pro, and their base variants
Mistral models — Voxtral (speech understanding), Voxtral Transcribe 2, Mistral Small 3, and Mistral 3
NVIDIA Cosmos 3 — an open omni-model for robotics and physical AI
OpenAI's GPT OSS — a notable shift for a company historically known for keeping its models closed

Beyond hosting: Hugging Face's own moves

Hugging Face isn't just a passive shelf. It has been actively expanding what "open AI" means:

Open-R1 (January 2025): When DeepSeek released its R1 reasoning model, the training recipe wasn't fully public. Hugging Face launched Open-R1, a community project to reproduce the entire pipeline — data, training, and evaluation — using open-source components, so anyone could study and build on it.

Pollen Robotics acquisition (April 2025): Hugging Face bought a French open-source robotics company and announced plans to sell physical robots. This extends the platform's philosophy — open, accessible, community-driven — into hardware and embodied AI.

GGML and llama.cpp (February 2026): These two libraries are the engine behind most local AI inference — the software that lets people run large models on a laptop or home server without a cloud subscription. Hugging Face brought them under its umbrella to ensure they stay maintained and funded long-term.

Who uses it and how

Hugging Face serves several overlapping audiences. Researchers use it to share and reproduce work. Developers use it to grab pre-trained models and fine-tune them for specific tasks. Companies use it as a distribution channel for open-weights releases. And hobbyists use it to run models locally, often via llama.cpp — now a Hugging Face project.

Where it's heading

The pattern across these events points in a clear direction: Hugging Face is consolidating the infrastructure of open AI. It already hosts the models; now it owns the local inference stack (llama.cpp), is building toward physical robots, and maintains the most widely used model-loading library (Transformers). The platform is becoming less of a repository and more of a full ecosystem — the connective tissue that holds the open-weights world together.

Hugging Face as the open-weights ecosystem hub

Timeline

FAQ

Do I need to pay to use Hugging Face?

Most models and datasets on Hugging Face are free to download and use. The platform also offers paid hosting and compute services, but the core open-weights library is publicly accessible.

What is the Transformers library?

It's Hugging Face's flagship open-source software package that makes it easy to load, run, and fine-tune AI models. Version 5 was released in late 2025 with a focus on simpler model definitions.

Why did Hugging Face buy a robotics company?

Hugging Face acquired Pollen Robotics in April 2025 to extend its open-source AI mission into physical hardware, aiming to make open-source robots as accessible as open-source models.

What is llama.cpp and why does it matter that Hugging Face acquired it?

llama.cpp (and its underlying library GGML) is the software most people use to run large AI models on a personal computer or laptop without cloud services. Hugging Face brought it in-house in February 2026 to ensure its long-term maintenance and funding.

Is Hugging Face only for text AI models?

No — the platform hosts vision-language models, speech models, image datasets, robotics models, and embedding models, reflecting the full breadth of modern AI research.

Stay current

Call Me Almanac pairs the week's AI news with guides like this one — Midweek & Sunday.

Versions

v3live6d ago
v2superseded11d ago
v1superseded16d ago

Related guides (4)

Hugging Face

Hugging Face: The Infrastructure Layer of the Open AI Ecosystem

Read asIn-depth

Anthropic

Anthropic: The AI Safety Company at the Center of the Frontier

Read asBeginner In-depth

ChatGPT

ChatGPT: The AI Assistant That Changed How the World Talks to Computers

Read asBeginner In-depth

MambaConcept

Mamba: State Space Models as a Practical Alternative to Transformers

Read asIn-depth

More on Hugging Face (6)

7Hugging Face Blog·1mo ago·source ↗

Transformers v5: Simple model definitions powering the AI ecosystem

Hugging Face has announced Transformers v5, a major version update to its flagship open-source library. The release focuses on simplified model definitions and architectural improvements to the codebase. As one of the most widely used ML libraries in the ecosystem, this update has broad implications for researchers and practitioners building on top of the Transformers framework.

Open Weights Progress Inference Economics Transformers Hugging Face +1 more

7Hugging Face Blog·1mo ago·source ↗

Hugging Face Acquires Pollen Robotics to Sell Open-Source Robots

Hugging Face has announced the acquisition of Pollen Robotics, a French open-source robotics company, with plans to sell physical robots. This move extends Hugging Face's open-source AI platform strategy into embodied AI and physical hardware. The acquisition signals a strategic push by Hugging Face to become a hub for open-source robotics development alongside its existing ML model and dataset ecosystem.

Open Weights Progress Enterprise Deployment Patterns Hugging Face Pollen Robotics +1 more

6Hugging Face Blog·1mo ago·source ↗

TRL v1.0: Post-Training Library Built to Move with the Field

Hugging Face has released TRL v1.0, a major milestone for its post-training library focused on reinforcement learning from human feedback and related alignment techniques. The release signals a stabilization of the API and feature set after iterative development tracking the rapidly evolving post-training landscape. TRL is widely used in the open-source community for fine-tuning and aligning language models using methods such as PPO, DPO, and GRPO.

Open Weights Progress Agent and Tool Ecosystem GRPO PPO DPO +3 more

4Hugging Face Blog·1mo ago·source ↗

Introducing Storage Buckets on the Hugging Face Hub

Hugging Face is launching Storage Buckets, a new feature on the Hub that provides object storage capabilities for AI/ML workflows. This expands the Hub's infrastructure offerings beyond model and dataset repositories, enabling users to store arbitrary files and artifacts. The feature targets teams managing large-scale AI pipelines who need integrated storage alongside their models and datasets.

Enterprise Deployment Patterns Agent and Tool Ecosystem Hugging Face Storage Buckets

5Hugging Face Blog·1mo ago·source ↗

Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines

Hugging Face introduces Modular Diffusers, a new framework design that breaks diffusion pipelines into composable, interchangeable building blocks. The approach aims to make it easier to mix and match components such as encoders, denoisers, and decoders across different diffusion model architectures. This represents a significant refactor of the Diffusers library's pipeline abstraction, targeting researchers and developers who need flexible pipeline construction without rewriting boilerplate code.

Agent and Tool Ecosystem Multimodal Progress Hugging Face Modular Diffusers Diffusers

8Hugging Face Blog·1mo ago·source ↗

GGML and llama.cpp Join Hugging Face to Ensure Long-Term Progress of Local AI

GGML and llama.cpp, the foundational open-source libraries enabling efficient local inference of large language models, are joining Hugging Face. This move is intended to secure long-term development and sustainability of the projects that underpin much of the local/on-device AI ecosystem. The acquisition or integration represents a significant consolidation of key open-weights inference infrastructure under the Hugging Face umbrella.

Open Weights Progress Inference Economics Georgi Gerganov llama.cpp Hugging Face +2 more