Almanac
Guide · Beginner

Hugging Face: The Home of Open-Source AI

Hugging FaceBeginneractive·v3 · live·generated 6d ago

Part of these paths

TL;DRHugging Face is the platform where the open-source AI world meets — a hub where researchers, companies, and hobbyists share models, datasets, and tools freely. It has grown from a model-hosting service into the de facto distribution layer for open-weights AI, and is now pushing that mission into robotics and local inference infrastructure.

Key takeaways

  • Hugging Face hosts landmark open-weights releases from Meta (Llama 2, 3, 3.1, 3.2, 4), Google (Gemma 3, 4), Alibaba (Qwen family), DeepSeek, Mistral, NVIDIA, and OpenAI's GPT OSS — making it the broadest single distribution point for frontier open models.
  • It acquired Pollen Robotics in April 2025 to extend its open-source mission into physical robots.
  • In February 2026, it brought llama.cpp and GGML — the libraries that power most local AI inference — under its umbrella to secure their long-term development.
  • Its own Transformers library reached version 5, a major update focused on simplified model definitions that underpin much of the ML ecosystem.
  • It launched Open-R1 in January 2025, a fully open community effort to reproduce DeepSeek-R1's reasoning training pipeline.
  • Stanford's 28-trillion-pixel GPIC image corpus — one of the largest permissively licensed visual datasets — is hosted on Hugging Face, illustrating its role as a dataset home too.

What Hugging Face is

Hugging Face is an open-source AI platform — think of it as a combination of GitHub and an app store, but specifically for AI models, datasets, and tools. Anyone can upload a model, anyone can download it, and the whole thing is searchable and free to browse. That openness has made it the default distribution point for the open-weights AI world: when a lab releases a model they want the public to use, Hugging Face is almost always where it lands first.

Why it matters

Most of the biggest names in AI — Meta, Google, Alibaba, Mistral, DeepSeek, NVIDIA, and even OpenAI — publish their open models on Hugging Face. That means if you want to run, study, or build on top of a frontier AI model without paying a subscription, Hugging Face is your starting point. It's also where the research community shares datasets: Stanford's GPIC image corpus, for example — roughly 28 trillion pixels of permissively licensed images — is hosted there.

Beyond hosting, Hugging Face builds and maintains the Transformers library, one of the most widely used software packages in machine learning. Version 5, released in late 2025, focused on making model definitions simpler and cleaner — a change that ripples out to every researcher and developer who builds on top of it.

A tour of what lives there

The breadth of what Hugging Face hosts is striking. A partial list from recent events alone:

  • Meta's Llama family — Llama 2, 3, 3.1 (up to 405B parameters), 3.2 (with vision and edge variants), and Llama 4 (Maverick and Scout, both multimodal mixture-of-experts models)
  • Google's Gemma — Gemma 3 and Gemma 4, both multimodal and on-device capable
  • Alibaba's Qwen series — Qwen2.5, Qwen2.5-VL (vision-language), Qwen2.5-Omni (text + image + audio + video), QwQ-32B (reasoning), Qwen3, and Qwen3 Embedding models
  • DeepSeek's V-series — V3.1, V3.2, V4-Flash, V4-Pro, and their base variants
  • Mistral models — Voxtral (speech understanding), Voxtral Transcribe 2, Mistral Small 3, and Mistral 3
  • NVIDIA Cosmos 3 — an open omni-model for robotics and physical AI
  • OpenAI's GPT OSS — a notable shift for a company historically known for keeping its models closed

Beyond hosting: Hugging Face's own moves

Hugging Face isn't just a passive shelf. It has been actively expanding what "open AI" means:

Open-R1 (January 2025): When DeepSeek released its R1 reasoning model, the training recipe wasn't fully public. Hugging Face launched Open-R1, a community project to reproduce the entire pipeline — data, training, and evaluation — using open-source components, so anyone could study and build on it.

Pollen Robotics acquisition (April 2025): Hugging Face bought a French open-source robotics company and announced plans to sell physical robots. This extends the platform's philosophy — open, accessible, community-driven — into hardware and embodied AI.

GGML and llama.cpp (February 2026): These two libraries are the engine behind most local AI inference — the software that lets people run large models on a laptop or home server without a cloud subscription. Hugging Face brought them under its umbrella to ensure they stay maintained and funded long-term.

Who uses it and how

Hugging Face serves several overlapping audiences. Researchers use it to share and reproduce work. Developers use it to grab pre-trained models and fine-tune them for specific tasks. Companies use it as a distribution channel for open-weights releases. And hobbyists use it to run models locally, often via llama.cpp — now a Hugging Face project.

Where it's heading

The pattern across these events points in a clear direction: Hugging Face is consolidating the infrastructure of open AI. It already hosts the models; now it owns the local inference stack (llama.cpp), is building toward physical robots, and maintains the most widely used model-loading library (Transformers). The platform is becoming less of a repository and more of a full ecosystem — the connective tissue that holds the open-weights world together.

Hugging Face as the open-weights ecosystem hub

Timeline

  1. BLOOM released — 176B open multilingual model co-developed with BigScience

  2. Llama 2 lands on Hugging Face, expanding accessible open-weights frontier models

  3. Open-R1 launched: fully open reproduction of DeepSeek-R1's training pipeline

  4. Pollen Robotics acquired — open-source AI mission extended into physical robots

  5. Transformers v5 released with simplified model definitions

  6. GGML and llama.cpp join Hugging Face to secure local AI inference infrastructure

Related topics

MetaGoogleQwenDeepSeek V4NVIDIAHugging Face TransformersTransformers

FAQ

Do I need to pay to use Hugging Face?

Most models and datasets on Hugging Face are free to download and use. The platform also offers paid hosting and compute services, but the core open-weights library is publicly accessible.

What is the Transformers library?

It's Hugging Face's flagship open-source software package that makes it easy to load, run, and fine-tune AI models. Version 5 was released in late 2025 with a focus on simpler model definitions.

Why did Hugging Face buy a robotics company?

Hugging Face acquired Pollen Robotics in April 2025 to extend its open-source AI mission into physical hardware, aiming to make open-source robots as accessible as open-source models.

What is llama.cpp and why does it matter that Hugging Face acquired it?

llama.cpp (and its underlying library GGML) is the software most people use to run large AI models on a personal computer or laptop without cloud services. Hugging Face brought it in-house in February 2026 to ensure its long-term maintenance and funding.

Is Hugging Face only for text AI models?

No — the platform hosts vision-language models, speech models, image datasets, robotics models, and embedding models, reflecting the full breadth of modern AI research.

Stay current

Call Me Almanac pairs the week's AI news with guides like this one — Midweek & Sunday.

Versions

  • v3live6d ago
  • v2superseded11d ago
  • v1superseded16d ago

Related guides (4)

More on Hugging Face (6)

7Hugging Face Blog·1mo ago·source ↗

Transformers v5: Simple model definitions powering the AI ecosystem

Hugging Face has announced Transformers v5, a major version update to its flagship open-source library. The release focuses on simplified model definitions and architectural improvements to the codebase. As one of the most widely used ML libraries in the ecosystem, this update has broad implications for researchers and practitioners building on top of the Transformers framework.

7Hugging Face Blog·1mo ago·source ↗

Hugging Face Acquires Pollen Robotics to Sell Open-Source Robots

Hugging Face has announced the acquisition of Pollen Robotics, a French open-source robotics company, with plans to sell physical robots. This move extends Hugging Face's open-source AI platform strategy into embodied AI and physical hardware. The acquisition signals a strategic push by Hugging Face to become a hub for open-source robotics development alongside its existing ML model and dataset ecosystem.

6Hugging Face Blog·1mo ago·source ↗

TRL v1.0: Post-Training Library Built to Move with the Field

Hugging Face has released TRL v1.0, a major milestone for its post-training library focused on reinforcement learning from human feedback and related alignment techniques. The release signals a stabilization of the API and feature set after iterative development tracking the rapidly evolving post-training landscape. TRL is widely used in the open-source community for fine-tuning and aligning language models using methods such as PPO, DPO, and GRPO.

4Hugging Face Blog·1mo ago·source ↗

Introducing Storage Buckets on the Hugging Face Hub

Hugging Face is launching Storage Buckets, a new feature on the Hub that provides object storage capabilities for AI/ML workflows. This expands the Hub's infrastructure offerings beyond model and dataset repositories, enabling users to store arbitrary files and artifacts. The feature targets teams managing large-scale AI pipelines who need integrated storage alongside their models and datasets.

5Hugging Face Blog·1mo ago·source ↗

Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines

Hugging Face introduces Modular Diffusers, a new framework design that breaks diffusion pipelines into composable, interchangeable building blocks. The approach aims to make it easier to mix and match components such as encoders, denoisers, and decoders across different diffusion model architectures. This represents a significant refactor of the Diffusers library's pipeline abstraction, targeting researchers and developers who need flexible pipeline construction without rewriting boilerplate code.

8Hugging Face Blog·1mo ago·source ↗

GGML and llama.cpp Join Hugging Face to Ensure Long-Term Progress of Local AI

GGML and llama.cpp, the foundational open-source libraries enabling efficient local inference of large language models, are joining Hugging Face. This move is intended to secure long-term development and sustainability of the projects that underpin much of the local/on-device AI ecosystem. The acquisition or integration represents a significant consolidation of key open-weights inference infrastructure under the Hugging Face umbrella.