5Hugging Face Blog·1mo ago

One Year Since the "DeepSeek Moment"

A Hugging Face retrospective marking one year since the DeepSeek moment, which shook assumptions about AI development costs and open-weights competitiveness. The piece likely reflects on how DeepSeek's efficient training approach influenced the broader AI landscape, open-weights progress, and inference economics over the past year. Published on the anniversary of the original release, it offers industry analysis from a major open-source AI platform perspective.

Frontier Model Releases Open Weights Progress Inference Economics DeepSeek V4 Hugging Face

Related guides (3)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

DeepSeek V4

DeepSeek V4: The Open-Weights Giant Reshaping AI Economics

Read asBeginner In-depth

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

Related events (8)

5Hugging Face Blog·1mo ago·source ↗

The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+

Hugging Face publishes a retrospective and forward-looking commentary marking one year since the 'DeepSeek moment,' examining how DeepSeek's open-weight releases reshaped the global open-source AI ecosystem. The piece analyzes the downstream effects on model development, inference economics, and competitive dynamics between open and closed AI labs. It situates these developments within a broader 'AI+' framing, suggesting a new phase of AI integration across industries.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face +2 more

5Hugging Face Blog·1mo ago·source ↗

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

A Hugging Face blog post reflecting on one year since the 'DeepSeek moment' examines the architectural decisions shaping China's open-source AI ecosystem. The piece analyzes how Chinese labs have built upon and diverged from DeepSeek's design choices in the intervening year. It situates these developments within the broader context of open-weights model progress and competitive dynamics between Chinese and Western AI development.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face +1 more

6Hugging Face Blog·1mo ago·source ↗

Open-R1: Update #1 — Open Reproduction of DeepSeek-R1

Hugging Face's Open-R1 project provides a first progress update on its open reproduction of DeepSeek-R1, a reasoning-focused language model. The update covers early training runs, dataset construction, and evaluation results aimed at replicating DeepSeek-R1's chain-of-thought reasoning capabilities. This effort is part of the broader open-weights community push to reproduce frontier reasoning models transparently.

Frontier Model Releases Evaluation and Benchmarking DeepSeek V4 Open R1 Hugging Face +1 more

5Hugging Face Blog·1mo ago·source ↗

Open R1: Update #4

Hugging Face's Open R1 project releases its fourth progress update on the open reproduction of DeepSeek-R1. The update likely covers training progress, dataset releases, and evaluation results for the open-weights reasoning model effort. This project is a community-driven attempt to replicate and open-source the techniques behind DeepSeek-R1's chain-of-thought reasoning capabilities.

Frontier Model Releases Evaluation and Benchmarking DeepSeek V4 Open R1 Hugging Face +1 more

7Hugging Face Blog·1mo ago·source ↗

Open-R1: a fully open reproduction of DeepSeek-R1

Hugging Face announced Open-R1, a community effort to fully reproduce DeepSeek-R1's training pipeline using open-source components. The project aims to replicate the data, training, and evaluation stages of DeepSeek-R1, making the entire process transparent and accessible. This follows significant interest in DeepSeek-R1's reinforcement-learning-based reasoning approach and addresses the lack of fully open reproduction of that methodology.

Frontier Model Releases Evaluation and Benchmarking DeepSeek V4 Open R1 Hugging Face +2 more

7Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V3.2 on Hugging Face

DeepSeek has released DeepSeek-V3.2, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, is endpoints-compatible, and has accumulated over 3.6 million downloads and 1,446 likes, indicating significant community uptake. This appears to be a successor to DeepSeek-V3, continuing the lab's competitive open-weights model series.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face

6Hacker News·9d ago·source ↗

Hugging Face open reproduction of DeepSeek-R1

Hugging Face has published an open reproduction of DeepSeek-R1, the reasoning-focused language model, on GitHub. The project aims to replicate DeepSeek-R1's training methodology and capabilities in an open-weights setting. This contributes to the broader effort to make frontier reasoning model techniques accessible to the research community.

Frontier Model Releases Open Weights Progress DeepSeek V4 Open R1 Hugging Face

7The Batch·19d ago·source ↗

Data Points: OpenAI and Microsoft sever their exclusive relationship

This edition of The Batch covers several major AI industry developments: OpenAI has revised its partnership with Microsoft, ending exclusivity while retaining Microsoft as primary cloud partner through 2032 and gaining freedom to deploy on AWS and Google Cloud. DeepSeek released V4 model weights featuring 1M-token context and Huawei Ascend chip optimization, though it trails leading open and closed models on aggregate benchmarks. Google and Amazon are deepening investments in Anthropic with up to $40B and $25B respectively in funding-for-compute deals, and an agentic AI system autonomously designed a functional RISC-V CPU from a 219-word spec in 12 hours.

Training Infrastructure Frontier Model Releases Google Cloud Google TPU knowledge distillation +25 more