Almanac
← Events
6Hacker News (AI-filtered, score >= 200)·9d ago

Hugging Face open reproduction of DeepSeek-R1

Hugging Face has published an open reproduction of DeepSeek-R1, the reasoning-focused language model, on GitHub. The project aims to replicate DeepSeek-R1's training methodology and capabilities in an open-weights setting. This contributes to the broader effort to make frontier reasoning model techniques accessible to the research community.

Related guides (3)

Related events (8)

6Hugging Face Blog·1mo ago·source ↗

Open-R1: Update #1 — Open Reproduction of DeepSeek-R1

Hugging Face's Open-R1 project provides a first progress update on its open reproduction of DeepSeek-R1, a reasoning-focused language model. The update covers early training runs, dataset construction, and evaluation results aimed at replicating DeepSeek-R1's chain-of-thought reasoning capabilities. This effort is part of the broader open-weights community push to reproduce frontier reasoning models transparently.

7Hugging Face Blog·1mo ago·source ↗

Open-R1: a fully open reproduction of DeepSeek-R1

Hugging Face announced Open-R1, a community effort to fully reproduce DeepSeek-R1's training pipeline using open-source components. The project aims to replicate the data, training, and evaluation stages of DeepSeek-R1, making the entire process transparent and accessible. This follows significant interest in DeepSeek-R1's reinforcement-learning-based reasoning approach and addresses the lack of fully open reproduction of that methodology.

5Hugging Face Blog·1mo ago·source ↗

Open R1: Update #4

Hugging Face's Open R1 project releases its fourth progress update on the open reproduction of DeepSeek-R1. The update likely covers training progress, dataset releases, and evaluation results for the open-weights reasoning model effort. This project is a community-driven attempt to replicate and open-source the techniques behind DeepSeek-R1's chain-of-thought reasoning capabilities.

5Hugging Face Blog·1mo ago·source ↗

Open R1: Update #2

Hugging Face's Open R1 project releases its second progress update on the open-source replication of DeepSeek-R1's reasoning capabilities. The update likely covers training progress, dataset releases, and intermediate model checkpoints as the team works toward a fully open reproduction of the reasoning model pipeline. Open R1 is a community-driven effort to make the techniques behind frontier reasoning models accessible to researchers.

5Hugging Face Blog·1mo ago·source ↗

Open R1: Update #3

Hugging Face's Open R1 project releases its third update, continuing the open-source replication effort of DeepSeek-R1's reasoning model training pipeline. The update likely covers progress on data, training runs, and evaluation results for the community-driven reproduction. This is part of an ongoing effort to make frontier reasoning model capabilities accessible via open weights and open training code.

6Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-OCR vision-language model on Hugging Face

DeepSeek has released DeepSeek-OCR, a multilingual image-text-to-text model on Hugging Face, built on the DeepSeek-VL-v2 architecture. The model targets OCR and image feature extraction tasks and has accumulated over 2.4 million downloads and 3,275 likes, indicating significant community uptake. This represents an open-weights multimodal release from a major Chinese AI lab.

9Deepseek News·1mo ago·source ↗

DeepSeek-R1 Release: Open-Source Reasoning Model on Par with OpenAI o1

DeepSeek has released DeepSeek-R1, a reasoning-focused large language model claiming performance parity with OpenAI o1 on math, code, and reasoning benchmarks. The model is fully open-source under the MIT License, including weights and outputs, enabling distillation and commercial use. Six distilled smaller models (up to 32B and 70B) are also released, with the 32B and 70B variants reportedly matching OpenAI o1-mini. API access is live at significantly lower pricing than comparable frontier models ($0.55/M input tokens, $2.19/M output tokens).

7Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V4-Pro-Base on Hugging Face

DeepSeek has released DeepSeek-V4-Pro-Base, a new base model, on Hugging Face with fp8 and safetensors support. The model has accumulated over 20,000 downloads and 291 likes shortly after release. This represents a new generation in DeepSeek's V-series open-weights frontier models.