6 · DeepSeek (Hugging Face model releases) · 11d ago

DeepSeek releases R1-0528-Qwen3-8B distilled reasoning model on Hugging Face

DeepSeek released DeepSeek-R1-0528-Qwen3-8B, an 8B-parameter text-generation model, on Hugging Face. It combines the reasoning capabilities of R1-0528 with a Qwen3 base. The model has accumulated over 306K downloads and 1K likes shortly after release, indicating strong community uptake. It appears to be a distilled version of the R1-0528 reasoning model aimed at smaller-scale deployment.

Frontier Model Releases · Open Weights Progress · Inference Economics · DeepSeek-R1-0528 · DeepSeek V4 · DeepSeek-R1-0528-Qwen3-8B · Hugging Face · Qwen3

Related guides (3)

Hugging Face

Hugging Face: The Home of Open-Source AI

DeepSeek V4

DeepSeek V4: The Open-Weights Giant Reshaping AI Economics

Frontier Model Releases · Topic guide

Frontier Model Releases: The Race From Language to Action

Related events (8)

7 · DeepSeek · 11d ago

DeepSeek releases DeepSeek-V3.1 on Hugging Face

DeepSeek has released DeepSeek-V3.1, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, text-generation-inference, and endpoint deployment, and has accumulated over 220K downloads and 824 likes shortly after release. This appears to be an updated iteration of the DeepSeek-V3 series, a frontier-class open-weights model family.

Frontier Model Releases · Open Weights Progress · DeepSeek V4 · Hugging Face

7 · DeepSeek · 11d ago

DeepSeek releases DeepSeek-V3.2 on Hugging Face

DeepSeek has released DeepSeek-V3.2, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, is endpoints-compatible, and has accumulated over 3.6 million downloads and 1,446 likes, indicating significant community uptake. This appears to be a successor to DeepSeek-V3, continuing the lab's competitive open-weights model series.

Frontier Model Releases · Open Weights Progress · DeepSeek V4 · Hugging Face

7 · DeepSeek News · 1mo ago

DeepSeek-V3-0324 Released with Improved Reasoning, Tool-Use, and MIT License

DeepSeek has released DeepSeek-V3-0324, an updated version of its V3 model with major improvements in reasoning performance, front-end development, and tool use. The model is now released under the MIT License, matching DeepSeek-R1's open licensing terms. Weights are publicly available on Hugging Face, and the API interface remains unchanged from the prior V3 version.

Frontier Model Releases · Open Weights Progress · DeepSeek-V3-0324 · DeepSeek V4 · MIT License · +2 more

7 · DeepSeek · 11d ago

DeepSeek releases DeepSeek-V4-Pro on Hugging Face

DeepSeek has released DeepSeek-V4-Pro, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports FP8 and 8-bit quantization formats and is tagged as endpoints-compatible with eval results included. With over 4.3 million downloads and 4,740 likes, it has attracted significant community uptake.

Frontier Model Releases · Open Weights Progress · DeepSeek V4 · Hugging Face

6 · DeepSeek · 11d ago

DeepSeek releases DeepSeek-V3.2-Speciale on Hugging Face

DeepSeek has published DeepSeek-V3.2-Speciale, a new text-generation model, on Hugging Face under the deepseek-ai organization. The model uses the deepseek_v32 architecture and supports fp8 precision with safetensors format. Early traction is notable with nearly 10,000 downloads and 708 likes shortly after release.

Frontier Model Releases · Open Weights Progress · DeepSeek V4 · Hugging Face · DeepSeek-V3.2-Speciale

7 · DeepSeek · 11d ago

DeepSeek releases DeepSeek-V3.1-Base on Hugging Face

DeepSeek has released DeepSeek-V3.1-Base, a new base model for text generation, on Hugging Face. The model supports fp8 precision and the safetensors format, and it is compatible with text-generation-inference endpoints. With over 1,000 likes and nearly 9,000 downloads shortly after release, it is attracting significant community attention as a successor to the widely used DeepSeek-V3.

Frontier Model Releases · Open Weights Progress · DeepSeek V4 · Hugging Face · DeepSeek-V3.1-Base

9 · DeepSeek News · 1mo ago

DeepSeek-R1 Release: Open-Source Reasoning Model on Par with OpenAI o1

DeepSeek has released DeepSeek-R1, a reasoning-focused large language model that claims performance parity with OpenAI o1 on math, code, and reasoning benchmarks. The model is fully open-source under the MIT License, covering both weights and outputs, which permits distillation and commercial use. Six smaller distilled models were released alongside it, and the 32B and 70B variants reportedly match OpenAI o1-mini. API access is live at significantly lower pricing than comparable frontier models ($0.55/M input tokens, $2.19/M output tokens).

Frontier Model Releases · Evaluation and Benchmarking · DeepSeek API · DeepSeek V4 · OpenAI o3-mini · +5 more
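
To put the DeepSeek-R1 API prices quoted above in concrete terms, here is a minimal sketch of the arithmetic. It assumes a flat per-token rate at the two quoted prices and ignores any caching discounts or pricing tiers; the function name and the example workload are hypothetical and not part of any DeepSeek API.

```python
# Prices quoted in the DeepSeek-R1 release summary (USD per million tokens).
INPUT_PRICE_PER_M = 0.55
OUTPUT_PRICE_PER_M = 2.19


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate API cost in USD, assuming flat per-token pricing (an assumption)."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    print(f"${estimate_cost_usd(2_000_000, 500_000):.4f}")
```

For this hypothetical workload the input side costs $1.10 and the output side about $1.095, so output tokens dominate the bill quickly for reasoning models that produce long responses.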

7 · DeepSeek · 11d ago

DeepSeek releases DeepSeek-V4-Flash on Hugging Face

DeepSeek has released DeepSeek-V4-Flash, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports FP8 and 8-bit quantization and is tagged as conversational and endpoints-compatible. With over 2.8 million downloads and 1,455 likes, it has seen substantial early uptake.

Frontier Model Releases · Open Weights Progress · DeepSeek V4 · DeepSeek-V4-Flash · Hugging Face · +1 more