3 · DeepSeek (Hugging Face model releases) · 10h ago

DeepSeek releases EAGLE3 speculative decoding draft model for Qwen3-4B

DeepSeek published eagle3_qwen3_4b_ttt7 on Hugging Face, a draft model for EAGLE3 speculative decoding targeting the Qwen3-4B base model. EAGLE3 is the third generation of the EAGLE speculative decoding framework, which accelerates inference by having a lightweight draft model propose future tokens that the larger target model then verifies. The release is a narrow inference-optimization artifact with zero downloads and zero likes at time of indexing, suggesting it is very new or experimental.
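For readers unfamiliar with the draft-and-verify idea, here is a minimal sketch of generic greedy speculative decoding. It is not EAGLE3's actual algorithm and not the released model's code. The names `speculative_decode`, `greedy_decode`, `toy_target`, and `toy_draft` are hypothetical, and the "models" are toy functions over integer tokens. The sketch assumes greedy decoding, in which case the output matches what the target model would produce on its own.

```python
from typing import Callable, List

# A "model" here is any function mapping a token context to the next token.
NextToken = Callable[[List[int]], int]


def speculative_decode(target: NextToken, draft: NextToken,
                       prompt: List[int], max_new_tokens: int,
                       k: int = 4) -> List[int]:
    """Greedy draft-and-verify loop (illustrative, not EAGLE3-specific)."""
    tokens = list(prompt)
    goal = len(prompt) + max_new_tokens
    while len(tokens) < goal:
        # 1) The cheap draft model proposes k tokens autoregressively.
        proposal: List[int] = []
        for _ in range(k):
            proposal.append(draft(tokens + proposal))

        # 2) The target model checks each proposed position. A real system
        #    scores all k positions in one batched forward pass; this loop
        #    calls the target per position for clarity.
        accepted: List[int] = []
        for tok in proposal:
            expected = target(tokens + accepted)
            if tok == expected:
                accepted.append(tok)       # draft guessed correctly
            else:
                accepted.append(expected)  # replace the first wrong guess
                break
        else:
            # Every draft token was accepted: the target adds one more token.
            accepted.append(target(tokens + accepted))

        tokens.extend(accepted)
    return tokens[:goal]


def greedy_decode(target: NextToken, prompt: List[int],
                  max_new_tokens: int) -> List[int]:
    """Baseline: one target call per generated token."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        tokens.append(target(tokens))
    return tokens


# Toy stand-ins (assumptions for illustration only).
def toy_target(ctx: List[int]) -> int:
    return (sum(ctx) * 31 + len(ctx)) % 50


def toy_draft(ctx: List[int]) -> int:
    # Agrees with the target except when the context length is a multiple of 3.
    guess = toy_target(ctx)
    return guess if len(ctx) % 3 else (guess + 1) % 50


if __name__ == "__main__":
    prompt = [1, 2, 3]
    fast = speculative_decode(toy_target, toy_draft, prompt, max_new_tokens=20)
    slow = greedy_decode(toy_target, prompt, max_new_tokens=20)
    print(fast == slow)
```

The speedup in practice comes from step 2: when the draft is usually right, one target pass yields several tokens instead of one. How often draft tokens are accepted depends on how closely the draft tracks its specific target, which is why a draft model is published per target size.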

Open Weights Progress Inference Economics Eagle3 DeepSeek V4 Qwen3-4B eagle3_qwen3_14b_ttt7 Hugging Face

Related guides (3)

Open Weights Progress · Topic guide

Open Weights Progress: How Freely Available AI Models Caught Up to the Frontier

Inference Economics · Topic guide

Inference Economics: The Cost of Running AI in Production

Hugging Face

Hugging Face: The Home of Open-Source AI

Related events (8)

3 · DeepSeek · 10h ago

DeepSeek releases EAGLE3 speculative decoding draft model for Qwen3-8B

DeepSeek published eagle3_qwen3_8b_ttt7 on Hugging Face, a draft model for EAGLE3 speculative decoding targeting the Qwen3-8B base model. EAGLE3 is the third generation of the EAGLE speculative decoding framework, which accelerates inference by predicting future tokens with a lightweight draft head. The release is a narrow inference-optimization artifact with minimal engagement at time of indexing.

Inference Economics Eagle3 DeepSeek V4 Qwen3-4B +2 more

3 · DeepSeek · 10h ago

DeepSeek releases EAGLE3 speculative decoding draft model for Qwen3-14B

DeepSeek published eagle3_qwen3_14b_ttt7 on Hugging Face, a draft model for EAGLE3 speculative decoding targeting the Qwen3-14B base model. EAGLE3 is the third generation of the EAGLE speculative decoding framework, which is designed to accelerate inference. The release is a narrow infrastructure artifact with zero downloads and zero likes at time of indexing, suggesting it is very early or experimental.

Inference Economics Eagle3 DeepSeek V4 Qwen3-14B +1 more

6 · DeepSeek · 19d ago

DeepSeek releases R1-0528-Qwen3-8B distilled reasoning model on Hugging Face

DeepSeek released DeepSeek-R1-0528-Qwen3-8B, an 8B-parameter text-generation model on Hugging Face that combines the reasoning capabilities of R1-0528 with a Qwen3 base. The model accumulated over 306K downloads and 1K likes shortly after release, indicating strong community uptake. It appears to be a distilled version of the R1-0528 reasoning model aimed at smaller-scale deployment.

Frontier Model Releases Open Weights Progress DeepSeek-R1-0528 DeepSeek V4 DeepSeek-R1-0528-Qwen3-8B +3 more

6 · DeepSeek · 19d ago

DeepSeek releases DeepSeek-V3.2-Speciale on Hugging Face

DeepSeek has published DeepSeek-V3.2-Speciale, a new text-generation model, on Hugging Face under the deepseek-ai organization. The model uses the deepseek_v32 architecture and ships FP8 weights in safetensors format. Early traction is notable, with nearly 10,000 downloads and 708 likes shortly after release.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face DeepSeek-V3.2-Speciale

7 · DeepSeek · 19d ago

DeepSeek releases DeepSeek-V3.1 on Hugging Face

DeepSeek has released DeepSeek-V3.1, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model is tagged for FP8 precision, text-generation-inference, and endpoint deployment, and accumulated over 220K downloads and 824 likes shortly after release. It appears to be an updated iteration of the DeepSeek-V3 series, a frontier-class open-weights model family.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face

7 · DeepSeek · 19d ago

DeepSeek releases DeepSeek-V3.2 on Hugging Face

DeepSeek has released DeepSeek-V3.2, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model is tagged for FP8 precision and as endpoints-compatible, and has accumulated over 3.6 million downloads and 1,446 likes, indicating significant community uptake. It appears to be a successor to DeepSeek-V3, continuing the lab's competitive open-weights model series.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face

6 · DeepSeek · 19d ago

DeepSeek releases DeepSeek-V3.2-Exp-Base on Hugging Face

DeepSeek has published DeepSeek-V3.2-Exp-Base, an experimental base model for text generation, on Hugging Face. The model uses the deepseek_v32 architecture and ships FP8 weights in safetensors format. It appears to be a new experimental iteration in the DeepSeek-V3 series, though the release metadata provides no technical details or benchmark results.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face

7 · DeepSeek · 19d ago

DeepSeek releases DeepSeek-V4-Pro on Hugging Face

DeepSeek has released DeepSeek-V4-Pro, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model is tagged for FP8 and 8-bit quantization formats and as endpoints-compatible, with eval results included. With over 4.3 million downloads and 4,740 likes, it has attracted significant community uptake.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face