Almanac
← Events
3DeepSeek (HuggingFace model releases)·10h ago

DeepSeek releases EAGLE3 speculative decoding draft model for Qwen3-4B

DeepSeek published eagle3_qwen3_4b_ttt7 on Hugging Face, a draft model for EAGLE3 speculative decoding targeting the Qwen3-4B base model. EAGLE3 is DeepSeek's third-generation speculative decoding framework designed to accelerate inference by predicting future tokens with a lightweight draft model. The release is a narrow inference-optimization artifact with zero downloads and likes at time of indexing, suggesting it is very fresh or experimental.

Related guides (3)

Related events (8)

3Deepseek·10h ago·source ↗

DeepSeek releases EAGLE3 speculative decoding draft model for Qwen3-8B

DeepSeek published eagle3_qwen3_8b_ttt7 on Hugging Face, a draft model for EAGLE3 speculative decoding targeting the Qwen3-8B base model. EAGLE3 is DeepSeek's third-generation speculative decoding framework designed to accelerate inference by predicting future tokens with a lightweight draft head. The release is a narrow inference optimization artifact with minimal engagement at time of indexing.

3Deepseek·10h ago·source ↗

DeepSeek releases Eagle3 speculative decoding draft model for Qwen3-14B

DeepSeek published eagle3_qwen3_14b_ttt7 on Hugging Face, a draft model for the Eagle3 speculative decoding framework targeting Qwen3-14B. Eagle3 is DeepSeek's third-generation speculative decoding approach designed to accelerate inference. The release is a narrow infrastructure artifact with zero downloads and likes at time of indexing, suggesting it is very early or experimental.

6Deepseek·19d ago·source ↗

DeepSeek releases R1-0528-Qwen3-8B distilled reasoning model on Hugging Face

DeepSeek released DeepSeek-R1-0528-Qwen3-8B, an 8B parameter text-generation model on Hugging Face, combining the R1-0528 reasoning capabilities with a Qwen3 base. The model has accumulated over 306K downloads and 1K likes shortly after release, indicating strong community uptake. This appears to be a distilled version of the R1-0528 reasoning model targeting smaller-scale deployment.

6Deepseek·19d ago·source ↗

DeepSeek releases DeepSeek-V3.2-Speciale on Hugging Face

DeepSeek has published DeepSeek-V3.2-Speciale, a new text-generation model, on Hugging Face under the deepseek-ai organization. The model uses the deepseek_v32 architecture and supports fp8 precision with safetensors format. Early traction is notable with nearly 10,000 downloads and 708 likes shortly after release.

7Deepseek·19d ago·source ↗

DeepSeek releases DeepSeek-V3.1 on Hugging Face

DeepSeek has released DeepSeek-V3.1, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, text-generation-inference, and endpoint deployment, and has accumulated over 220K downloads and 824 likes shortly after release. This appears to be an updated iteration of the DeepSeek-V3 series, a frontier-class open-weights model family.

7Deepseek·19d ago·source ↗

DeepSeek releases DeepSeek-V3.2 on Hugging Face

DeepSeek has released DeepSeek-V3.2, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, is endpoints-compatible, and has accumulated over 3.6 million downloads and 1,446 likes, indicating significant community uptake. This appears to be a successor to DeepSeek-V3, continuing the lab's competitive open-weights model series.

6Deepseek·19d ago·source ↗

DeepSeek releases DeepSeek-V3.2-Exp-Base on Hugging Face

DeepSeek has published DeepSeek-V3.2-Exp-Base, an experimental base model for text generation, on Hugging Face. The model uses the deepseek_v32 architecture and supports fp8 precision with safetensors format. This appears to be a new experimental iteration in the DeepSeek-V3 series, though no technical details or benchmark results are provided in the release metadata.

7Deepseek·19d ago·source ↗

DeepSeek releases DeepSeek-V4-Pro on Hugging Face

DeepSeek has released DeepSeek-V4-Pro, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports FP8 and 8-bit quantization formats and is tagged as endpoints-compatible with eval results included. With over 4.3 million downloads and 4,740 likes, it has attracted significant community uptake.