Almanac
← Events
6DeepSeek (HuggingFace model releases)·11d ago

DeepSeek releases R1-0528-Qwen3-8B distilled reasoning model on Hugging Face

DeepSeek released DeepSeek-R1-0528-Qwen3-8B, an 8B parameter text-generation model on Hugging Face, combining the R1-0528 reasoning capabilities with a Qwen3 base. The model has accumulated over 306K downloads and 1K likes shortly after release, indicating strong community uptake. This appears to be a distilled version of the R1-0528 reasoning model targeting smaller-scale deployment.

Related guides (3)

Related events (8)

7Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V3.1 on Hugging Face

DeepSeek has released DeepSeek-V3.1, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, text-generation-inference, and endpoint deployment, and has accumulated over 220K downloads and 824 likes shortly after release. This appears to be an updated iteration of the DeepSeek-V3 series, a frontier-class open-weights model family.

7Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V3.2 on Hugging Face

DeepSeek has released DeepSeek-V3.2, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, is endpoints-compatible, and has accumulated over 3.6 million downloads and 1,446 likes, indicating significant community uptake. This appears to be a successor to DeepSeek-V3, continuing the lab's competitive open-weights model series.

7Deepseek News·1mo ago·source ↗

DeepSeek-V3-0324 Released with Improved Reasoning, Tool-Use, and MIT License

DeepSeek has released DeepSeek-V3-0324, an updated version of its V3 model featuring major improvements in reasoning performance, front-end development capabilities, and tool-use. The model is now released under the MIT License, matching DeepSeek-R1's open licensing terms. Weights are publicly available on Hugging Face, and the API interface remains unchanged from the prior V3 version.

7Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V4-Pro on Hugging Face

DeepSeek has released DeepSeek-V4-Pro, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports FP8 and 8-bit quantization formats and is tagged as endpoints-compatible with eval results included. With over 4.3 million downloads and 4,740 likes, it has attracted significant community uptake.

6Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V3.2-Speciale on Hugging Face

DeepSeek has published DeepSeek-V3.2-Speciale, a new text-generation model, on Hugging Face under the deepseek-ai organization. The model uses the deepseek_v32 architecture and supports fp8 precision with safetensors format. Early traction is notable with nearly 10,000 downloads and 708 likes shortly after release.

7Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V3.1-Base on Hugging Face

DeepSeek has released DeepSeek-V3.1-Base, a new base model for text generation, on Hugging Face. The model supports fp8 precision, safetensors format, and is compatible with text-generation-inference endpoints. With over 1,000 likes and nearly 9,000 downloads shortly after release, it is attracting significant community attention as a successor to the widely-used DeepSeek-V3.

9Deepseek News·1mo ago·source ↗

DeepSeek-R1 Release: Open-Source Reasoning Model on Par with OpenAI o1

DeepSeek has released DeepSeek-R1, a reasoning-focused large language model claiming performance parity with OpenAI o1 on math, code, and reasoning benchmarks. The model is fully open-source under the MIT License, including weights and outputs, enabling distillation and commercial use. Six distilled smaller models (up to 32B and 70B) are also released, with the 32B and 70B variants reportedly matching OpenAI o1-mini. API access is live at significantly lower pricing than comparable frontier models ($0.55/M input tokens, $2.19/M output tokens).

7Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V4-Flash on Hugging Face

DeepSeek has released DeepSeek-V4-Flash, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports FP8 and 8-bit quantization and is tagged as conversational and endpoints-compatible. With over 2.8 million downloads and 1,455 likes, it has seen substantial early uptake.