5 · DeepSeek News (via RSSHub) · 1mo ago

DeepSeek releases V3.1-Terminus, an incremental update to V3.1 with agent and language consistency improvements

DeepSeek has released DeepSeek-V3.1-Terminus, an update to its V3.1 model that addresses user feedback about language mixing and improves Code Agent and Search Agent performance. DeepSeek claims more stable and reliable outputs across benchmarks than V3.1. Weights are publicly available on Hugging Face, and the model is accessible via the DeepSeek app, web, and API.

Related events (8)

7 · DeepSeek · 11d ago

DeepSeek releases DeepSeek-V3.1-Terminus on Hugging Face

DeepSeek has published DeepSeek-V3.1-Terminus, a new text-generation model, on Hugging Face under the deepseek_v3 architecture family. The model supports FP8 precision, uses the safetensors format, and is compatible with text-generation-inference endpoints. Early traction is visible, with over 11,500 downloads and 365 likes shortly after release.

8 · DeepSeek News · 1mo ago

DeepSeek-V3.1 Release: Hybrid Think/Non-Think Model with Agent-Focused Upgrades

DeepSeek has released V3.1, a hybrid inference model that supports both thinking and non-thinking modes in a single model, positioned as the company's first step toward the agent era. The model features improved tool use and multi-step agent task performance, with benchmarks showing gains on SWE-bench and Terminal-Bench, and it reasons more efficiently than DeepSeek-R1-0528. The base model received 840B tokens of continued pretraining for long-context extension and uses a new tokenizer. Open-source weights are available on Hugging Face. API updates include 128K context for both modes, Anthropic API format compatibility, and strict function calling support in beta.

7 · DeepSeek News · 1mo ago

DeepSeek-V3-0324 Released with Improved Reasoning, Tool-Use, and MIT License

DeepSeek has released DeepSeek-V3-0324, an updated version of its V3 model featuring major improvements in reasoning performance, front-end development capabilities, and tool use. The model is now released under the MIT License, matching DeepSeek-R1's open licensing terms. Weights are publicly available on Hugging Face, and the API interface remains unchanged from the prior V3 version.

7 · DeepSeek · 11d ago

DeepSeek releases DeepSeek-V3.2 on Hugging Face

DeepSeek has released DeepSeek-V3.2, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, is endpoints-compatible, and has accumulated over 3.6 million downloads and 1,446 likes, indicating significant community uptake. This appears to be a successor to DeepSeek-V3, continuing the lab's competitive open-weights model series.

7 · DeepSeek · 11d ago

DeepSeek releases DeepSeek-V3.1 on Hugging Face

DeepSeek has released DeepSeek-V3.1, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, text-generation-inference, and endpoint deployment, and has accumulated over 220K downloads and 824 likes shortly after release. This appears to be an updated iteration of the DeepSeek-V3 series, a frontier-class open-weights model family.

5 · DeepSeek News · 1mo ago

DeepSeek V2.5-1210: Final Update to V2.5 Series, V3 Generation Teased

DeepSeek has released DeepSeek-V2.5-1210, the final update to its V2.5 model series, with claimed improvements across math, coding, writing, and roleplay benchmarks. The model is available as open weights on Hugging Face. DeepSeek also announced the launch of Internet Search on chat.deepseek.com. The release marks the end of the V2 generation, with the company signaling work on next-generation foundation models.

8 · DeepSeek News · 1mo ago

DeepSeek Releases V3.2-Exp with Sparse Attention Architecture and 50%+ API Price Cut

DeepSeek has released DeepSeek-V3.2-Exp, an experimental model built on V3.1-Terminus that introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism designed to improve long-context performance and reduce compute costs during training and inference. Benchmarks indicate that V3.2-Exp performs on par with V3.1-Terminus while achieving efficiency gains. The release is accompanied by an API price reduction of more than 50%, effective immediately, along with an open-weights release on Hugging Face, a technical report, and GPU kernel code in TileLang and CUDA.

6 · DeepSeek · 11d ago

DeepSeek releases DeepSeek-V3.2-Speciale on Hugging Face

DeepSeek has published DeepSeek-V3.2-Speciale, a new text-generation model, on Hugging Face under the deepseek-ai organization. The model uses the deepseek_v32 architecture and supports fp8 precision with the safetensors format. Early traction is notable, with nearly 10,000 downloads and 708 likes shortly after release.
