7DeepSeek News (via RSSHub)·1mo ago

DeepSeek-V3-0324 Released with Improved Reasoning, Tool-Use, and MIT License

DeepSeek has released DeepSeek-V3-0324, an updated version of its V3 model featuring major improvements in reasoning performance, front-end development capabilities, and tool-use. The model is now released under the MIT License, matching DeepSeek-R1's open licensing terms. Weights are publicly available on Hugging Face, and the API interface remains unchanged from the prior V3 version.

Frontier Model Releases Open Weights Progress Agent and Tool Ecosystem DeepSeek-V3-0324 DeepSeek V4 MIT License Hugging Face

Related guides (3)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

DeepSeek V4

DeepSeek V4: The Open-Weights Giant Reshaping AI Economics

Read asBeginner In-depth

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

Related events (8)

8Deepseek News·1mo ago·source ↗

DeepSeek-V3.2 and V3.2-Speciale Released: Reasoning-First Models with Agent Tool-Use Integration

DeepSeek has released two new open-weights models: DeepSeek-V3.2, the official successor to V3.2-Exp with balanced reasoning and tool-use capabilities, and DeepSeek-V3.2-Speciale, a maxed-out reasoning variant claiming gold-medal performance on IMO, CMO, ICPC World Finals, and IOI 2025. V3.2 is the first DeepSeek model to integrate chain-of-thought thinking directly into tool-use workflows, trained on a new agent data synthesis pipeline covering 1,800+ environments and 85k+ complex instructions. V3.2-Speciale is API-only with no tool-call support, available via a temporary endpoint expiring December 15, 2025, while both models are open-sourced on Hugging Face with an accompanying technical report.

Frontier Model Releases Evaluation and Benchmarking DeepSeek V4 Gemini-3.0-Pro ICPC World Finals +8 more

8Deepseek News·1mo ago·source ↗

DeepSeek-V3.1 Release: Hybrid Think/Non-Think Model with Agent-Focused Upgrades

DeepSeek has released V3.1, a hybrid inference model supporting both thinking and non-thinking modes in a single model, positioned as their first step toward the agent era. The model features improved tool use and multi-step agent task performance, with benchmarks showing gains on SWE-bench and Terminal-Bench, and faster thinking efficiency compared to DeepSeek-R1-0528. The base model received 840B tokens of continued pretraining for long-context extension, a new tokenizer, and open-source weights are available on HuggingFace. API updates include 128K context for both modes, Anthropic API format compatibility, and strict function calling support in beta.

Long Context Evolution Frontier Model Releases DeepSeek-R1-0528 DeepSeek V4 SWE-bench +6 more

9Deepseek News·1mo ago·source ↗

DeepSeek-R1 Release: Open-Source Reasoning Model on Par with OpenAI o1

DeepSeek has released DeepSeek-R1, a reasoning-focused large language model claiming performance parity with OpenAI o1 on math, code, and reasoning benchmarks. The model is fully open-source under the MIT License, including weights and outputs, enabling distillation and commercial use. Six distilled smaller models (up to 32B and 70B) are also released, with the 32B and 70B variants reportedly matching OpenAI o1-mini. API access is live at significantly lower pricing than comparable frontier models ($0.55/M input tokens, $2.19/M output tokens).

Frontier Model Releases Evaluation and Benchmarking DeepSeek API DeepSeek V4 OpenAI o3-mini +5 more

7Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V3.2 on Hugging Face

DeepSeek has released DeepSeek-V3.2, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, is endpoints-compatible, and has accumulated over 3.6 million downloads and 1,446 likes, indicating significant community uptake. This appears to be a successor to DeepSeek-V3, continuing the lab's competitive open-weights model series.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face

6Deepseek News·1mo ago·source ↗

DeepSeek-R1-0528 Released with Improved Benchmarks, Reduced Hallucinations, and Function Calling

DeepSeek has released DeepSeek-R1-0528, an updated version of its R1 reasoning model featuring improved benchmark performance, reduced hallucinations, enhanced front-end capabilities, and new support for JSON output and function calling. The API interface remains unchanged, and open-source weights are available on Hugging Face. This is an incremental update to the R1 series rather than a new flagship model.

Frontier Model Releases Open Weights Progress DeepSeek-R1-0528 DeepSeek V4 Hugging Face +1 more

7Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V3.1 on Hugging Face

DeepSeek has released DeepSeek-V3.1, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, text-generation-inference, and endpoint deployment, and has accumulated over 220K downloads and 824 likes shortly after release. This appears to be an updated iteration of the DeepSeek-V3 series, a frontier-class open-weights model family.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face

7Deepseek News·1mo ago·source ↗

DeepSeek-R1-Lite-Preview Launched with o1-Level Reasoning Performance

DeepSeek has released DeepSeek-R1-Lite-Preview, a reasoning-focused model claiming o1-preview-level performance on AIME and MATH benchmarks. The model features a transparent, real-time chain-of-thought process and demonstrates inference scaling behavior where longer reasoning chains yield better results. DeepSeek has indicated that open-source model weights and a full API are forthcoming. The model is currently accessible via chat.deepseek.com.

Frontier Model Releases Evaluation and Benchmarking o1-preview DeepSeek V4 AIME +4 more

5Deepseek News·1mo ago·source ↗

DeepSeek V2.5-1210: Final Update to V2.5 Series, V3 Generation Teased

DeepSeek has released DeepSeek-V2.5-1210, the final update to its V2.5 model series, with claimed improvements across math, coding, writing, and roleplay benchmarks. The model is available as open weights on Hugging Face. DeepSeek also announced the launch of Internet Search on chat.deepseek.com. The release marks the end of the V2 generation, with the company signaling work on next-generation foundation models.

Frontier Model Releases Open Weights Progress DeepSeek V4 deepseek-chat Hugging Face +2 more