8DeepSeek News (via RSSHub)·1mo ago

DeepSeek Releases V3.2-Exp with Sparse Attention Architecture and 50%+ API Price Cut

DeepSeek has released DeepSeek-V3.2-Exp, an experimental model built on V3.1-Terminus that introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism designed to improve long-context performance and reduce compute costs during training and inference. Benchmarks indicate V3.2-Exp performs on par with V3.1-Terminus while achieving efficiency gains. The release is accompanied by a 50%+ API price reduction effective immediately, open-weights release on Hugging Face, a technical report, and GPU kernel code in TileLang and CUDA.

Training Infrastructure Long Context Evolution Frontier Model Releases Open Weights Progress Inference Economics DeepSeek API DeepSeek V4 TileLang DeepSeek-V3.1-Terminus DeepSeek Sparse Attention

Related guides (3)

DeepSeek V4

DeepSeek V4: The Open-Weights Giant Reshaping AI Economics

Read asBeginner In-depth

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

Long Context EvolutionTopic guide

Long Context Evolution: From Bigger Windows to Smarter Memory

Read asBeginner In-depth

Related events (8)

7Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V3.2-Exp on Hugging Face

DeepSeek has published DeepSeek-V3.2-Exp, an experimental text-generation model, on Hugging Face under the deepseek-ai organization. The model uses the deepseek_v32 architecture and supports fp8 precision, with tags indicating eval results and endpoint compatibility. Early traction is notable with nearly 176K downloads and ~1K likes shortly after release.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face

7Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V3.2 on Hugging Face

DeepSeek has released DeepSeek-V3.2, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, is endpoints-compatible, and has accumulated over 3.6 million downloads and 1,446 likes, indicating significant community uptake. This appears to be a successor to DeepSeek-V3, continuing the lab's competitive open-weights model series.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face

9Deepseek News·1mo ago·source ↗

DeepSeek-V3: 671B MoE Open-Source Model with 3x Speed Improvement

DeepSeek releases V3, a 671B parameter Mixture-of-Experts model with 37B activated parameters, trained on 14.8T tokens. The model runs at 60 tokens/second (3x faster than V2) and is fully open-source with weights and paper released. API pricing is set at $0.27/M input tokens and $1.10/M output tokens starting February 8, positioning it as a low-cost frontier alternative. DeepSeek signals future multimodal capabilities in the ecosystem.

Frontier Model Releases Open Weights Progress DeepSeek V4 Mixture of Experts +2 more

9Deepseek News·1mo ago·source ↗

DeepSeek V4 Preview Release: 1.6T-param Pro and 284B Flash Models with 1M Context, Open-Sourced

DeepSeek has released DeepSeek-V4 as an open-weights preview, comprising two MoE variants: V4-Pro (1.6T total / 49B active parameters) and V4-Flash (284B total / 13B active parameters). Both models support 1M token context by default, enabled by a novel Token-wise compression and DeepSeek Sparse Attention (DSA) architecture. V4-Pro claims open-source SOTA on agentic coding benchmarks and world-class math/STEM/coding performance rivaling top closed-source models, while V4-Flash offers near-parity reasoning at lower cost and latency. The API is live today with OpenAI and Anthropic compatibility, and legacy model endpoints will be retired in July 2026.

Long Context Evolution Frontier Model Releases DeepSeek V4 DeepSeek-V4-Flash Claude Code +7 more

6Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V3.2-Speciale on Hugging Face

DeepSeek has published DeepSeek-V3.2-Speciale, a new text-generation model, on Hugging Face under the deepseek-ai organization. The model uses the deepseek_v32 architecture and supports fp8 precision with safetensors format. Early traction is notable with nearly 10,000 downloads and 708 likes shortly after release.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face DeepSeek-V3.2-Speciale

6Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V3.2-Exp-Base on Hugging Face

DeepSeek has published DeepSeek-V3.2-Exp-Base, an experimental base model for text generation, on Hugging Face. The model uses the deepseek_v32 architecture and supports fp8 precision with safetensors format. This appears to be a new experimental iteration in the DeepSeek-V3 series, though no technical details or benchmark results are provided in the release metadata.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face

5Deepseek News·1mo ago·source ↗

DeepSeek releases V3.1-Terminus, an incremental update to V3.1 with agent and language consistency improvements

DeepSeek has released DeepSeek-V3.1-Terminus, an update to its V3.1 model addressing user feedback on language mixing issues and improving Code Agent and Search Agent performance. The release claims more stable and reliable benchmark outputs compared to V3.1. Weights are publicly available on Hugging Face, and the model is accessible via the DeepSeek app, web, and API.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face DeepSeek-V3.1-Terminus +1 more

7Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-V3.1 on Hugging Face

DeepSeek has released DeepSeek-V3.1, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, text-generation-inference, and endpoint deployment, and has accumulated over 220K downloads and 824 likes shortly after release. This appears to be an updated iteration of the DeepSeek-V3 series, a frontier-class open-weights model family.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face