DeepSeek-V3.1-Terminus

model · active · deepseek-v3-1-terminus-d1e68e5a · 3 events · first seen 1mo ago

Aliases: DeepSeek-V3.1-Terminus

Recent events (3)

5 · Deepseek News · 1mo ago

DeepSeek releases V3.1-Terminus, an incremental update to V3.1 with agent and language consistency improvements

DeepSeek has released DeepSeek-V3.1-Terminus, an update to its V3.1 model that addresses user feedback about language mixing and improves Code Agent and Search Agent performance. DeepSeek claims more stable and reliable outputs across benchmarks than V3.1. Weights are publicly available on Hugging Face, and the model is accessible through the DeepSeek app, web interface, and API.

7 · Deepseek · 7d ago

DeepSeek releases DeepSeek-V3.1-Terminus on Hugging Face

DeepSeek has published DeepSeek-V3.1-Terminus, a new text-generation model, on Hugging Face under the deepseek_v3 architecture family. The model supports FP8 precision, ships in safetensors format, and is compatible with text-generation-inference endpoints. Early traction is visible, with over 11,500 downloads and 365 likes shortly after release.

8 · Deepseek News · 1mo ago

DeepSeek releases V3.2-Exp with sparse attention architecture and 50%+ API price cut

DeepSeek has released DeepSeek-V3.2-Exp, an experimental model built on V3.1-Terminus that introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism designed to improve long-context performance and reduce compute costs during training and inference. Benchmarks indicate that V3.2-Exp performs on par with V3.1-Terminus while achieving efficiency gains. The release is accompanied by an API price reduction of more than 50%, effective immediately, along with open weights on Hugging Face, a technical report, and GPU kernel code in TileLang and CUDA.
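
For readers unfamiliar with the general idea behind sparse attention, the sketch below shows one generic variant, top-k selection, in which a query attends only to its k highest-scoring keys. It is an illustration under stated assumptions, not DeepSeek's DSA: the event summary does not describe how DSA selects positions, and the function name `topk_sparse_attention` and its parameters are hypothetical. For simplicity the sketch scores every key, so it shows the selection pattern rather than the compute savings.

```python
import math


def topk_sparse_attention(query, keys, values, k):
    """Generic top-k sparse attention for a single query (illustrative only).

    Hypothetical helper: NOT DeepSeek's DSA, whose selection rule is not
    described in the summary above.
    """
    d = len(query)
    # Scaled dot-product score between the query and every key.
    scores = [sum(q * x for q, x in zip(query, key)) / math.sqrt(d) for key in keys]
    # Keep only the indices of the k largest scores; other positions are ignored.
    k = min(k, len(keys))
    kept = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)[:k]
    # Numerically stable softmax restricted to the kept positions.
    top = max(scores[i] for i in kept)
    exps = {i: math.exp(scores[i] - top) for i in kept}
    total = sum(exps.values())
    weights = {i: e / total for i, e in exps.items()}
    # Weighted sum of the kept value vectors.
    dim = len(values[0])
    out = [sum(weights[i] * values[i][j] for i in kept) for j in range(dim)]
    return out, sorted(kept)


if __name__ == "__main__":
    q = [1.0, 0.0]
    ks = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
    vs = [[1.0, 0.0], [0.0, 1.0], [5.0, 5.0]]
    output, used = topk_sparse_attention(q, ks, vs, k=2)
    print(used, output)
```

With k equal to the number of keys, the function reduces to ordinary dense softmax attention, which makes it easy to compare the two on small inputs.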