DeepSeek-V3.1-Base
deepseek-v3-1-base-f8a9603e·2 events·first seen 1mo agoAliases: DeepSeek-V3.1-Base
Co-occurring entities
More like this (12)
Recent events (2)
DeepSeek releases DeepSeek-V3.1-Base on Hugging Face
DeepSeek has released DeepSeek-V3.1-Base, a new base model for text generation, on Hugging Face. The model supports fp8 precision, safetensors format, and is compatible with text-generation-inference endpoints. With over 1,000 likes and nearly 9,000 downloads shortly after release, it is attracting significant community attention as a successor to the widely-used DeepSeek-V3.
DeepSeek-V3.1 Release: Hybrid Think/Non-Think Model with Agent-Focused Upgrades
DeepSeek has released V3.1, a hybrid inference model supporting both thinking and non-thinking modes in a single model, positioned as their first step toward the agent era. The model features improved tool use and multi-step agent task performance, with benchmarks showing gains on SWE-bench and Terminal-Bench, and faster thinking efficiency compared to DeepSeek-R1-0528. The base model received 840B tokens of continued pretraining for long-context extension, a new tokenizer, and open-source weights are available on HuggingFace. API updates include 128K context for both modes, Anthropic API format compatibility, and strict function calling support in beta.