3DeepSeek (HuggingFace model releases)·7h ago

DeepSeek releases dspark_gemma4_12b_block7 model weights on Hugging Face

DeepSeek uploaded a model checkpoint named dspark_gemma4_12b_block7 to Hugging Face, tagged with gemma4_text and safetensors format. The naming suggests a 12B parameter model built on or related to Google's Gemma 4 architecture, possibly a distillation or block-level experiment. The release has minimal engagement (0 downloads, 3 likes) and no accompanying documentation, making its purpose unclear.

Open Weights Progress DeepSeek V4 Google dspark_gemma4_12b_block7 Gemma 4

Related guides (3)

Open Weights ProgressTopic guide

Open Weights Progress: How Freely Available AI Models Caught Up to the Frontier

Read asBeginner In-depth

Google

Google: The AI Lab That Builds Everything from DNA Models to Your Phone's Assistant

Read asBeginner In-depth

DeepSeek V4

DeepSeek V4: The Open-Weights Giant Reshaping AI Economics

Read asBeginner In-depth

Related events (8)

3Deepseek·7h ago·source ↗

DeepSeek releases dflash_gemma4_12b_block7 model weights on Hugging Face

DeepSeek uploaded a model checkpoint named dflash_gemma4_12b_block7 to Hugging Face, tagged with gemma4_text and safetensors format. The naming suggests a distillation or flash-attention variant of a Gemma 4 12B architecture, possibly a block-level component or intermediate checkpoint. The release has minimal engagement (0 downloads, 2 likes) and no accompanying documentation, suggesting an experimental or internal artifact rather than a polished release.

Frontier Model Releases Open Weights Progress deepseek-ai/dflash_gemma4_12b_block7 DeepSeek V4 Google +1 more

5Deepseek·37h ago·source ↗

DeepSeek releases DeepSeek-V4-Flash-DSpark on Hugging Face

DeepSeek has published a new model checkpoint, DeepSeek-V4-Flash-DSpark, on Hugging Face under the deepseek_v4 model family. The release is tagged as a text-generation model with FP8 and 8-bit support, suggesting an efficiency-optimized variant. The 'Flash' and 'DSpark' naming implies a faster or distilled derivative of the DeepSeek V4 flagship. Download counts are near zero, indicating a very recent upload.

Frontier Model Releases Inference Economics DeepSeek V4 DeepSeek-V4-Flash Hugging Face

5Deepseek·37h ago·source ↗

DeepSeek releases DeepSeek-V4-Pro-DSpark on Hugging Face

DeepSeek has published a new model checkpoint, DeepSeek-V4-Pro-DSpark, on Hugging Face under the text-generation category. The model uses the deepseek_v4 architecture and supports FP8 and 8-bit quantization formats. The 'DSpark' suffix suggests a variant or specialized version of the DeepSeek V4 Pro line, though no accompanying technical documentation is visible in this listing.

Frontier Model Releases Open Weights Progress DeepSeek V4 DeepSeek-V4-Pro-DSpark Hugging Face

7Deepseek·18d ago·source ↗

DeepSeek releases DeepSeek-V4-Pro-Base on Hugging Face

DeepSeek has released DeepSeek-V4-Pro-Base, a new base model, on Hugging Face with fp8 and safetensors support. The model has accumulated over 20,000 downloads and 291 likes shortly after release. This represents a new generation in DeepSeek's V-series open-weights frontier models.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face

7Deepseek·18d ago·source ↗

DeepSeek releases DeepSeek-V4-Flash-Base on Hugging Face

DeepSeek has released DeepSeek-V4-Flash-Base, a new open-weights base model, on Hugging Face. The model uses FP8 precision and the deepseek_v4 architecture with safetensors format. Early traction is notable with over 66,000 downloads and 241 likes shortly after release, suggesting significant community interest in a 'Flash' variant of the V4 series.

Frontier Model Releases Open Weights Progress DeepSeek V4 DeepSeek-V4-Flash Hugging Face +1 more

6Deepseek·18d ago·source ↗

DeepSeek releases DeepSeek-Math-V2 on Hugging Face

DeepSeek has released DeepSeek-Math-V2, a math-specialized text-generation model, on Hugging Face. The model uses the deepseek_v32 architecture and is available in fp8 format with safetensors support. Early engagement metrics show 697 likes and 416 downloads, suggesting notable community interest for a new release.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face DeepSeek-Math-V2

6Deepseek·18d ago·source ↗

DeepSeek releases DeepSeek-V3.2-Speciale on Hugging Face

DeepSeek has published DeepSeek-V3.2-Speciale, a new text-generation model, on Hugging Face under the deepseek-ai organization. The model uses the deepseek_v32 architecture and supports fp8 precision with safetensors format. Early traction is notable with nearly 10,000 downloads and 708 likes shortly after release.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face DeepSeek-V3.2-Speciale

7Deepseek·18d ago·source ↗

DeepSeek releases DeepSeek-V3.2 on Hugging Face

DeepSeek has released DeepSeek-V3.2, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, is endpoints-compatible, and has accumulated over 3.6 million downloads and 1,446 likes, indicating significant community uptake. This appears to be a successor to DeepSeek-V3, continuing the lab's competitive open-weights model series.

Frontier Model Releases Open Weights Progress DeepSeek V4 Hugging Face