DeepSeek releases dspark_gemma4_12b_block7 model weights on Hugging Face
DeepSeek uploaded a model checkpoint named dspark_gemma4_12b_block7 to Hugging Face, tagged with gemma4_text and safetensors format. The naming suggests a 12B parameter model built on or related to Google's Gemma 4 architecture, possibly a distillation or block-level experiment. The release has minimal engagement (0 downloads, 3 likes) and no accompanying documentation, making its purpose unclear.
Related guides (3)
Related events (8)
DeepSeek releases dflash_gemma4_12b_block7 model weights on Hugging Face
DeepSeek uploaded a model checkpoint named dflash_gemma4_12b_block7 to Hugging Face, tagged with gemma4_text and safetensors format. The naming suggests a distillation or flash-attention variant of a Gemma 4 12B architecture, possibly a block-level component or intermediate checkpoint. The release has minimal engagement (0 downloads, 2 likes) and no accompanying documentation, suggesting an experimental or internal artifact rather than a polished release.
DeepSeek releases DeepSeek-V4-Flash-DSpark on Hugging Face
DeepSeek has published a new model checkpoint, DeepSeek-V4-Flash-DSpark, on Hugging Face under the deepseek_v4 model family. The release is tagged as a text-generation model with FP8 and 8-bit support, suggesting an efficiency-optimized variant. The 'Flash' and 'DSpark' naming implies a faster or distilled derivative of the DeepSeek V4 flagship. Download counts are near zero, indicating a very recent upload.
DeepSeek releases DeepSeek-V4-Pro-DSpark on Hugging Face
DeepSeek has published a new model checkpoint, DeepSeek-V4-Pro-DSpark, on Hugging Face under the text-generation category. The model uses the deepseek_v4 architecture and supports FP8 and 8-bit quantization formats. The 'DSpark' suffix suggests a variant or specialized version of the DeepSeek V4 Pro line, though no accompanying technical documentation is visible in this listing.
DeepSeek releases DeepSeek-V4-Pro-Base on Hugging Face
DeepSeek has released DeepSeek-V4-Pro-Base, a new base model, on Hugging Face with fp8 and safetensors support. The model has accumulated over 20,000 downloads and 291 likes shortly after release. This represents a new generation in DeepSeek's V-series open-weights frontier models.
DeepSeek releases DeepSeek-V4-Flash-Base on Hugging Face
DeepSeek has released DeepSeek-V4-Flash-Base, a new open-weights base model, on Hugging Face. The model uses FP8 precision and the deepseek_v4 architecture with safetensors format. Early traction is notable with over 66,000 downloads and 241 likes shortly after release, suggesting significant community interest in a 'Flash' variant of the V4 series.
DeepSeek releases DeepSeek-Math-V2 on Hugging Face
DeepSeek has released DeepSeek-Math-V2, a math-specialized text-generation model, on Hugging Face. The model uses the deepseek_v32 architecture and is available in fp8 format with safetensors support. Early engagement metrics show 697 likes and 416 downloads, suggesting notable community interest for a new release.
DeepSeek releases DeepSeek-V3.2-Speciale on Hugging Face
DeepSeek has published DeepSeek-V3.2-Speciale, a new text-generation model, on Hugging Face under the deepseek-ai organization. The model uses the deepseek_v32 architecture and supports fp8 precision with safetensors format. Early traction is notable with nearly 10,000 downloads and 708 likes shortly after release.
DeepSeek releases DeepSeek-V3.2 on Hugging Face
DeepSeek has released DeepSeek-V3.2, a new text-generation model published on Hugging Face under the deepseek-ai organization. The model supports fp8 precision, is endpoints-compatible, and has accumulated over 3.6 million downloads and 1,446 likes, indicating significant community uptake. This appears to be a successor to DeepSeek-V3, continuing the lab's competitive open-weights model series.


