Google DeepMind releases Gemma 4 12B, a unified encoder-free multimodal open model
Google DeepMind has released Gemma 4 12B, a new open-weights multimodal model that uses a unified, encoder-free architecture. The model is positioned as a capable multimodal system at the 12B parameter scale. This is notable as an open-weights release from a frontier lab with an architectural distinction — eliminating the separate vision encoder common in most multimodal models.
Related guides (5)

Google DeepMind
Google DeepMind: Frontier AI Across Models, Robotics, and Scientific Discovery

Google: The AI Lab That Builds Everything from DNA Models to Your Phone's Assistant
Related events (8)
Gemma 4: Google DeepMind Releases Most Capable Open Models
Google DeepMind has released Gemma 4, described as their most capable open models to date. The models are purpose-built for advanced reasoning and agentic workflows, and are positioned as the most capable open models byte-for-byte. The announcement comes from DeepMind's official blog, indicating a significant open-weights release targeting the frontier open model space.
Welcome Gemma 4: Frontier Multimodal Intelligence on Device
Google has released Gemma 4, a new open-weights multimodal model family announced via the Hugging Face blog. The release positions Gemma 4 as capable of frontier-level multimodal intelligence while being deployable on-device. As a tier-2 source commentary, the post likely covers model capabilities, availability on Hugging Face Hub, and integration details.
Introducing Gemma 3 270M: The compact model for hyper-efficient AI
Google DeepMind has released Gemma 3 270M, a 270-million parameter compact language model added to the Gemma 3 family. The model is positioned as a highly specialized, hyper-efficient tool for resource-constrained deployments. This extends the Gemma 3 lineup into the sub-billion parameter range, targeting edge and on-device use cases.
MedGemma: DeepMind releases most capable open models for health AI development
Google DeepMind has announced new multimodal models in the MedGemma collection, described as their most capable open models for health AI development. The release expands the MedGemma family with enhanced multimodal capabilities targeting medical and clinical AI applications. As open models, they are intended to support developers building health AI systems.
Welcome Gemma 3: Google's All-New Multimodal, Multilingual, Long-Context Open LLM
Google has released Gemma 3, a new family of open-weights large language models featuring multimodal capabilities, multilingual support, and extended context windows. The Hugging Face blog post introduces the model family and its key features. Gemma 3 represents a significant update to Google's open-weights model line, expanding beyond text-only capabilities to include vision and broader language coverage.
Announcing Gemma 3n Preview: Powerful, Efficient, Mobile-First AI
Google DeepMind has released a preview of Gemma 3n, an open-weights model optimized for on-device multimodal inference. The model features a 2-in-1 architecture for flexible deployment and adds audio understanding to its multimodal capabilities. It is designed for mobile and edge environments, targeting developers building real-time interactive applications.
T5Gemma: A new collection of encoder-decoder Gemma models
DeepMind has announced T5Gemma, a new collection of encoder-decoder large language models under the Gemma family. The release extends the Gemma model line beyond its existing decoder-only architecture to include encoder-decoder variants, following the T5 paradigm. Further technical details are sparse in the announcement but the models represent a notable architectural expansion of the open Gemma ecosystem.
Introducing Gemma 3
Google DeepMind has released Gemma 3, described as the most capable model runnable on a single GPU or TPU. The announcement comes from DeepMind's official blog, indicating a new generation of the open-weights Gemma model family. Specific capability details, parameter counts, and benchmark results are not included in the provided body text.


