Almanac
← Events
6Google DeepMind Blog·1mo ago

Gemma Scope 2: Interpretability Tools Released Across Entire Gemma 3 Family

DeepMind has released Gemma Scope 2, an open interpretability toolkit covering the full Gemma 3 model family. The release extends the original Gemma Scope effort to provide the AI safety community with tools for understanding complex language model behavior. By making these tools openly available across all Gemma 3 variants, DeepMind aims to support mechanistic interpretability research at scale.

Related guides (4)

Related events (8)

7Hugging Face Blog·1mo ago·source ↗

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Google released three new additions to the Gemma ecosystem: Gemma 2 2B, a small open-weights language model; ShieldGemma, a safety-focused classifier model; and Gemma Scope, an interpretability toolset. These releases expand the Gemma family with a smaller, more accessible model alongside dedicated safety and interpretability infrastructure. The announcement was published on the Hugging Face blog, indicating integration with the HF ecosystem.

7Hugging Face Blog·1mo ago·source ↗

Welcome Gemma 3: Google's All-New Multimodal, Multilingual, Long-Context Open LLM

Google has released Gemma 3, a new family of open-weights large language models featuring multimodal capabilities, multilingual support, and extended context windows. The Hugging Face blog post introduces the model family and its key features. Gemma 3 represents a significant update to Google's open-weights model line, expanding beyond text-only capabilities to include vision and broader language coverage.

6Hugging Face Blog·1mo ago·source ↗

Gemma 3n Fully Available in the Open-Source Ecosystem

Google's Gemma 3n model has been integrated into the open-source ecosystem via Hugging Face, making it broadly accessible for developers and researchers. The announcement covers availability of the model weights and tooling support within the Hugging Face platform. Gemma 3n is designed for efficient on-device inference, targeting mobile and edge deployment scenarios. This release extends the open-weights frontier model landscape with a multimodal-capable, efficiency-focused architecture.

7Google Deepmind Blog·1mo ago·source ↗

Introducing Gemma 3n: The Developer Guide

Google DeepMind has published a developer-focused guide introducing Gemma 3n, a new model in the Gemma open-weights family. The announcement is directed at the developer community and appears to describe architecture, usage, and integration details for the new release. As a Tier 1 source announcement, this represents a notable addition to Google's open-weights model lineup.

7Hugging Face Blog·1mo ago·source ↗

Welcome Gemma 2 - Google's new open LLM

Google released Gemma 2, a new open-weights large language model, announced via the Hugging Face blog. The post covers integration with the Hugging Face ecosystem and highlights the model's capabilities. Gemma 2 represents Google's continued investment in open-weight model releases to compete in the open-source LLM space.

5Google Deepmind Blog·1mo ago·source ↗

Introducing Gemma 3 270M: The compact model for hyper-efficient AI

Google DeepMind has released Gemma 3 270M, a 270-million parameter compact language model added to the Gemma 3 family. The model is positioned as a highly specialized, hyper-efficient tool for resource-constrained deployments. This extends the Gemma 3 lineup into the sub-billion parameter range, targeting edge and on-device use cases.

7Google Deepmind Blog·1mo ago·source ↗

Introducing Gemma 3

Google DeepMind has released Gemma 3, described as the most capable model runnable on a single GPU or TPU. The announcement comes from DeepMind's official blog, indicating a new generation of the open-weights Gemma model family. Specific capability details, parameter counts, and benchmark results are not included in the provided body text.

6arXiv · cs.AI·46h ago·source ↗

Interpretability study of DiffusionGemma reveals novel diffusion-specific reasoning phenomena

Researchers investigate the reasoning transparency of DiffusionGemma, a diffusion-based language model, decomposing transparency into variable and algorithmic components. They show that mapping information through an interpretable token bottleneck reduces DiffusionGemma's opaque serial depth from 28.6X to just 1.1X that of autoregressive Gemma 4, with no performance loss. Interpretability case studies uncover diffusion-specific phenomena including non-chronological reasoning, token smearing, and intermediate-context reasoning. Monitorability tests find DiffusionGemma comparable to Gemma 4, suggesting diffusion LMs are not inherently less amenable to safety oversight.