Almanac
← Events
5Simon Willison's Weblog·10d ago

Simon Willison on DiffusionGemma

Simon Willison covers DiffusionGemma, a diffusion-based language model in the Gemma family from Google. The post appears to be commentary or a brief note on the model's release or capabilities. Diffusion-based LLMs represent an active area of research as an alternative to autoregressive generation.

Related guides (3)

Related events (8)

7Google Deepmind Blog·10d ago·source ↗

DeepMind announces DiffusionGemma with 4x faster text generation

DeepMind published a blog post introducing DiffusionGemma, a diffusion-based variant of the Gemma model family claiming 4x faster text generation. The announcement suggests a departure from standard autoregressive decoding in favor of diffusion-based generation. If the claims hold, this could represent a meaningful inference efficiency advance for the Gemma line.

6arXiv · cs.AI·46h ago·source ↗

Interpretability study of DiffusionGemma reveals novel diffusion-specific reasoning phenomena

Researchers investigate the reasoning transparency of DiffusionGemma, a diffusion-based language model, decomposing transparency into variable and algorithmic components. They show that mapping information through an interpretable token bottleneck reduces DiffusionGemma's opaque serial depth from 28.6X to just 1.1X that of autoregressive Gemma 4, with no performance loss. Interpretability case studies uncover diffusion-specific phenomena including non-chronological reasoning, token smearing, and intermediate-context reasoning. Monitorability tests find DiffusionGemma comparable to Gemma 4, suggesting diffusion LMs are not inherently less amenable to safety oversight.

7Hugging Face Blog·1mo ago·source ↗

Welcome Gemma 2 - Google's new open LLM

Google released Gemma 2, a new open-weights large language model, announced via the Hugging Face blog. The post covers integration with the Hugging Face ecosystem and highlights the model's capabilities. Gemma 2 represents Google's continued investment in open-weight model releases to compete in the open-source LLM space.

6Hugging Face Blog·1mo ago·source ↗

CodeGemma - Google's Official Code-Focused LLM Release

Google has released CodeGemma, a family of code-specialized large language models, announced via the Hugging Face blog. CodeGemma builds on the Gemma model family and is targeted at code generation and understanding tasks. The release represents Google's continued push into open-weights code LLMs to compete with models like Code Llama and DeepSeek Coder.

7Hugging Face Blog·1mo ago·source ↗

Welcome Gemma 3: Google's All-New Multimodal, Multilingual, Long-Context Open LLM

Google has released Gemma 3, a new family of open-weights large language models featuring multimodal capabilities, multilingual support, and extended context windows. The Hugging Face blog post introduces the model family and its key features. Gemma 3 represents a significant update to Google's open-weights model line, expanding beyond text-only capabilities to include vision and broader language coverage.

7Hugging Face Blog·1mo ago·source ↗

Welcome Gemma - Google's new open LLM

Google released Gemma, a family of open-weight large language models, announced via the Hugging Face blog. The models are positioned as Google's entry into the open-weights LLM space, following the success of models like Llama 2. This release marks a significant strategic move by Google to compete in the open-source AI ecosystem.

7The Batch·3d ago·source ↗

DiffusionGemma hits 1,000+ tokens/sec; Claude Fable 5 export controls; Agents' Last Exam benchmark launch

Google introduced DiffusionGemma, an experimental 26B MoE model using diffusion-based text generation that produces 256-token blocks simultaneously, achieving over 1,000 tokens/second on H100 hardware at the cost of lower output quality versus standard Gemma 4. Separately, the US government issued an export control directive forcing Anthropic to suspend Claude Fable 5 and Claude Mythos 5 globally, while Anthropic also reversed a controversial silent-degradation safeguard on Fable 5 after researcher backlash. UC Berkeley's Center for RDI launched Agents' Last Exam (ALE), a 1,500+ task agentic benchmark using deterministic grading, where GPT-5.5 topped the leaderboard at only 24% pass rate, highlighting the difficulty gap between current models and professional-grade workflows.

4Simon Willison'S Weblog·1mo ago·source ↗

Simon Willison's Commentary on Google I/O, Gemini Spark, and Antigravity

Simon Willison provides commentary on Google I/O 2026 announcements, including Gemini Spark and something referred to as Antigravity. As a tier-2 source, this represents an analyst perspective on Google's AI announcements rather than primary source material. The body content appears empty, limiting the depth of analysis available.