Almanac
model

Gemma

modelactiveprovisionalgemma-f6a2f419·15 events·first seen 29d ago

Aliases: Gemma

Co-occurring entities

More like this (12)

Recent events (15)

7Google Deepmind Blog·29d ago·source ↗

DeepMind Launches 27B Parameter Gemma-Based Foundation Model for Single-Cell Analysis

DeepMind has released a new 27 billion parameter foundation model built on the Gemma open-model family, specifically designed for single-cell biological analysis. The model contributed to the discovery of a new potential cancer therapy pathway. This represents a significant application of large language model architecture to computational biology and genomics research.

7Hugging Face Blog·28d ago·source ↗

Welcome Gemma - Google's new open LLM

Google released Gemma, a family of open-weight large language models, announced via the Hugging Face blog. The models are positioned as Google's entry into the open-weights LLM space, following the success of models like Llama 2. This release marks a significant strategic move by Google to compete in the open-source AI ecosystem.

4Hugging Face Blog·28d ago·source ↗

Fine-Tuning Gemma Models in Hugging Face

Hugging Face published a guide on fine-tuning Google's Gemma models using parameter-efficient fine-tuning (PEFT) techniques. The post covers practical workflows for adapting Gemma to downstream tasks within the Hugging Face ecosystem. This represents part of the broader tooling support rollout accompanying Gemma's release in February 2024.

6Google Deepmind Blog·29d ago·source ↗

T5Gemma: A new collection of encoder-decoder Gemma models

DeepMind has announced T5Gemma, a new collection of encoder-decoder large language models under the Gemma family. The release extends the Gemma model line beyond its existing decoder-only architecture to include encoder-decoder variants, following the T5 paradigm. Further technical details are sparse in the announcement but the models represent a notable architectural expansion of the open Gemma ecosystem.

7Google Deepmind Blog·29d ago·source ↗

Introducing Gemma 3n: The Developer Guide

Google DeepMind has published a developer-focused guide introducing Gemma 3n, a new model in the Gemma open-weights family. The announcement is directed at the developer community and appears to describe architecture, usage, and integration details for the new release. As a Tier 1 source announcement, this represents a notable addition to Google's open-weights model lineup.

6Hugging Face Blog·28d ago·source ↗

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Google released PaliGemma, an open-weights vision-language model built on the PaLI architecture combined with Gemma language components. The model is hosted and documented on Hugging Face, making it accessible for research and fine-tuning. PaliGemma targets multimodal tasks including image captioning, visual question answering, and object detection.

6Hugging Face Blog·28d ago·source ↗

Welcome EmbeddingGemma, Google's new efficient embedding model

Google has released EmbeddingGemma, a new embedding model announced via the Hugging Face blog. The model appears to be positioned as an efficient option for generating text embeddings, likely derived from or related to the Gemma model family. Details on architecture, benchmarks, and use cases are expected in the full post.

6Hugging Face Blog·28d ago·source ↗

CodeGemma - Google's Official Code-Focused LLM Release

Google has released CodeGemma, a family of code-specialized large language models, announced via the Hugging Face blog. CodeGemma builds on the Gemma model family and is targeted at code generation and understanding tasks. The release represents Google's continued push into open-weights code LLMs to compete with models like Code Llama and DeepSeek Coder.

7Google Deepmind Blog·29d ago·source ↗

MedGemma: DeepMind releases most capable open models for health AI development

Google DeepMind has announced new multimodal models in the MedGemma collection, described as their most capable open models for health AI development. The release expands the MedGemma family with enhanced multimodal capabilities targeting medical and clinical AI applications. As open models, they are intended to support developers building health AI systems.

7Google Deepmind Blog·29d ago·source ↗

VaultGemma: The world's most capable differentially private LLM

DeepMind introduces VaultGemma, a large language model trained from scratch using differential privacy (DP), claiming it as the most capable DP-trained model to date. The announcement positions VaultGemma as a significant advance in privacy-preserving AI, combining strong utility with formal privacy guarantees. The blog post is brief and likely precedes a more detailed technical disclosure.

7Google Deepmind Blog·29d ago·source ↗

Announcing Gemma 3n Preview: Powerful, Efficient, Mobile-First AI

Google DeepMind has released a preview of Gemma 3n, an open-weights model optimized for on-device multimodal inference. The model features a 2-in-1 architecture for flexible deployment and adds audio understanding to its multimodal capabilities. It is designed for mobile and edge environments, targeting developers building real-time interactive applications.

7Google Deepmind Blog·29d ago·source ↗

Introducing Gemma 3

Google DeepMind has released Gemma 3, described as the most capable model runnable on a single GPU or TPU. The announcement comes from DeepMind's official blog, indicating a new generation of the open-weights Gemma model family. Specific capability details, parameter counts, and benchmark results are not included in the provided body text.

7Google Deepmind Blog·6d ago·source ↗

DeepMind announces DiffusionGemma with 4x faster text generation

DeepMind published a blog post introducing DiffusionGemma, a diffusion-based variant of the Gemma model family claiming 4x faster text generation. The announcement suggests a departure from standard autoregressive decoding in favor of diffusion-based generation. If the claims hold, this could represent a meaningful inference efficiency advance for the Gemma line.

5Google Deepmind Blog·29d ago·source ↗

DolphinGemma: Google DeepMind LLM for Decoding Dolphin Communication

Google DeepMind has developed DolphinGemma, a large language model designed to help scientists analyze and decode dolphin communication patterns. The model is being applied to the scientific challenge of understanding cetacean vocalizations. This represents a novel application of LLM-based sequence modeling to non-human animal communication research.

5arXiv · cs.CL·30h ago·source ↗

Contrastive-Difference CKA reveals concept-specific structural alignment across LLM architectures

Researchers introduce CKA_Delta (contrastive-difference CKA), a training-free diagnostic that isolates concept-specific representational convergence from generic similarity across LLM architectures. The method reveals a geometric-functional universality dissociation: moderate geometric alignment coexists with near-perfect functional transfer across six concept domains and multiple architectural families. CKA_Delta also functions as an architectural outlier detector, flagging Gemma as a notable outlier (d=1.08, AUC=0.79). The work provides a practical tool for cross-architecture concept monitoring without requiring model training.