Gemma
gemma-f6a2f419·15 events·first seen 29d agoAliases: Gemma
Co-occurring entities
More like this (12)
Recent events (15)
DeepMind Launches 27B Parameter Gemma-Based Foundation Model for Single-Cell Analysis
DeepMind has released a new 27 billion parameter foundation model built on the Gemma open-model family, specifically designed for single-cell biological analysis. The model contributed to the discovery of a new potential cancer therapy pathway. This represents a significant application of large language model architecture to computational biology and genomics research.
Welcome Gemma - Google's new open LLM
Google released Gemma, a family of open-weight large language models, announced via the Hugging Face blog. The models are positioned as Google's entry into the open-weights LLM space, following the success of models like Llama 2. This release marks a significant strategic move by Google to compete in the open-source AI ecosystem.
Fine-Tuning Gemma Models in Hugging Face
Hugging Face published a guide on fine-tuning Google's Gemma models using parameter-efficient fine-tuning (PEFT) techniques. The post covers practical workflows for adapting Gemma to downstream tasks within the Hugging Face ecosystem. This represents part of the broader tooling support rollout accompanying Gemma's release in February 2024.
T5Gemma: A new collection of encoder-decoder Gemma models
DeepMind has announced T5Gemma, a new collection of encoder-decoder large language models under the Gemma family. The release extends the Gemma model line beyond its existing decoder-only architecture to include encoder-decoder variants, following the T5 paradigm. Further technical details are sparse in the announcement but the models represent a notable architectural expansion of the open Gemma ecosystem.
Introducing Gemma 3n: The Developer Guide
Google DeepMind has published a developer-focused guide introducing Gemma 3n, a new model in the Gemma open-weights family. The announcement is directed at the developer community and appears to describe architecture, usage, and integration details for the new release. As a Tier 1 source announcement, this represents a notable addition to Google's open-weights model lineup.
PaliGemma – Google's Cutting-Edge Open Vision Language Model
Google released PaliGemma, an open-weights vision-language model built on the PaLI architecture combined with Gemma language components. The model is hosted and documented on Hugging Face, making it accessible for research and fine-tuning. PaliGemma targets multimodal tasks including image captioning, visual question answering, and object detection.
Welcome EmbeddingGemma, Google's new efficient embedding model
Google has released EmbeddingGemma, a new embedding model announced via the Hugging Face blog. The model appears to be positioned as an efficient option for generating text embeddings, likely derived from or related to the Gemma model family. Details on architecture, benchmarks, and use cases are expected in the full post.
CodeGemma - Google's Official Code-Focused LLM Release
Google has released CodeGemma, a family of code-specialized large language models, announced via the Hugging Face blog. CodeGemma builds on the Gemma model family and is targeted at code generation and understanding tasks. The release represents Google's continued push into open-weights code LLMs to compete with models like Code Llama and DeepSeek Coder.
MedGemma: DeepMind releases most capable open models for health AI development
Google DeepMind has announced new multimodal models in the MedGemma collection, described as their most capable open models for health AI development. The release expands the MedGemma family with enhanced multimodal capabilities targeting medical and clinical AI applications. As open models, they are intended to support developers building health AI systems.
VaultGemma: The world's most capable differentially private LLM
DeepMind introduces VaultGemma, a large language model trained from scratch using differential privacy (DP), claiming it as the most capable DP-trained model to date. The announcement positions VaultGemma as a significant advance in privacy-preserving AI, combining strong utility with formal privacy guarantees. The blog post is brief and likely precedes a more detailed technical disclosure.
Announcing Gemma 3n Preview: Powerful, Efficient, Mobile-First AI
Google DeepMind has released a preview of Gemma 3n, an open-weights model optimized for on-device multimodal inference. The model features a 2-in-1 architecture for flexible deployment and adds audio understanding to its multimodal capabilities. It is designed for mobile and edge environments, targeting developers building real-time interactive applications.
Introducing Gemma 3
Google DeepMind has released Gemma 3, described as the most capable model runnable on a single GPU or TPU. The announcement comes from DeepMind's official blog, indicating a new generation of the open-weights Gemma model family. Specific capability details, parameter counts, and benchmark results are not included in the provided body text.
DeepMind announces DiffusionGemma with 4x faster text generation
DeepMind published a blog post introducing DiffusionGemma, a diffusion-based variant of the Gemma model family claiming 4x faster text generation. The announcement suggests a departure from standard autoregressive decoding in favor of diffusion-based generation. If the claims hold, this could represent a meaningful inference efficiency advance for the Gemma line.
DolphinGemma: Google DeepMind LLM for Decoding Dolphin Communication
Google DeepMind has developed DolphinGemma, a large language model designed to help scientists analyze and decode dolphin communication patterns. The model is being applied to the scientific challenge of understanding cetacean vocalizations. This represents a novel application of LLM-based sequence modeling to non-human animal communication research.
Contrastive-Difference CKA reveals concept-specific structural alignment across LLM architectures
Researchers introduce CKA_Delta (contrastive-difference CKA), a training-free diagnostic that isolates concept-specific representational convergence from generic similarity across LLM architectures. The method reveals a geometric-functional universality dissociation: moderate geometric alignment coexists with near-perfect functional transfer across six concept domains and multiple architectural families. CKA_Delta also functions as an architectural outlier detector, flagging Gemma as a notable outlier (d=1.08, AUC=0.79). The work provides a practical tool for cross-architecture concept monitoring without requiring model training.