Almanac
Guide · In-depth

Google: Frontier AI Lab, Open-Weights Pioneer, and Ecosystem Integrator

GoogleIn-depthactive·v3 · live·generated 6d ago

Part of these paths

TL;DRGoogle operates across every layer of the AI stack — from open-weights research models to frontier proprietary systems, from consumer products to scientific applications — making it one of the broadest-footprint players in the field. Its DeepMind division drives both cutting-edge model research and applied science, while the Gemini product line competes directly at the frontier and increasingly anchors third-party ecosystems, including Apple's consumer AI strategy.

Key takeaways

  • Gemini 3.5 Flash launched at Google I/O 2026 with a 1M-token context window, mixture-of-experts architecture, and $1.50/$9.00 per million token pricing — three times the cost of its predecessor.
  • Gemini 3.1 Pro Preview leads the KINA knowledge benchmark at 53.17% across 261 disciplines, and tops PhysTool-Bench among evaluated MLLMs at 58.7% tool identification.
  • Gemma 4 (up to 31B parameters, Apache 2.0) and Gemma 4 12B (encoder-free multimodal) extend Google's open-weights line to on-device frontier-level multimodal capability.
  • Apple announced a new AI architecture centered on Google Gemini models, representing a major consumer-ecosystem integration win.
  • AlphaGenome, an open-weights model interpreting non-coding DNA, matched or exceeded prior models in 47 of 50 evaluations and correctly predicted expression changes linked to T-cell leukemia.
  • Google has agreed to provide Anthropic with multiple gigawatts of next-generation TPU capacity starting 2027, deepening a compute-for-investment relationship alongside up to $40B in funding.

What Google is in the AI landscape

Google — operating its AI research and product work primarily through Google DeepMind — is one of a small number of organizations that competes simultaneously at every layer of the AI stack: frontier proprietary models, open-weights releases, consumer products, scientific applications, and compute infrastructure. Its Gemini line anchors the proprietary side; its Gemma family anchors the open-weights side. Both have matured rapidly across the event window covered here.

The Gemini proprietary line

Gemini 3.1 Pro Preview currently represents Google's frontier proprietary capability. In independent evaluations, it leads the KINA knowledge benchmark (53.17% across 261 disciplines) and tops PhysTool-Bench among evaluated multimodal LLMs (58.7% tool identification, 21.0% end-to-end query completion — the latter figure also exposing how much headroom remains for embodied planning). On the Artificial Analysis Intelligence Index, it nearly ties GPT-5.4 Pro at 57.2 vs. 57 points, maintaining a price and multimodal advantage over OpenAI's offering.

Gemini 3.5 Flash, released at Google I/O 2026, is the current mid-tier entry: a mixture-of-experts multimodal model with a 1M-token context window, adjustable reasoning levels, thought preservation across multi-turn conversations, and top scores on APEX-Agents-AA and MMMU-Pro within the Flash tier. Its pricing — $1.50/$9.00 per million input/output tokens — is three times its predecessor Gemini 3 Flash, a positioning choice that has drawn scrutiny given that independent testing found it more expensive in practice than Gemini 3.1 Pro. Google I/O 2026 also introduced Gemini Omni Flash (multimodal video generation), Antigravity 2.0 (an agent-first desktop application), and Spark (a background agents platform), signaling a broad push into agentic product surfaces.

Gemini 3 Deep Think, an earlier reasoning-focused variant, powers the Aletheia agentic workflow that produced 13 correct solutions to previously unsolved Erdős mathematical problems — 4 of them genuinely novel contributions not found in existing literature. Its benchmark profile: 48.4% on HLE, 84.6% on ARC-AGI-2, 93.8% on GPQA Diamond.

The Gemma open-weights line

Google's open-weights strategy has evolved steadily since the original Gemma release in February 2024, through Gemma 2 (June 2024), Gemma 2 2B with ShieldGemma and Gemma Scope (July 2024), Gemma 3 (multimodal, multilingual, long-context; March 2025), and now Gemma 4.

Gemma 4, released April 2026 under Apache 2.0, scales to 31B parameters with strong benchmark performance and on-device deployability. Gemma 4 12B, released June 2026, introduces a unified encoder-free multimodal architecture — eliminating the separate vision encoder common in most multimodal models — and is designed to run on consumer laptops. A 27B Gemma-based foundation model for single-cell biological analysis, released by DeepMind in October 2025, contributed to the discovery of a potential cancer therapy pathway, illustrating how the Gemma architecture is being adapted for scientific domains beyond general language tasks.

Ecosystem integrations and strategic partnerships

The most consequential recent development in Google's ecosystem position is Apple's announcement of a new AI architecture centered on Google Gemini models, with Siri expected to use Gemini models distilled for on-device use alongside cloud routing. This represents a significant consumer-AI distribution win, placing Gemini inside Apple's hardware ecosystem.

Google is also a founding supporter of the Agentic AI Foundation (AAIF) under the Linux Foundation, which houses Anthropic's Model Context Protocol (MCP) — now integrated into Gemini — alongside OpenAI's AGENTS.md and Block's goose. MCP has reached 10,000+ active public servers and 97M+ monthly SDK downloads, making Google's participation in its governance a meaningful standards-layer commitment.

On content provenance, Google's SynthID watermarking system is being integrated into OpenAI's content credentials infrastructure, a cross-industry alignment on the C2PA standard.

Compute infrastructure and investment relationships

Google's compute relationship with Anthropic has deepened substantially. A new agreement signed April 2026 provides Anthropic with multiple gigawatts of next-generation TPU capacity expected online from 2027 — Anthropic's largest compute commitment to date — alongside Google deepening its investment toward $40B. Anthropic continues to operate across AWS Trainium, Google TPUs, and NVIDIA GPUs, with Google TPUs forming a significant training and inference substrate.

Google has also voluntarily agreed to submit models to the NIST TRAINS task force for pre-deployment national security evaluation, alongside Anthropic, OpenAI, Microsoft, and xAI — a posture consistent with its participation in Project Glasswing (Anthropic's cybersecurity consortium) and its broader engagement with safety-adjacent governance structures.

Scientific and applied AI

Beyond language models, DeepMind's applied science work spans several domains. AlphaGenome interprets the ~98% of human and mouse genomes that regulate gene expression rather than coding for proteins, taking up to 1M DNA base pairs as input and outputting roughly 6,000 human and 1,000 mouse gene properties; it matched or exceeded prior models in 47 of 50 evaluations and correctly predicted expression changes associated with T-cell acute lymphoblastic leukemia. Weights, API, and inference code are freely available for noncommercial use. CodeMender, announced October 2025, targets automated code security vulnerability identification and remediation. Google's mammography AI system, evaluated across 116,000 NHS scans, achieved higher sensitivity than the first human reader (0.541 vs. 0.437) and processed scans in under 18 minutes versus over two days for human readers, though clinician trust remains a deployment barrier.

Where it's heading

The event bundle points to three converging trajectories: (1) Gemini deepening its position as the AI backbone for third-party consumer ecosystems, most visibly Apple; (2) Gemma continuing to push the frontier of what open-weights models can do on-device, with encoder-free multimodal architecture as the current leading edge; and (3) DeepMind's scientific AI work — genomics, biology, mathematics — maturing from demonstrations into tools with measurable real-world impact. The multi-gigawatt TPU commitment to Anthropic also signals that Google's compute infrastructure is becoming a platform for the broader frontier lab ecosystem, not just its own model lines.

Google's AI product and ecosystem map

Google Gemini model tiers in the current landscape

ModelTypeContextPricing (in/out per M tokens)Notable benchmark
Gemini 3.1 Pro PreviewProprietary, frontierKINA: 53.17% (top); PhysTool-Bench: 58.7% tool ID (top)
Gemini 3.5 FlashProprietary, mid-tier MoE1M tokens$1.50 / $9.00Tops APEX-Agents-AA and MMMU-Pro (Flash tier)
Gemma 4 (up to 31B)Open-weights, Apache 2.0FreeStrong benchmark performance; on-device capable
Gemma 4 12BOpen-weights, encoder-free multimodalFreeUnified architecture; consumer laptop deployable
GPT-5.4 Pro (OpenAI)Proprietary, frontier1.05M tokensIntelligence Index: 57 (vs Gemini 3.1 Pro: 57.2)
Claude Opus 4.6 (Anthropic)Proprietary, frontier$5 / $25Leapfrogged by GPT-5.4 on most benchmarks per events

Cells drawn from event bundle; unknown cells render —. Gemini 3.1 Pro and GPT-5.4 comparison from independent benchmark sources cited in events.

Timeline

  1. Gemma (original) released as Google's open-weights LLM entry

  2. Gemma 2 released, deepening open-weights investment

  3. Gemma 3 adds multimodal, multilingual, and long-context capabilities

  4. Gemma 4 (up to 31B, Apache 2.0) released with frontier-level multimodal performance

  5. Anthropic–Google/Broadcom multi-gigawatt TPU deal announced; Google deepens Anthropic investment toward $40B

  6. Gemini 3.5 Flash released at Google I/O 2026

  7. Apple announces new AI architecture built around Google Gemini models

  8. Gemma 4 12B released: encoder-free multimodal open-weights model

Related topics

FAQ

What is the difference between Gemini and Gemma?

Gemini is Google's proprietary frontier model line (closed weights, API/product access); Gemma is Google's open-weights family released under Apache 2.0, designed for on-device and community deployment. Both lines have evolved in parallel, with Gemma 4 reaching 31B parameters and multimodal capability.

How does Google's Gemini 3.5 Flash compare to frontier models?

Gemini 3.5 Flash tops Flash-tier benchmarks (APEX-Agents-AA, MMMU-Pro) but trails leading frontier models on overall intelligence and coding; it is priced at $1.50/$9.00 per million tokens, three times its predecessor, positioning it as a mid-tier rather than budget offering.

What is Google's relationship with Anthropic?

Google is a major investor in Anthropic (deepening toward $40B) and a compute partner, with a new multi-gigawatt TPU deal expected to come online from 2027; Anthropic also participates in Google's Project Glasswing cybersecurity consortium.

Is Google involved in AI safety and pre-deployment evaluation?

Yes — Google has voluntarily agreed to submit models (including versions with limited guardrails) to the NIST TRAINS task force for pre-deployment national security evaluation, alongside Anthropic, OpenAI, Microsoft, and xAI.

What scientific AI work is Google DeepMind doing beyond language models?

DeepMind released AlphaGenome (non-coding DNA interpretation, open-weights for noncommercial use), a 27B Gemma-based single-cell biology model that contributed to a potential cancer therapy discovery, and CodeMender for automated code security remediation.

Stay current

Call Me Almanac pairs the week's AI news with guides like this one — Midweek & Sunday.

Versions

  • v3live6d ago
  • v2superseded11d ago
  • v1superseded16d ago

Related guides (4)

More on Google (6)

7Latent Space·1mo ago·source ↗

Google I/O 2026: Gemini 3.5 Flash, Omni, Spark Background Agents, and Antigravity 2.0

Google I/O 2026 featured a cluster of AI announcements including Gemini 3.5 Flash, a multimodal model codenamed Omni (NanoBanana for video), Spark (a background agents platform), and Antigravity 2.0. The AINews digest from Latent Space summarizes the breadth of Google's releases across model, product, and infrastructure layers. Details on capabilities and benchmarks are not yet elaborated in the available body text.

6The Batch·19d ago·source ↗

Google Debuted Lyria 3, An App That Turns Text or Images Into 30-Second Songs

Google launched Lyria 3, a latent diffusion-based music generation model integrated into the Gemini app and YouTube Shorts, capable of producing 30-second audio clips with vocals and instruments from text or image prompts. Unlike its predecessor Lyria 2, Lyria 3 was trained on licensed audio data and includes copyright-filtering safeguards, SynthID watermarking, and RLHF fine-tuning. The model is available free to Gemini users (18+) and YouTube Shorts creators, reaching an estimated 750 million users. Google also acquired ProducerAI (formerly Riffusion) shortly after launch, signaling continued investment in AI music tooling.

7The Batch·17d ago·source ↗

Google's Aletheia agent uses Gemini 3 Deep Think to generate novel solutions to unsolved Erdős problems

Google researchers introduced Aletheia, an agentic workflow using Gemini 3 Deep Think that generates, verifies, and revises solutions to previously unsolved mathematical problems. Applied to Erdős problems, Aletheia produced 13 correct solutions out of 200 evaluated, with 4 being genuinely novel contributions not found in existing literature. The announcement also reveals Gemini 3 Deep Think's benchmark performance: 48.4% on HLE, 84.6% on ARC-AGI-2, and 93.8% on GPQA Diamond. The system demonstrates both the promise and current limitations of AI-assisted mathematical research, with a 6.5% correct-under-intended-interpretation rate on a hard problem set.

7Hugging Face Blog·1mo ago·source ↗

Welcome Gemma 4: Frontier Multimodal Intelligence on Device

Google has released Gemma 4, a new open-weights multimodal model family announced via the Hugging Face blog. The release positions Gemma 4 as capable of frontier-level multimodal intelligence while being deployable on-device. As a tier-2 source commentary, the post likely covers model capabilities, availability on Hugging Face Hub, and integration details.

4Mit Technology Review — Ai·1mo ago·source ↗

What to expect from Google at I/O 2026

MIT Technology Review previews Google I/O 2026, characterizing Google as currently in 'third place' in the foundation model race. The piece sets expectations for announcements at the annual developer conference. The framing reflects ongoing competitive positioning analysis among major AI labs.

8Anthropic News·1mo ago·source ↗

Anthropic Expands Partnership with Google and Broadcom for Multi-Gigawatt TPU Compute Capacity

Anthropic has signed a new agreement with Google and Broadcom for multiple gigawatts of next-generation TPU capacity expected to come online starting in 2027, representing the company's largest compute commitment to date. The announcement coincides with Anthropic reporting run-rate revenue surpassing $30 billion, up from ~$9 billion at end of 2025, and the number of enterprise customers spending over $1M annually doubling to 1,000+ in under two months. The compute will be predominantly US-sited, extending Anthropic's November 2025 $50B American infrastructure commitment. Anthropic continues to operate across AWS Trainium, Google TPUs, and NVIDIA GPUs, with Amazon remaining its primary cloud and training partner.