Almanac
Guide · Beginner

Google: The AI Lab That Builds Everything from DNA Models to Your Phone's Assistant

GoogleBeginneractive·v3 · live·generated 6d ago

Part of these paths

TL;DRGoogle is one of the world's most prolific AI organizations, running a research lab (DeepMind) and a product empire (Gemini, Search, Android) that together touch nearly every corner of artificial intelligence. Its strategy is unusually broad: releasing powerful open-weights models anyone can download, shipping frontier closed models that compete with OpenAI and Anthropic, and applying AI to science — from cancer screening to genomics — while also becoming the infrastructure backbone for competitors like Apple and Anthropic.

Key takeaways

  • Gemini 3.5 Flash, released at Google I/O 2026, tops the APEX-Agents-AA and MMMU-Pro benchmarks among mid-tier models and supports a 1M-token context window.
  • Gemma 4 12B is an open-weights, encoder-free multimodal model released under Apache 2.0, designed to run on consumer laptops.
  • Apple announced a new AI architecture centered on Google Gemini models, deepening a partnership that puts Gemini inside one of the world's most-used device ecosystems.
  • AlphaGenome, an open-weights model, interprets the ~98% of human DNA that regulates gene expression and correctly predicted expression changes linked to T-cell leukemia.
  • Google's Aletheia agent (using Gemini 3 Deep Think) produced 4 genuinely novel solutions to previously unsolved Erdős mathematics problems out of 200 evaluated.
  • Gemini 3.1 Pro Preview leads the 899-discipline KINA knowledge benchmark at 53.17%, ahead of Claude Opus 4.6 and GPT-5.4.

What Google is, in the AI world

Google is both a technology product company and one of the most active AI research organizations on the planet. Its AI work flows through Google DeepMind, a lab formed by merging Google Brain and the original DeepMind, which produces everything from frontier language models to scientific tools that analyze DNA. On the product side, Google ships AI through Gemini (its flagship model family), Search, Android, and a growing suite of developer tools.

What makes Google unusual is the sheer range of what it does — and the fact that it simultaneously competes with and supplies infrastructure to other major AI players.

Why it matters to you

If you use Google Search, Gmail, Android, or Chrome, you are already using Google's AI. But Google's reach goes further: Apple has built a new AI architecture around Google's Gemini models, meaning Gemini may soon power Siri on iPhones. Anthropic — one of Google's main competitors — runs significant parts of its training on Google's custom chips (called TPUs) under a multi-gigawatt compute deal. In other words, Google is both a player and a piece of the infrastructure that the whole industry runs on.

The model lineup: closed and open

Google runs two parallel tracks:

Gemini is the closed, frontier track — models you access through Google's products or API. Gemini 3.1 Pro Preview currently leads the 899-discipline KINA knowledge benchmark (53.17%), edging out rivals from OpenAI and Anthropic. Gemini 3.5 Flash, released at Google I/O 2026, is a mid-tier model aimed at developers who need speed and agentic capability at a lower cost than the top tier — it tops the APEX-Agents-AA and MMMU-Pro benchmarks among Flash-class models and supports a 1M-token context window (that means it can read roughly 750,000 words in one go).

Gemma is the open track — models anyone can download and run. The Gemma family has grown from the original release in early 2024 through Gemma 2, Gemma 3, and now Gemma 4. The latest, Gemma 4 12B, is notable for an "encoder-free" design that handles images and text in a single unified architecture, and it's licensed under Apache 2.0, meaning businesses can use it freely. It's designed to run on a consumer laptop, not just a data center.

Science and medicine: AI beyond chat

Google's DeepMind lab applies AI to hard scientific problems in ways that go well beyond answering questions:

  • AlphaGenome is an open-weights model that reads up to one million DNA base pairs at a time and predicts how genes are regulated — covering the roughly 98% of the genome that doesn't directly code for proteins. It correctly predicted expression changes linked to T-cell leukemia and matched or exceeded prior models in 47 out of 50 evaluations.
  • A mammography AI system was tested across 12 UK NHS clinics, processing scans in under 18 minutes versus over two days for human readers, and achieved higher sensitivity than the first human reader in retrospective testing on 116,000 scans.
  • A 27-billion-parameter model built on the Gemma architecture for single-cell biological analysis contributed to the discovery of a potential new cancer therapy pathway.
  • The Aletheia agentic system, powered by Gemini 3 Deep Think, produced 4 genuinely novel solutions to previously unsolved mathematical problems (from a set called Erdős problems) — problems that had stumped human mathematicians.

Agents and the next frontier

Google's stated long-term goal is to turn Gemini into a "universal AI assistant" — one that can plan, simulate future states, and act in the world rather than just respond to prompts. At Google I/O 2026, the company introduced Spark, a background agents platform, and Antigravity 2.0, an agent-first desktop application. Gemini 3.5 also now powers Google Search's overhauled interface.

CodeMender, announced by DeepMind, is an AI agent designed to find and fix critical software security vulnerabilities automatically — a sign that Google is moving AI from assistant to autonomous actor in high-stakes domains.

Google as infrastructure for the industry

One of the less-obvious but important facts about Google's AI position: it supplies the compute backbone for competitors. Anthropic signed a deal for multiple gigawatts of next-generation TPU capacity from Google and Broadcom, expected online from 2027. Google is also a major investor in Anthropic (up to $40 billion in funding-for-compute deals, according to industry reporting). And Google's SynthID watermarking system is being adopted by OpenAI as part of a cross-industry content provenance standard.

Safety and regulation

Google has voluntarily agreed to submit its frontier models — including versions with limited safety guardrails — to the U.S. government's new TRAINS task force (run by NIST), which evaluates AI for cybersecurity, biosecurity, and chemical weapons risks before public deployment. Google also published a security report cataloging how AI is being used to create harder-to-detect malware, signaling that it takes the offensive-capability risks of its own models seriously.

Where it's heading

The events in this bundle point to Google deepening on three fronts: (1) embedding Gemini into more consumer devices — most visibly through the Apple partnership; (2) expanding open-weights Gemma models that run locally, giving developers a Google-quality baseline without cloud dependency; and (3) pushing AI into scientific discovery, where AlphaGenome and the single-cell biology model suggest DeepMind sees medicine and genomics as a long-term frontier distinct from the language model race.

Google's AI ecosystem at a glance

Google's Gemini model tiers at a glance

ModelTypeContext WindowKey strengthPricing (input/output per M tokens)
Gemini 3.1 Pro PreviewClosed, frontierLeads KINA benchmark (53.17%); price & multimodal advantage over GPT-5.4
Gemini 3.5 FlashClosed, mid-tier1M tokensTops APEX-Agents-AA & MMMU-Pro (Flash tier); agentic gains$1.50 / $9.00
Gemma 4 12BOpen-weightsEncoder-free multimodal; runs on consumer laptops; Apache 2.0Free (open weights)

Data drawn from event bundle; unknown cells render —.

Timeline

  1. Gemma (first generation) released as Google's open-weights LLM entry

  2. Gemma 3 launches with multimodal, multilingual, and long-context capabilities

  3. DeepMind publishes vision for Gemini as a universal AI assistant and world model

  4. Gemma 4 released: frontier multimodal intelligence on-device

  5. Gemini 3.5 Flash released at Google I/O 2026

  6. Apple announces new AI architecture built around Google Gemini models

Related topics

FAQ

What is the difference between Gemini and Gemma?

Gemini is Google's closed, frontier model family — you access it through Google products or an API. Gemma is Google's open-weights family, meaning anyone can download and run the model themselves, including on a personal laptop.

Is Google just a search company doing AI on the side?

No — Google DeepMind is one of the world's leading AI research labs, producing frontier models, scientific AI tools (like AlphaGenome for DNA analysis), and safety research, while also supplying AI infrastructure to companies like Apple and Anthropic.

Why is Apple using Google's AI?

Apple announced a new AI architecture centered on Gemini models, reportedly distilling them for on-device use and routing harder queries to Google's cloud — a sign that even Apple, which builds its own chips, is leaning on Google's AI for capability.

Does Google release open-source AI models?

Yes — the Gemma family (Gemma, Gemma 2, Gemma 3, Gemma 4) are open-weights models released on Hugging Face under permissive licenses, designed to run on hardware ranging from servers to consumer laptops.

Stay current

Call Me Almanac pairs the week's AI news with guides like this one — Midweek & Sunday.

Versions

  • v3live6d ago
  • v2superseded11d ago
  • v1superseded16d ago

Related guides (4)

More on Google (6)

7Latent Space·1mo ago·source ↗

Google I/O 2026: Gemini 3.5 Flash, Omni, Spark Background Agents, and Antigravity 2.0

Google I/O 2026 featured a cluster of AI announcements including Gemini 3.5 Flash, a multimodal model codenamed Omni (NanoBanana for video), Spark (a background agents platform), and Antigravity 2.0. The AINews digest from Latent Space summarizes the breadth of Google's releases across model, product, and infrastructure layers. Details on capabilities and benchmarks are not yet elaborated in the available body text.

6The Batch·19d ago·source ↗

Google Debuted Lyria 3, An App That Turns Text or Images Into 30-Second Songs

Google launched Lyria 3, a latent diffusion-based music generation model integrated into the Gemini app and YouTube Shorts, capable of producing 30-second audio clips with vocals and instruments from text or image prompts. Unlike its predecessor Lyria 2, Lyria 3 was trained on licensed audio data and includes copyright-filtering safeguards, SynthID watermarking, and RLHF fine-tuning. The model is available free to Gemini users (18+) and YouTube Shorts creators, reaching an estimated 750 million users. Google also acquired ProducerAI (formerly Riffusion) shortly after launch, signaling continued investment in AI music tooling.

7The Batch·17d ago·source ↗

Google's Aletheia agent uses Gemini 3 Deep Think to generate novel solutions to unsolved Erdős problems

Google researchers introduced Aletheia, an agentic workflow using Gemini 3 Deep Think that generates, verifies, and revises solutions to previously unsolved mathematical problems. Applied to Erdős problems, Aletheia produced 13 correct solutions out of 200 evaluated, with 4 being genuinely novel contributions not found in existing literature. The announcement also reveals Gemini 3 Deep Think's benchmark performance: 48.4% on HLE, 84.6% on ARC-AGI-2, and 93.8% on GPQA Diamond. The system demonstrates both the promise and current limitations of AI-assisted mathematical research, with a 6.5% correct-under-intended-interpretation rate on a hard problem set.

7Hugging Face Blog·1mo ago·source ↗

Welcome Gemma 4: Frontier Multimodal Intelligence on Device

Google has released Gemma 4, a new open-weights multimodal model family announced via the Hugging Face blog. The release positions Gemma 4 as capable of frontier-level multimodal intelligence while being deployable on-device. As a tier-2 source commentary, the post likely covers model capabilities, availability on Hugging Face Hub, and integration details.

4Mit Technology Review — Ai·1mo ago·source ↗

What to expect from Google at I/O 2026

MIT Technology Review previews Google I/O 2026, characterizing Google as currently in 'third place' in the foundation model race. The piece sets expectations for announcements at the annual developer conference. The framing reflects ongoing competitive positioning analysis among major AI labs.

8Anthropic News·1mo ago·source ↗

Anthropic Expands Partnership with Google and Broadcom for Multi-Gigawatt TPU Compute Capacity

Anthropic has signed a new agreement with Google and Broadcom for multiple gigawatts of next-generation TPU capacity expected to come online starting in 2027, representing the company's largest compute commitment to date. The announcement coincides with Anthropic reporting run-rate revenue surpassing $30 billion, up from ~$9 billion at end of 2025, and the number of enterprise customers spending over $1M annually doubling to 1,000+ in under two months. The compute will be predominantly US-sited, extending Anthropic's November 2025 $50B American infrastructure commitment. Anthropic continues to operate across AWS Trainium, Google TPUs, and NVIDIA GPUs, with Amazon remaining its primary cloud and training partner.