7The Batch (DeepLearning.AI)·3d ago

Apple Foundation Models 3 (AFM 3) bring on-device AI to iPhones and Macs via Google Gemini distillation

Apple announced its third-generation Foundation Models (AFM 3), a family of models distilled from Google Gemini and designed to run on-device on Apple silicon, including iPhones and Macs. The flagship on-device model, AFM 3 Core Advanced, uses a novel 'Instruction-Following Pruning' technique as an alternative to standard mixture-of-experts routing, enabling faster inference and flash-memory storage with 20B total parameters but only 1-4B active. The family also includes cloud-hosted variants (AFM 3 Cloud, Cloud Image, Cloud Pro), and Apple's Foundation Models Framework will allow developers to swap in third-party models like Claude or Gemini. No public benchmark results have been released yet; Apple says they will follow later in 2026.

Frontier Model Releases Inference Economics Multimodal Progress Gemini 3.1 Pro AFM 3 Cloud Pro Google AFM 3 Instruction-Following Pruning LanguageModel protocol AFM 3 Core Advanced Apple Apple Foundation Models Framework Claude Opus 4.8 Anthropic Amar Subramanya

Related guides (3)

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

Google

Google: The AI Lab That Builds Everything from DNA Models to Your Phone's Assistant

Read asBeginner In-depth

Anthropic

Anthropic: The AI Safety Company at the Center of the Frontier

Read asBeginner In-depth

Related events (8)

7Hacker News·21d ago·source ↗

Apple reveals new AI architecture built around Google Gemini models

Apple has announced a new AI architecture centered on Google Gemini models, representing a significant strategic shift in how Apple integrates third-party AI into its ecosystem. The announcement, reported by MacRumors and generating substantial Hacker News discussion, suggests a deepening partnership between Apple and Google for on-device and cloud AI capabilities. This move has implications for the competitive landscape of consumer AI and the positioning of both companies relative to OpenAI and other frontier labs.

Frontier Model Releases Enterprise Deployment Patterns Google Apple Gemini

6Google Deepmind Blog·1mo ago·source ↗

Gemini 3.1 Flash-Lite: Built for intelligence at scale

Google DeepMind has released Gemini 3.1 Flash-Lite, described as the fastest and most cost-efficient model in the Gemini 3 series. The announcement positions it as optimized for high-throughput, cost-sensitive deployments at scale. The body is sparse, offering no benchmark details or capability specifics beyond the efficiency framing.

Frontier Model Releases Inference Economics Google DeepMind Gemini 3.1 Flash Live Gemini +1 more

8Google Deepmind Blog·1mo ago·source ↗

Gemini 3 Flash: frontier intelligence built for speed

Google DeepMind has announced Gemini 3 Flash, a new model positioned as a frontier-intelligence offering optimized for speed and cost efficiency. The announcement comes from the official DeepMind blog, indicating a formal product release. Specific capability details and benchmarks are not included in the available body text.

Frontier Model Releases Inference Economics Google DeepMind Gemini 3 Flash Gemini

9Google Deepmind Blog·1mo ago·source ↗

A new era of intelligence with Gemini 3

DeepMind has published a blog post titled 'A new era of intelligence with Gemini 3,' suggesting a major new model release or announcement in the Gemini series. The body content was not provided, but the title and source indicate this is a flagship model announcement from Google DeepMind. This would represent the next generation of the Gemini model family following Gemini 2.x.

Long Context Evolution Frontier Model Releases Google DeepMind Gemini +1 more

7Latent Space·1mo ago·source ↗

Google I/O 2026: Gemini 3.5 Flash, Omni, Spark Background Agents, and Antigravity 2.0

Google I/O 2026 featured a cluster of AI announcements including Gemini 3.5 Flash, a multimodal model codenamed Omni (NanoBanana for video), Spark (a background agents platform), and Antigravity 2.0. The AINews digest from Latent Space summarizes the breadth of Google's releases across model, product, and infrastructure layers. Details on capabilities and benchmarks are not yet elaborated in the available body text.

Frontier Model Releases Inference Economics Spark Google Gemini 3.5 Flash +5 more

8Google Deepmind Blog·1mo ago·source ↗

Gemini 3.1 Pro: A smarter model for your most complex tasks

Google DeepMind has announced Gemini 3.1 Pro, a new model positioned for complex reasoning tasks where simple answers are insufficient. The announcement comes from the official DeepMind blog, indicating a flagship-tier release. The body content is minimal, providing little technical detail beyond the positioning statement.

Frontier Model Releases Enterprise Deployment Patterns Gemini 3.1 Pro Google DeepMind Gemini

7The Batch·19d ago·source ↗

The Batch: Claude Mythos 5 / Fable 5 debut, Apple AFM 3, Google Live Translate, OpenAI IPO filing, FrontierCode benchmark

Anthropic launched Claude Fable 5 (a safety-guardrailed model) and Claude Mythos 5 (same underlying model with safeguards removed, for vetted cyberdefense/infrastructure users via Project Glasswing with US government collaboration), both priced at $10/$50 per million tokens. Apple released five new Apple Foundation Models (AFM 3) spanning on-device and cloud tiers, built with Google and Nvidia infrastructure. Additional headlines cover Google's Gemini 3.5 Live Translate (70+ languages, real-time), OpenAI's confidential SEC IPO filing, a NotebookLM upgrade to Gemini 3.5, and Cognition's FrontierCode benchmark for code-quality evaluation where Claude Opus 4.8 leads at 34.3%.

Frontier Model Releases Evaluation and Benchmarking Claude Mythos Claude Opus 4.6 Google +19 more

5Google Deepmind Blog·1mo ago·source ↗

Introducing Gemma 3 270M: The compact model for hyper-efficient AI

Google DeepMind has released Gemma 3 270M, a 270-million parameter compact language model added to the Gemma 3 family. The model is positioned as a highly specialized, hyper-efficient tool for resource-constrained deployments. This extends the Gemma 3 lineup into the sub-billion parameter range, targeting edge and on-device use cases.

Open Weights Progress Inference Economics Gemma 3 Google DeepMind Gemma 3 270M +1 more