Apple Foundation Models 3 (AFM 3) bring on-device AI to iPhones and Macs via Google Gemini distillation
Apple announced its third-generation Foundation Models (AFM 3), a family of models distilled from Google Gemini and designed to run on-device on Apple silicon, including iPhones and Macs. The flagship on-device model, AFM 3 Core Advanced, uses a novel 'Instruction-Following Pruning' technique as an alternative to standard mixture-of-experts routing, enabling faster inference and flash-memory storage with 20B total parameters but only 1-4B active. The family also includes cloud-hosted variants (AFM 3 Cloud, Cloud Image, Cloud Pro), and Apple's Foundation Models Framework will allow developers to swap in third-party models like Claude or Gemini. No public benchmark results have been released yet; Apple says they will follow later in 2026.
Related guides (3)
Related events (8)
Apple reveals new AI architecture built around Google Gemini models
Apple has announced a new AI architecture centered on Google Gemini models, representing a significant strategic shift in how Apple integrates third-party AI into its ecosystem. The announcement, reported by MacRumors and generating substantial Hacker News discussion, suggests a deepening partnership between Apple and Google for on-device and cloud AI capabilities. This move has implications for the competitive landscape of consumer AI and the positioning of both companies relative to OpenAI and other frontier labs.
Gemini 3.1 Flash-Lite: Built for intelligence at scale
Google DeepMind has released Gemini 3.1 Flash-Lite, described as the fastest and most cost-efficient model in the Gemini 3 series. The announcement positions it as optimized for high-throughput, cost-sensitive deployments at scale. The body is sparse, offering no benchmark details or capability specifics beyond the efficiency framing.
Gemini 3 Flash: frontier intelligence built for speed
Google DeepMind has announced Gemini 3 Flash, a new model positioned as a frontier-intelligence offering optimized for speed and cost efficiency. The announcement comes from the official DeepMind blog, indicating a formal product release. Specific capability details and benchmarks are not included in the available body text.
A new era of intelligence with Gemini 3
DeepMind has published a blog post titled 'A new era of intelligence with Gemini 3,' suggesting a major new model release or announcement in the Gemini series. The body content was not provided, but the title and source indicate this is a flagship model announcement from Google DeepMind. This would represent the next generation of the Gemini model family following Gemini 2.x.
Google I/O 2026: Gemini 3.5 Flash, Omni, Spark Background Agents, and Antigravity 2.0
Google I/O 2026 featured a cluster of AI announcements including Gemini 3.5 Flash, a multimodal model codenamed Omni (NanoBanana for video), Spark (a background agents platform), and Antigravity 2.0. The AINews digest from Latent Space summarizes the breadth of Google's releases across model, product, and infrastructure layers. Details on capabilities and benchmarks are not yet elaborated in the available body text.
Gemini 3.1 Pro: A smarter model for your most complex tasks
Google DeepMind has announced Gemini 3.1 Pro, a new model positioned for complex reasoning tasks where simple answers are insufficient. The announcement comes from the official DeepMind blog, indicating a flagship-tier release. The body content is minimal, providing little technical detail beyond the positioning statement.
The Batch: Claude Mythos 5 / Fable 5 debut, Apple AFM 3, Google Live Translate, OpenAI IPO filing, FrontierCode benchmark
Anthropic launched Claude Fable 5 (a safety-guardrailed model) and Claude Mythos 5 (same underlying model with safeguards removed, for vetted cyberdefense/infrastructure users via Project Glasswing with US government collaboration), both priced at $10/$50 per million tokens. Apple released five new Apple Foundation Models (AFM 3) spanning on-device and cloud tiers, built with Google and Nvidia infrastructure. Additional headlines cover Google's Gemini 3.5 Live Translate (70+ languages, real-time), OpenAI's confidential SEC IPO filing, a NotebookLM upgrade to Gemini 3.5, and Cognition's FrontierCode benchmark for code-quality evaluation where Claude Opus 4.8 leads at 34.3%.
Introducing Gemma 3 270M: The compact model for hyper-efficient AI
Google DeepMind has released Gemma 3 270M, a 270-million parameter compact language model added to the Gemma 3 family. The model is positioned as a highly specialized, hyper-efficient tool for resource-constrained deployments. This extends the Gemma 3 lineup into the sub-billion parameter range, targeting edge and on-device use cases.


