5Simon Willison's Weblog·1mo ago

Gemini 3.5 Flash: more expensive, but Google plan to use it for everything

Simon Willison offers commentary on Google's Gemini 3.5 Flash model release, noting it is priced higher than its predecessor while Google intends to deploy it broadly across its products. The piece reflects on the pricing shift and Google's strategic positioning of the model as a general-purpose workhorse. As a tier-2 commentary source, this provides analyst perspective rather than primary technical detail.

Frontier Model Releases Inference Economics Enterprise Deployment Patterns Google Gemini 3.5 Flash Simon Willison

Related guides (4)

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

Google

Google: The AI Lab That Builds Everything from DNA Models to Your Phone's Assistant

Read asBeginner

Enterprise Deployment PatternsTopic guide

Enterprise Deployment Patterns: From AI Demo to Production Reality

Read asBeginner In-depth

Inference EconomicsTopic guide

Inference Economics: The Cost Structure of Running AI Models in Production

Read asIn-depth

Related events (8)

7Hacker News·1mo ago·source ↗

Gemini 3.5 Flash Released

Google has released Gemini 3.5 Flash, a new model in the Gemini family. The announcement appears on Google's official blog and has generated significant community discussion on Hacker News with 381 points and 304 comments. Gemini 3.5 Flash follows the Flash line of efficiency-focused models from Google DeepMind.

Frontier Model Releases Inference Economics Google Gemini 3.5 Flash Google DeepMind +3 more

6The Batch·22d ago·source ↗

Google Launches Gemini 3.5 Flash: Mid-Tier Model With Agentic Gains at 3x Higher Price

Google released Gemini 3.5 Flash at Google I/O 2026, a mixture-of-experts multimodal model with adjustable reasoning levels, thought preservation across multi-turn conversations, and a 1M-token context window. The model tops APEX-Agents-AA and MMMU-Pro benchmarks among Flash-tier models but trails leading frontier models on overall intelligence, knowledge, and coding. Pricing is $1.50/$9.00 per million input/output tokens—three times the cost of its predecessor Gemini 3 Flash—raising questions about Google's positioning of Flash as a mid-tier rather than budget offering. Independent testing found it costs more in practice than Gemini 3.1 Pro despite Google's claims of competitive pricing.

Frontier Model Releases Evaluation and Benchmarking Google AI Studio Artificial Analysis Intelligence Index Claude Opus 4.6 +17 more

6Google Deepmind Blog·1mo ago·source ↗

Gemini 3.1 Flash-Lite: Built for intelligence at scale

Google DeepMind has released Gemini 3.1 Flash-Lite, described as the fastest and most cost-efficient model in the Gemini 3 series. The announcement positions it as optimized for high-throughput, cost-sensitive deployments at scale. The body is sparse, offering no benchmark details or capability specifics beyond the efficiency framing.

Frontier Model Releases Inference Economics Google DeepMind Gemini 3.1 Flash Live Gemini +1 more

4Don'T Worry About The Vase·29d ago·source ↗

Gemini 3.5 Flash Looks Good For How Fast It Is

Zvi Mowshowitz offers commentary on Google's Gemini 3.5 Flash model, characterizing it as a competitive option given its speed profile. The piece is a tier-2 commentary assessing the model's positioning in the current landscape. The headline framing suggests the model is notable primarily in the speed-vs-capability tradeoff rather than as a frontier capability leader.

Frontier Model Releases Inference Economics Google Gemini 3.5 Flash Zvi Mowshowitz

8Google Deepmind Blog·1mo ago·source ↗

Gemini 3 Flash: frontier intelligence built for speed

Google DeepMind has announced Gemini 3 Flash, a new model positioned as a frontier-intelligence offering optimized for speed and cost efficiency. The announcement comes from the official DeepMind blog, indicating a formal product release. Specific capability details and benchmarks are not included in the available body text.

Frontier Model Releases Inference Economics Google DeepMind Gemini 3 Flash Gemini

8Google Deepmind Blog·1mo ago·source ↗

Gemini 2.5 Family Expansion: Flash and Pro GA, Flash-Lite Introduced

Google DeepMind has made Gemini 2.5 Flash and Gemini 2.5 Pro generally available, while simultaneously introducing Gemini 2.5 Flash-Lite, described as the most cost-efficient and fastest model in the 2.5 family. The announcement marks the full productization of the Gemini 2.5 generation. Flash-Lite targets latency- and cost-sensitive deployment scenarios.

Frontier Model Releases Inference Economics Gemini-2.5-Flash-Lite Google DeepMind Gemini-2.5-Pro +1 more

5Google Deepmind Blog·1mo ago·source ↗

Gemini 2.5 Flash-Lite reaches general availability for production use

Google DeepMind has moved Gemini 2.5 Flash-Lite from preview to stable general availability. The model is positioned as a cost-efficient, small-footprint option within the 2.5 family, retaining key features including a 1 million-token context window and multimodal capabilities. It is now ready for scaled production deployment.

Long Context Evolution Frontier Model Releases Gemini 2.5 Gemini-2.5-Flash-Lite Google DeepMind +2 more

6The Batch·22d ago·source ↗

Gemini 3.5 Flash Launch, AI FDE Job Trends, AI Act Delays, and Agent-Driven Web Traffic

Google launched Gemini 3.5 Flash, a mid-tier multimodal mixture-of-experts model with improved agentic capabilities, visual understanding, and speed, priced at $1.50/$9.00 per million input/output tokens — three times the cost of its predecessor Gemini 3 Flash. The model supports up to 1M token context, adjustable reasoning levels, and thought preservation across multi-turn conversations, and tops the Artificial Analysis APEX-Agents-AA and MMMU-Pro benchmarks. The issue also covers Andrew Ng's commentary on the rise of AI Forward Deployed Engineers versus the broader AI Engineer role, plus news items on EU AI Act implementation delays and AI agents driving measurable online traffic shifts.

Frontier Model Releases Evaluation and Benchmarking Gemini 3.5 Pro Palantir Artificial Analysis Intelligence Index +18 more