8 · Google DeepMind Blog · 1mo ago

Introducing Gemini 2.5 Flash

Google DeepMind has released Gemini 2.5 Flash, described as their first fully hybrid reasoning model. The model allows developers to toggle 'thinking' (extended reasoning) on or off, combining standard and chain-of-thought inference modes in a single model. It is available to developers and represents a new architectural approach to balancing reasoning depth with inference cost.
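
As a rough illustration of what toggling thinking per request could look like, the sketch below builds a request body in Python. The field names (`generationConfig`, `thinkingConfig`, `thinkingBudget`) and the convention that a budget of 0 turns thinking off are assumptions based on the public Gemini API documentation, not details from this summary, and the helper `build_generate_request` is hypothetical. Check the official API reference before relying on any of it.

```python
import json


def build_generate_request(prompt: str, thinking: bool, budget_tokens: int = 1024) -> dict:
    """Build a JSON-serializable request body with thinking toggled on or off.

    Assumption: the API reads generationConfig.thinkingConfig.thinkingBudget,
    and a budget of 0 disables extended reasoning.
    """
    if thinking and budget_tokens <= 0:
        raise ValueError("budget_tokens must be positive when thinking is enabled")
    return {
        "contents": [{"parts": [{"text": prompt}]}],
        "generationConfig": {
            "thinkingConfig": {"thinkingBudget": budget_tokens if thinking else 0}
        },
    }


# Same model, two modes: a cheap direct answer and a budgeted reasoning pass.
fast = build_generate_request("Summarize this paragraph.", thinking=False)
deep = build_generate_request("Prove the claim step by step.", thinking=True, budget_tokens=2048)

print(json.dumps(fast["generationConfig"]))
print(json.dumps(deep["generationConfig"]))
```

The point of the sketch is that the switch lives in the request rather than in the choice of model, which is what lets a developer trade reasoning depth against inference cost call by call.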

Long Context Evolution · Frontier Model Releases · Inference Economics · Agent and Tool Ecosystem · Gemini-2.5-Flash-Lite · Google DeepMind · Gemini-2.5-Pro · hybrid reasoning

Related guides (4)

Google DeepMind

Google DeepMind: Frontier AI Across Models, Robotics, and Scientific Discovery

Frontier Model Releases · Topic guide

Frontier Model Releases: The Race From Language to Action

Long Context Evolution · Topic guide

Long Context Evolution: From Bigger Windows to Smarter Memory

Agent and Tool Ecosystem · Topic guide

Agent and Tool Ecosystem: How the Infrastructure Layer Around LLMs Is Consolidating

Related events (8)

7 · Google DeepMind Blog · 1mo ago

Gemini 2.5 Pro and Flash Updates: Deep Think Reasoning Mode and Capability Improvements

DeepMind announces updates to Gemini 2.5 Pro and Gemini 2.5 Flash, highlighting continued developer adoption for coding tasks. A new experimental feature called Deep Think introduces an enhanced reasoning mode for Gemini 2.5 Pro. Gemini 2.5 Flash also receives a capability update in this release cycle.

Frontier Model Releases · Evaluation and Benchmarking · Gemini-2.5-Flash-Lite · Google DeepMind · Gemini-2.5-Pro · +2 more

8 · Google DeepMind Blog · 1mo ago

Gemini 2.5: Updates to our family of thinking models

Google DeepMind has announced updates to the Gemini 2.5 model family, including Gemini 2.5 Pro reaching stable status, Gemini 2.5 Flash becoming generally available, and a new Gemini 2.5 Flash-Lite entering preview. These releases mark the maturation of DeepMind's 'thinking model' line with enhanced performance and accuracy. The updates span multiple tiers of the Gemini 2.5 family, from the flagship Pro to the lightweight Flash-Lite variant.

Long Context Evolution · Frontier Model Releases · Gemini-2.5-Flash-Lite · Google DeepMind · Gemini-2.5-Pro · +1 more

8 · Google DeepMind Blog · 1mo ago

Gemini 2.5: Google DeepMind's Most Intelligent AI Model with Built-in Thinking

Google DeepMind has announced Gemini 2.5, described as their most intelligent AI model to date, with thinking capabilities built directly into the model. The announcement comes from the official DeepMind blog and marks a significant step in Google's frontier model development. The integration of thinking natively into the model suggests a chain-of-thought or reasoning-first architecture similar to approaches seen in competing models.

Long Context Evolution · Frontier Model Releases · Gemini 2.5 · Google DeepMind · +1 more

5 · Google DeepMind Blog · 1mo ago

Gemini 2.5 Flash-Lite reaches general availability for production use

Google DeepMind has moved Gemini 2.5 Flash-Lite from preview to stable general availability. The model is positioned as a cost-efficient, small-footprint option within the 2.5 family, retaining key features including a 1 million-token context window and multimodal capabilities. It is now ready for scaled production deployment.

Long Context Evolution · Frontier Model Releases · Gemini 2.5 · Gemini-2.5-Flash-Lite · Google DeepMind · +2 more

6 · The Batch · 23d ago

Google Launches Gemini 3.5 Flash: Mid-Tier Model With Agentic Gains at 3x Higher Price

Google released Gemini 3.5 Flash at Google I/O 2026, a mixture-of-experts multimodal model with adjustable reasoning levels, thought preservation across multi-turn conversations, and a 1M-token context window. The model tops APEX-Agents-AA and MMMU-Pro benchmarks among Flash-tier models but trails leading frontier models on overall intelligence, knowledge, and coding. Pricing is $1.50/$9.00 per million input/output tokens—three times the cost of its predecessor Gemini 3 Flash—raising questions about Google's positioning of Flash as a mid-tier rather than budget offering. Independent testing found it costs more in practice than Gemini 3.1 Pro despite Google's claims of competitive pricing.
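
To make the pricing claim concrete, here is a minimal Python sketch that turns the quoted per-million-token prices into a per-request cost. The $1.50/$9.00 figures come from the summary above. The predecessor's price is an assumption derived from the "three times" statement applied to both input and output, not a published figure, and the token counts are invented for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Price in US dollars per one million tokens."""
    input_per_million: float
    output_per_million: float

    def cost(self, input_tokens: int, output_tokens: int) -> float:
        """Return the dollar cost of one request with the given token counts."""
        total = (input_tokens * self.input_per_million
                 + output_tokens * self.output_per_million)
        return total / 1_000_000


# Quoted in the summary: $1.50 input / $9.00 output per million tokens.
GEMINI_3_5_FLASH = Pricing(1.50, 9.00)

# Assumption: "three times the cost" applies equally to input and output.
GEMINI_3_FLASH_IMPLIED = Pricing(1.50 / 3, 9.00 / 3)

# Hypothetical workload: 200k input tokens, 20k output tokens.
new_cost = GEMINI_3_5_FLASH.cost(200_000, 20_000)
old_cost = GEMINI_3_FLASH_IMPLIED.cost(200_000, 20_000)

print(f"3.5 Flash: ${new_cost:.2f}, implied predecessor: ${old_cost:.2f}")
```

List price is only one input to the bill. A model that emits more output or reasoning tokens per task can cost more in practice than a model with a higher per-token price, which is one plausible reading of the independent testing result mentioned above.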

Frontier Model Releases · Evaluation and Benchmarking · Google AI Studio · Artificial Analysis Intelligence Index · Claude Opus 4.6 · +17 more

7 · Google DeepMind Blog · 1mo ago

Gemini 2.0 Flash and Flash-Lite Reach General Availability

Google DeepMind has made Gemini 2.0 Flash-Lite generally available via the Gemini API, Google AI Studio, and Vertex AI for enterprise production use. This marks the transition of the Flash-Lite variant from preview to full GA status. The release expands developer and enterprise access to cost-efficient Gemini 2.0 inference capabilities.

Frontier Model Releases · Inference Economics · Google AI Studio · Gemini-2.5-Flash-Lite · Google DeepMind · +3 more

7 · Hacker News · 1mo ago

Gemini 3.5 Flash Released

Google has released Gemini 3.5 Flash, a new model in the Gemini family. The announcement appears on Google's official blog and has generated significant community discussion on Hacker News with 381 points and 304 comments. Gemini 3.5 Flash follows the Flash line of efficiency-focused models from Google DeepMind.

Frontier Model Releases · Inference Economics · Google Gemini 3.5 Flash · Google DeepMind · +3 more

8 · Google DeepMind Blog · 1mo ago

Gemini 2.5 Family Expansion: Flash and Pro GA, Flash-Lite Introduced

Google DeepMind has made Gemini 2.5 Flash and Gemini 2.5 Pro generally available, while simultaneously introducing Gemini 2.5 Flash-Lite, described as the most cost-efficient and fastest model in the 2.5 family. The announcement marks the full productization of the Gemini 2.5 generation. Flash-Lite targets latency- and cost-sensitive deployment scenarios.

Frontier Model Releases · Inference Economics · Gemini-2.5-Flash-Lite · Google DeepMind · Gemini-2.5-Pro · +1 more