7 · Google DeepMind Blog · 1mo ago

Veo 2 Video Generation Launches in Gemini Advanced and Whisk Animate

Google DeepMind is rolling out Veo 2 video generation capabilities to Gemini Advanced and Whisk, enabling users to create high-resolution eight-second videos from text prompts or animate still images. Gemini Advanced subscribers can generate videos directly from text, while Whisk Animate converts input images into short animated clips. This marks a consumer-facing deployment of Veo 2, DeepMind's second-generation video generation model.

Frontier Model Releases · Enterprise Deployment Patterns · Multimodal Progress · Gemini Advanced · Veo 2 · Whisk · Google · Google DeepMind

Related guides (5)

Google DeepMind

Google DeepMind: Frontier AI Across Models, Robotics, and Scientific Discovery

Frontier Model Releases · Topic guide

Frontier Model Releases: The Race From Language to Action

Google

Google: The AI Lab That Builds Everything from DNA Models to Your Phone's Assistant

Multimodal Progress · Topic guide

Multimodal Progress: How AI Learned to See, Hear, and Act

Enterprise Deployment Patterns · Topic guide

Enterprise Deployment Patterns: From LLM Demo to Production Reality

Related events (8)

6 · Google DeepMind Blog · 1mo ago

Introducing Veo 3.1 and Advanced Creative Capabilities

Google DeepMind has announced Veo 3.1, an updated version of its video generation model, with significant enhancements to creative control features. The announcement comes from DeepMind's official blog, indicating a formal product update rather than a research preview. Specific capability details are not provided in the body text, but the framing suggests improvements to user-facing generation controls.

Frontier Model Releases · Multimodal Progress · Veo · Veo 3.1 · Google DeepMind

6 · Google DeepMind Blog · 1mo ago

Veo 3.1 Ingredients to Video: More consistency, creativity and control

Google DeepMind has released Veo 3.1, an updated video generation model that improves consistency, creativity, and control in generated clips. The update produces more natural and dynamic video content and adds support for vertical video generation. The announcement comes from DeepMind's official blog.

Frontier Model Releases · Multimodal Progress · Veo · Veo 3.1 · Google DeepMind

7 · Google DeepMind Blog · 1mo ago

Gemini 2.0 Flash Native Image Generation Now Available for Developers

Google DeepMind has released native image output capability in Gemini 2.0 Flash, making it available to developers via Google AI Studio and the Gemini API. This enables the model to generate images natively rather than through a separate image generation pipeline. The release is framed as an experimental feature for developer exploration.

Frontier Model Releases · Agent and Tool Ecosystem · Google AI Studio · Gemini-2.5-Flash-Lite · Google DeepMind · +2 more

8 · Google DeepMind Blog · 1mo ago

Google DeepMind Introduces Veo 3, Imagen 4, and Flow Filmmaking Tool

Google DeepMind has announced Veo 3 and Imagen 4, new generative video and image models respectively, alongside a filmmaking tool called Flow. The announcement comes from DeepMind's official blog and represents the next generation of their generative media capabilities. These releases expand Google's multimodal generative AI portfolio targeting creative and professional media production use cases.

Frontier Model Releases · Agent and Tool Ecosystem · Imagen 4 · Veo 3.1 · Flow · +2 more

7 · Google DeepMind Blog · 1mo ago

Gemini 2.0 Flash and Flash-Lite Reach General Availability

Google DeepMind has made Gemini 2.0 Flash-Lite generally available via the Gemini API, Google AI Studio, and Vertex AI for enterprise production use. This marks the transition of the Flash-Lite variant from preview to full GA status. The release expands developer and enterprise access to cost-efficient Gemini 2.0 inference capabilities.

Frontier Model Releases · Inference Economics · Google AI Studio · Gemini-2.5-Flash-Lite · Google DeepMind · +3 more

7 · Google DeepMind Blog · 1mo ago

Advanced audio dialog and generation with Gemini 2.5

Google DeepMind has announced new audio dialog and generation capabilities in Gemini 2.5. The update extends the model's multimodal capabilities into spoken interaction and audio synthesis. No further technical details are provided in the announcement body.

Frontier Model Releases · Multimodal Progress · Gemini 2.5 · Google DeepMind

6 · Google DeepMind Blog · 1mo ago

Improved Gemini Audio Models for Powerful Voice Experiences

Google DeepMind has announced improved Gemini audio models aimed at better voice experiences. The announcement comes from the official DeepMind blog, indicating a formal update to the Gemini model family's audio processing and generation features. Specific technical details were not available in the body text, but the framing suggests advances in speech understanding, synthesis, or real-time voice interaction. This is part of Google DeepMind's ongoing development of multimodal Gemini capabilities.

Frontier Model Releases · Multimodal Progress · Gemini Audio · Google DeepMind · Gemini

6 · Google DeepMind Blog · 1mo ago

Updated Gemini 2.5 Pro Preview with Improved Coding Capabilities

Google DeepMind has released an updated version of Gemini 2.5 Pro Preview with enhanced coding capabilities, specifically targeting the development of rich, interactive web applications. The announcement comes from DeepMind's official blog, indicating a focused improvement on code generation and web app development use cases. No technical details or benchmark results are provided in the body text.

Frontier Model Releases · Agent and Tool Ecosystem · Google DeepMind · Gemini-2.5-Pro