6Google DeepMind Blog·1mo ago

Project Genie Expands with Street View Integration for Real-World Simulation

DeepMind is expanding Project Genie access to Google AI Ultra subscribers globally and introducing a new capability that uses Street View data to simulate real-world places. Project Genie, previously known for generating interactive 2D environments from images, is now incorporating real-world geographic imagery as a conditioning source. The announcement signals a move toward grounding generative world models in actual physical environments.

Frontier Model Releases Agent and Tool Ecosystem Multimodal Progress Google Street View Project Genie Google DeepMind Google AI Ultra

Related guides (4)

Google DeepMind

Google DeepMind: Frontier AI Across Models, Robotics, and Scientific Discovery

Read asIn-depth

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

Multimodal ProgressTopic guide

Multimodal Progress: How AI Learned to See, Hear, and Act

Read asBeginner In-depth

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How the Infrastructure Layer Around LLMs Is Consolidating

Read asIn-depth

Related events (8)

5Google Deepmind Blog·1mo ago·source ↗

Project Genie: Experimenting with Infinite, Interactive Worlds

Google DeepMind has launched Project Genie as an experimental research prototype available to Google AI Ultra subscribers in the U.S. The system allows users to create and explore interactive worlds, extending DeepMind's prior Genie research on generative interactive environments. The announcement is brief and light on technical detail, positioning this as an early public experiment rather than a full product release.

Agent and Tool Ecosystem Multimodal Progress Project Genie Google DeepMind Google AI Ultra

8Google Deepmind Blog·1mo ago·source ↗

Genie 3: A new frontier for world models

DeepMind has announced Genie 3, a world model capable of generating interactive, navigable 3D environments in real time at 24 fps and 720p resolution. The system maintains consistency for several minutes, representing a significant step up from prior Genie iterations. This positions Genie 3 as a frontier capability demonstration in generative world modeling for interactive applications.

Frontier Model Releases Agent and Tool Ecosystem Genie 3 Google DeepMind +1 more

5Google Deepmind Blog·4d ago·source ↗

Google DeepMind partners with UK government on AI-accelerated housing planning prototype

Google DeepMind and the UK government have announced a partnership to build an AI-powered prototype aimed at accelerating housing planning decisions. The initiative targets the UK's housing development bottleneck by applying AI to planning workflows. This represents a notable government-lab collaboration deploying AI in a high-stakes public sector context.

Enterprise Deployment Patterns Regulatory Developments Google DeepMind UK Government

6Openai Blog·1mo ago·source ↗

DALL·E 3 Now Available in ChatGPT Plus and Enterprise

OpenAI has rolled out DALL·E 3 to ChatGPT Plus and Enterprise subscribers, expanding access to its latest image generation model. The announcement also highlights a safety mitigation stack developed for the wider release and provides updates on provenance research aimed at identifying AI-generated images.

AI Safety Research Enterprise Deployment Patterns DALL·E 3 ChatGPT Plus OpenAI +2 more

7Google Deepmind Blog·1mo ago·source ↗

SIMA 2: An Agent that Plays, Reasons, and Learns With You in Virtual 3D Worlds

DeepMind has announced SIMA 2, a successor to its Scalable Instructable Multiworld Agent, powered by Gemini and designed to think, reason, and act within interactive 3D virtual environments. The agent represents an advancement in embodied AI agents capable of operating across diverse game and simulation worlds. This builds on DeepMind's earlier SIMA work, which demonstrated generalist instruction-following agents in video game environments.

Frontier Model Releases Agent and Tool Ecosystem SIMA 2 SIMA Google DeepMind +2 more

7Google Deepmind Blog·1mo ago·source ↗

DeepMind's Vision for Building a Universal AI Assistant

DeepMind has published a vision statement for evolving Gemini into a universal AI assistant by extending it into a world model capable of planning and simulating aspects of the world. The announcement signals a strategic direction toward agents that can imagine and reason about future states rather than purely responding to prompts. This positions Gemini as a long-term platform for agentic and embodied AI capabilities.

Frontier Model Releases Agent and Tool Ecosystem DeepMind world model Google +2 more

4Openai Blog·1mo ago·source ↗

Building smarter maps with GPT-4o vision fine-tuning

OpenAI published a case study on Grab using GPT-4o vision fine-tuning to improve map intelligence. The deployment demonstrates a real-world enterprise application of fine-tuned multimodal models for geospatial data processing. This represents a concrete example of GPT-4o's vision capabilities being adapted for domain-specific tasks in Southeast Asian markets.

Enterprise Deployment Patterns Multimodal Progress Grab GPT-4o OpenAI +1 more

5Openai Blog·1mo ago·source ↗

OpenAI Releases Universe: A Platform for Training AI Across Games, Websites, and Applications

OpenAI released Universe, a software platform designed to measure and train AI general intelligence across a broad range of environments including games, websites, and other applications. The platform aims to expose AI agents to the world's supply of software as training and evaluation environments. This represented an early effort to develop general-purpose AI agents capable of operating across diverse real-world interfaces.

Evaluation and Benchmarking Agent and Tool Ecosystem Universe OpenAI