Project Genie Expands with Street View Integration for Real-World Simulation
DeepMind is expanding Project Genie access to Google AI Ultra subscribers globally and introducing a new capability that uses Street View data to simulate real-world places. Project Genie, previously known for generating interactive 2D environments from images, is now incorporating real-world geographic imagery as a conditioning source. The announcement signals a move toward grounding generative world models in actual physical environments.
Related guides (4)
Related events (8)
Project Genie: Experimenting with Infinite, Interactive Worlds
Google DeepMind has launched Project Genie as an experimental research prototype available to Google AI Ultra subscribers in the U.S. The system allows users to create and explore interactive worlds, extending DeepMind's prior Genie research on generative interactive environments. The announcement is brief and light on technical detail, positioning this as an early public experiment rather than a full product release.
Genie 3: A new frontier for world models
DeepMind has announced Genie 3, a world model capable of generating interactive, navigable 3D environments in real time at 24 fps and 720p resolution. The system maintains consistency for several minutes, representing a significant step up from prior Genie iterations. This positions Genie 3 as a frontier capability demonstration in generative world modeling for interactive applications.
Google DeepMind partners with UK government on AI-accelerated housing planning prototype
Google DeepMind and the UK government have announced a partnership to build an AI-powered prototype aimed at accelerating housing planning decisions. The initiative targets the UK's housing development bottleneck by applying AI to planning workflows. This represents a notable government-lab collaboration deploying AI in a high-stakes public sector context.
DALL·E 3 Now Available in ChatGPT Plus and Enterprise
OpenAI has rolled out DALL·E 3 to ChatGPT Plus and Enterprise subscribers, expanding access to its latest image generation model. The announcement also highlights a safety mitigation stack developed for the wider release and provides updates on provenance research aimed at identifying AI-generated images.
SIMA 2: An Agent that Plays, Reasons, and Learns With You in Virtual 3D Worlds
DeepMind has announced SIMA 2, a successor to its Scalable Instructable Multiworld Agent, powered by Gemini and designed to think, reason, and act within interactive 3D virtual environments. The agent represents an advancement in embodied AI agents capable of operating across diverse game and simulation worlds. This builds on DeepMind's earlier SIMA work, which demonstrated generalist instruction-following agents in video game environments.
DeepMind's Vision for Building a Universal AI Assistant
DeepMind has published a vision statement for evolving Gemini into a universal AI assistant by extending it into a world model capable of planning and simulating aspects of the world. The announcement signals a strategic direction toward agents that can imagine and reason about future states rather than purely responding to prompts. This positions Gemini as a long-term platform for agentic and embodied AI capabilities.
Building smarter maps with GPT-4o vision fine-tuning
OpenAI published a case study on Grab using GPT-4o vision fine-tuning to improve map intelligence. The deployment demonstrates a real-world enterprise application of fine-tuned multimodal models for geospatial data processing. This represents a concrete example of GPT-4o's vision capabilities being adapted for domain-specific tasks in Southeast Asian markets.
OpenAI Releases Universe: A Platform for Training AI Across Games, Websites, and Applications
OpenAI released Universe, a software platform designed to measure and train AI general intelligence across a broad range of environments including games, websites, and other applications. The platform aims to expose AI agents to the world's supply of software as training and evaluation environments. This represented an early effort to develop general-purpose AI agents capable of operating across diverse real-world interfaces.



