Almanac
← Events
7Google DeepMind Blog·1mo ago

SIMA 2: An Agent that Plays, Reasons, and Learns With You in Virtual 3D Worlds

DeepMind has announced SIMA 2, a successor to its Scalable Instructable Multiworld Agent, powered by Gemini and designed to think, reason, and act within interactive 3D virtual environments. The agent represents an advancement in embodied AI agents capable of operating across diverse game and simulation worlds. This builds on DeepMind's earlier SIMA work, which demonstrated generalist instruction-following agents in video game environments.

Related guides (4)

Related events (8)

7Google Deepmind Blog·1mo ago·source ↗

DeepMind's Vision for Building a Universal AI Assistant

DeepMind has published a vision statement for evolving Gemini into a universal AI assistant by extending it into a world model capable of planning and simulating aspects of the world. The announcement signals a strategic direction toward agents that can imagine and reason about future states rather than purely responding to prompts. This positions Gemini as a long-term platform for agentic and embodied AI capabilities.

8Google Deepmind Blog·1mo ago·source ↗

Gemini Robotics 1.5 brings AI agents into the physical world

DeepMind has announced Gemini Robotics 1.5, a model designed to enable physical AI agents with capabilities spanning perception, planning, reasoning, tool use, and multi-step task execution. The release positions Gemini as a foundation for embodied robotics systems. This represents an extension of the Gemini model family into physical-world agentic applications.

9Google Deepmind Blog·1mo ago·source ↗

Gemini 3.5: Frontier Intelligence with Action

Google DeepMind has announced Gemini 3.5, a new model generation positioned around agentic capabilities and complex workflow execution. The announcement emphasizes action-oriented AI, suggesting a focus on tool use, multi-step reasoning, and autonomous task completion. The blog post is brief, indicating this may be an initial announcement with further details to follow.

8Google Deepmind Blog·1mo ago·source ↗

Gemini 2.5: Google DeepMind's Most Intelligent AI Model with Built-in Thinking

Google DeepMind has announced Gemini 2.5, described as their most intelligent AI model to date, with thinking capabilities built directly into the model. The announcement comes from the official DeepMind blog and marks a significant step in Google's frontier model development. The integration of thinking natively into the model suggests a chain-of-thought or reasoning-first architecture similar to approaches seen in competing models.

8Google Deepmind Blog·1mo ago·source ↗

Gemini Robotics brings AI into the physical world

Google DeepMind has announced Gemini Robotics and Gemini Robotics-ER, two AI models purpose-built for robotic systems to perceive, reason about, and act within physical environments. The release extends the Gemini model family into embodied AI and robotics applications. Gemini Robotics-ER appears to target enhanced reasoning capabilities for robotic control. This marks a significant step by DeepMind toward deploying frontier multimodal models in physical-world settings.

6arXiv · cs.CL·12d ago·source ↗

Agentopia: Long-term multi-agent life simulation framework for training LLMs on social behavior

Researchers introduce Agentopia, a framework for simulating 10 years of social life across 100 LLM-powered agents, enabling study of emergent social behaviors and long-term personal growth dynamics. The system defines a 'life reward' metric mirroring human well-being and uses it to train LLMs via rejection sampling. Training on simulated social experience yields a +15.6% improvement on downstream role-playing benchmarks, suggesting that synthetic social simulation can generalize to real capability gains.

8Google Deepmind Blog·1mo ago·source ↗

Introducing the Gemini 2.5 Computer Use model

Google DeepMind has released a preview of a specialized Computer Use model built on Gemini 2.5 Pro, available via API. The model is designed to power agents that can interact with user interfaces, extending Gemini 2.5 Pro's capabilities into computer-use agentic tasks. This positions Google as a direct competitor to Anthropic's Claude Computer Use and similar offerings in the emerging computer-use agent space.

4Openai Blog·1mo ago·source ↗

OpenAI Releases Neural MMO: Massively Multiagent RL Game Environment

OpenAI released Neural MMO, a massively multiagent game environment designed for reinforcement learning research. The platform supports a large and variable number of agents operating within a persistent, open-ended task structure. The environment is designed to encourage emergent behaviors including better exploration, divergent niche formation, and improved overall agent competence through multi-species competition.