NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI
NVIDIA has released Cosmos Reason 2, a model designed to bring advanced reasoning capabilities to physical AI applications. The announcement appears on the Hugging Face blog, which suggests the model is available through the platform. The release continues NVIDIA's Cosmos model family, which targets robotics and physical-world understanding.
Related events (8)
Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action
NVIDIA has released Cosmos 3, described as the first open omni-model targeting physical AI reasoning and action. The model is hosted and announced via Hugging Face, positioning it as an open-weights offering for robotics and embodied AI applications. The announcement highlights multimodal capabilities oriented toward physical world understanding and agent-level action.
NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets
NVIDIA announced new open models and datasets for physical AI development at GTC 2025, in a post on the Hugging Face blog. The release gives robotics and embodied AI developers open-weights resources. It continues NVIDIA's push into the physical AI ecosystem alongside its dominant position in hardware.
Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation
This Hugging Face blog post details a workflow for fine-tuning NVIDIA's Cosmos Predict 2.5 world model using LoRA and DoRA parameter-efficient techniques for robot video generation tasks. The post covers practical implementation steps for adapting the foundation video model to robotics-specific domains. This represents a concrete application of world models to embodied AI, where synthetic video generation can support robot training data pipelines.
NVIDIA Cosmos 3, Nemotron 3 Ultra, and RTX Spark
A Latent Space AI news digest covers three NVIDIA announcements: Cosmos 3 (a world model/simulation platform), Nemotron 3 Ultra (a large language model), and RTX Spark (likely a new hardware or inference product). The piece frames these as a significant win for Jensen Huang and NVIDIA's AI portfolio. It is commentary and aggregation rather than primary technical reporting.
Gemini Robotics brings AI into the physical world
Google DeepMind has announced Gemini Robotics and Gemini Robotics-ER, two AI models purpose-built for robotic systems to perceive, reason about, and act within physical environments. The release extends the Gemini model family into embodied AI and robotics applications. Gemini Robotics-ER appears to target enhanced reasoning capabilities for robotic control. This marks a significant step by DeepMind toward deploying frontier multimodal models in physical-world settings.
NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset
NVIDIA has released a dataset of 6 million multilingual reasoning examples, published via Hugging Face. The dataset is intended to support training and evaluation of reasoning capabilities across multiple languages. This release addresses a known gap in multilingual reasoning data availability for the research community.
Gemini Robotics 1.5 brings AI agents into the physical world
Google DeepMind has announced Gemini Robotics 1.5, a model designed to enable physical AI agents with capabilities spanning perception, planning, reasoning, tool use, and multi-step task execution. The release positions Gemini as a foundation for embodied robotics systems and extends the Gemini model family into physical-world agentic applications.
Open R1: Update #2
Hugging Face's Open R1 project releases its second progress update on the open-source replication of DeepSeek-R1's reasoning capabilities. The update likely covers training progress, dataset releases, and intermediate model checkpoints as the team works toward a fully open reproduction of the reasoning model pipeline. Open R1 is a community-driven effort to make the techniques behind frontier reasoning models accessible to researchers.


