6 · Hugging Face Blog · 1mo ago

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

NVIDIA has released Cosmos Reason 2, a model designed to bring advanced reasoning capabilities to physical AI applications. The announcement appears on the Hugging Face blog, which suggests the model is available through the platform. The release continues NVIDIA's Cosmos model family, which targets robotics and physical-world understanding.

Related events (8)

7 · Hugging Face Blog · 19d ago

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

NVIDIA has released Cosmos 3, described as the first open omni-model targeting physical AI reasoning and action. The model is hosted and announced via Hugging Face, positioning it as an open-weights offering for robotics and embodied AI applications. The announcement highlights multimodal capabilities oriented toward physical world understanding and agent-level action.

6 · Hugging Face Blog · 1mo ago

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

NVIDIA announced new open models and datasets for physical AI development at GTC 2025, covered via the Hugging Face blog. The release targets robotics and embodied AI developers with open-weights resources. This represents NVIDIA's continued push into the physical AI ecosystem alongside its hardware dominance.

5 · Hugging Face Blog · 1mo ago

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation

This Hugging Face blog post details a workflow for fine-tuning NVIDIA's Cosmos Predict 2.5 world model using LoRA and DoRA parameter-efficient techniques for robot video generation tasks. The post covers practical implementation steps for adapting the foundation video model to robotics-specific domains. This represents a concrete application of world models to embodied AI, where synthetic video generation can support robot training data pipelines.

6 · Latent Space · 18d ago

NVIDIA Cosmos 3, Nemotron 3 Ultra, and RTX Spark

A Latent Space AI news digest covers three NVIDIA announcements: Cosmos 3 (a world model/simulation platform), Nemotron 3 Ultra (a large language model), and RTX Spark (likely a new hardware or inference product). The piece frames these as a significant win for Jensen Huang and NVIDIA's AI portfolio. Coverage is commentary-tier aggregation rather than primary technical reporting.

8 · Google DeepMind Blog · 1mo ago

Gemini Robotics brings AI into the physical world

Google DeepMind has announced Gemini Robotics and Gemini Robotics-ER, two AI models purpose-built for robotic systems to perceive, reason about, and act within physical environments. The release extends the Gemini model family into embodied AI and robotics applications. Gemini Robotics-ER (the "ER" stands for embodied reasoning) targets enhanced reasoning capabilities for robotic control. This marks a significant step by DeepMind toward deploying frontier multimodal models in physical-world settings.

6 · Hugging Face Blog · 1mo ago

NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset

NVIDIA has released a dataset of 6 million multilingual reasoning examples, published via Hugging Face. The dataset is intended to support training and evaluation of reasoning capabilities across multiple languages. This release addresses a known gap in multilingual reasoning data availability for the research community.

8 · Google DeepMind Blog · 1mo ago

Gemini Robotics 1.5 brings AI agents into the physical world

DeepMind has announced Gemini Robotics 1.5, a model designed to enable physical AI agents with capabilities spanning perception, planning, reasoning, tool use, and multi-step task execution. The release positions Gemini as a foundation for embodied robotics systems. This represents an extension of the Gemini model family into physical-world agentic applications.

5 · Hugging Face Blog · 1mo ago

Open R1: Update #2

Hugging Face's Open R1 project releases its second progress update on the open-source replication of DeepSeek-R1's reasoning capabilities. The update likely covers training progress, dataset releases, and intermediate model checkpoints as the team works toward a fully open reproduction of the reasoning model pipeline. Open R1 is a community-driven effort to make the techniques behind frontier reasoning models accessible to researchers.