7Hugging Face Blog·19d ago

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

NVIDIA has released Cosmos 3, described as the first open omni-model targeting physical AI reasoning and action. The model is hosted and announced via Hugging Face, positioning it as an open-weights offering for robotics and embodied AI applications. The announcement highlights multimodal capabilities oriented toward physical world understanding and agent-level action.

Frontier Model Releases Open Weights Progress Agent and Tool Ecosystem Multimodal Progress NVIDIA Cosmos NVIDIA Hugging Face

Related guides (3)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

NVIDIA

NVIDIA: The Hardware Backbone of the AI Era

Read asBeginner In-depth

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

Related events (8)

6Hugging Face Blog·1mo ago·source ↗

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

NVIDIA has released Cosmos Reason 2, a model designed to bring advanced reasoning capabilities to physical AI applications. The announcement appears on the Hugging Face blog, indicating the model is likely available or accessible through the platform. This represents a continuation of NVIDIA's Cosmos model family targeting robotics and physical world understanding.

Frontier Model Releases Agent and Tool Ecosystem NVIDIA Cosmos Reason 2 NVIDIA Cosmos NVIDIA +2 more

6Hugging Face Blog·1mo ago·source ↗

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

NVIDIA announced new open models and datasets for physical AI development at GTC 2025, covered via the Hugging Face blog. The release targets robotics and embodied AI developers with open-weights resources. This represents NVIDIA's continued push into the physical AI ecosystem alongside its hardware dominance.

Open Weights Progress Agent and Tool Ecosystem NVIDIA Hugging Face GTC 2025 +1 more

6Hugging Face Blog·1mo ago·source ↗

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

NVIDIA has released Nemotron 3 Nano Omni, a multimodal model targeting long-context understanding across documents, audio, and video modalities. The model is positioned for agentic use cases requiring cross-modal reasoning. It is published via the Hugging Face blog as part of NVIDIA's Nemotron model family. No detailed technical specifications or benchmark results are provided in the available body text.

Long Context Evolution Open Weights Progress Nemotron 3 Nano Omni NVIDIA Hugging Face +3 more

6Latent Space·18d ago·source ↗

NVIDIA Cosmos 3, Nemotron 3 Ultra, and RTX Spark

A Latent Space AI news digest covers three NVIDIA announcements: Cosmos 3 (a world model/simulation platform), Nemotron 3 Ultra (a large language model), and RTX Spark (likely a new hardware or inference product). The piece frames these as a significant win for Jensen Huang and NVIDIA's AI portfolio. Coverage is commentary-tier aggregation rather than primary technical reporting.

Training Infrastructure Frontier Model Releases NVIDIA Cosmos NVIDIA RTX Spark NVIDIA +4 more

5Hugging Face Blog·1mo ago·source ↗

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation

This Hugging Face blog post details a workflow for fine-tuning NVIDIA's Cosmos Predict 2.5 world model using LoRA and DoRA parameter-efficient techniques for robot video generation tasks. The post covers practical implementation steps for adapting the foundation video model to robotics-specific domains. This represents a concrete application of world models to embodied AI, where synthetic video generation can support robot training data pipelines.

Inference Economics Agent and Tool Ecosystem DoRA LoRA NVIDIA +3 more

7The Batch·34h ago·source ↗

Nvidia Nemotron 3 Ultra: hybrid Mamba-transformer open-weights model targeting agentic workloads

Nvidia released Nemotron 3 Ultra, a 550B parameter (55B active) hybrid Mamba-transformer mixture-of-experts model with a 1M token context window, publishing weights, training data, and RL environments under an open license. The model ranks as the highest-scoring U.S. open-weights model on the Artificial Analysis Intelligence Index (47.7-48.2) and is approximately three times faster than comparable open-weights rivals, though it trails leading Chinese models like Kimi K2.6 and DeepSeek V4 Pro on intelligence benchmarks. Nvidia used a novel Multi-Teacher On-Policy Distillation approach with 10+ specialized teacher models and trained using NVFP4 quantization. The release is strategically motivated by Nvidia's interest in a healthy open-weights ecosystem that drives AI semiconductor adoption.

Frontier Model Releases Open Weights Progress Mamba IFBench Artificial Analysis Intelligence Index +17 more

9Openai Blog·1mo ago·source ↗

Introducing OpenAI o3 and o4-mini

OpenAI has released o3 and o4-mini, described as their smartest and most capable models to date. Both models ship with full tool access, representing a significant step in integrating reasoning models with agentic capabilities. The announcement comes from OpenAI's official blog, marking a major frontier model release.

Frontier Model Releases Evaluation and Benchmarking o4-mini OpenAI o3 +2 more

8Openai Blog·1mo ago·source ↗

OpenAI o3-mini Release

OpenAI has released o3-mini, a smaller and more efficient variant of its o3 reasoning model. The announcement comes from OpenAI's official blog, indicating a formal product launch. As a tier-1 source announcement, this represents a significant addition to OpenAI's model lineup, targeting cost-effective reasoning capabilities. Further technical details about benchmarks, context length, and pricing are expected in the full release documentation.

Frontier Model Releases Inference Economics o3-mini OpenAI o3 +1 more