Guide · Beginner

NVIDIA: The Hardware Backbone of the AI Era

TL;DR: NVIDIA is the company whose chips power most of the AI systems you've heard of, from ChatGPT to Claude to open-source models anyone can download. Beyond selling hardware, it has become a full-stack AI platform: building its own AI models, partnering with nearly every major lab, and investing billions to keep open-source AI competitive with closed alternatives.

Key takeaways

NVIDIA invested $30B in OpenAI's latest funding round and has a separate 10-gigawatt datacenter partnership with OpenAI launching in 2026.
NVIDIA invested up to $10B in Anthropic and is co-optimizing future chip architectures specifically for Anthropic's AI workloads.
Its own model families (Nemotron, Cosmos, and Ising) span language, physical-world reasoning, and quantum computing applications, and it also publishes research architectures such as Gated DeltaNet-2.
NVIDIA plans a $26B five-year investment in open-weights AI models, partly as a strategic response to Chinese labs building capable AI on non-NVIDIA hardware.
AI tools now assist NVIDIA's own chip design process: systems like NVCell and PrefixRL produce circuit layouts measurably better than human engineers in a fraction of the time.
Mistral Large 3 was trained on 3,000 NVIDIA H200 GPUs, and Mistral NeMo was jointly released with NVIDIA — illustrating how deeply the company is embedded in third-party model development.

What NVIDIA is

NVIDIA is a semiconductor company best known for making GPUs — graphics processing units — that turned out to be exceptionally good at training and running AI. If you've used ChatGPT, Claude, or almost any other major AI product, there's a very good chance NVIDIA hardware was involved somewhere in building or serving it.

But calling NVIDIA a "chip company" undersells what it has become. It now offers a full platform: chips, software to run AI efficiently on those chips (like TensorRT-LLM), ready-to-deploy AI models under its own brand, and enterprise tools for building AI-powered applications. Think of it less like a parts supplier and more like the company that built the roads, the trucks, and some of the cargo.

Why it matters to you

If your organization is evaluating, buying, or building AI tools, NVIDIA's position matters for a simple reason: almost everything runs on its hardware. The major AI labs — OpenAI, Anthropic, Mistral — all train and serve their models on NVIDIA GPUs. When Anthropic signed a deal to access over 220,000 NVIDIA GPUs through SpaceX's Colossus data center, or when Mistral trained its Large 3 model on 3,000 NVIDIA H200 GPUs, those weren't coincidences. They reflect NVIDIA's near-universal presence in serious AI infrastructure.

This means NVIDIA's product decisions — which chips it builds, what software it supports, which partners it favors — ripple through the entire AI industry.

NVIDIA's own AI models

Beyond hardware, NVIDIA has been quietly building a substantial portfolio of its own AI models:

Nemotron is its family of language and multimodal models. Nemotron 3 Super 120B is a large open-weights model that activates only 12 billion of its 120 billion parameters for each token it processes (a technique called Mixture-of-Experts), which keeps inference fast and efficient. Nemotron 3 Nano Omni handles documents, audio, and video. Nemotron 3.5 Content Safety is built specifically for enterprise content moderation.
Cosmos is NVIDIA's family of models for physical AI — robots and systems that need to understand and act in the real world. Cosmos 3 was released as the first open omni-model targeting physical AI reasoning and action.
Gated DeltaNet-2 is a research-level architecture that outperforms competing approaches on certain efficiency benchmarks.
Ising is a family of models for quantum computing calibration, already adopted by institutions like Fermilab and Harvard.

All of these are released as open-weights models — meaning researchers and companies can download and use them freely.
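
The Mixture-of-Experts idea mentioned for Nemotron 3 Super can be illustrated with a toy sketch. This is not NVIDIA's implementation, and it ignores the model's Mamba-2 and transformer layers entirely. The ten scalar "experts", the router scores, and the function names `route_top_k` and `moe_forward` are all hypothetical, chosen only to show how a router picks a few experts per token so that most parameters stay idle.

```python
import math


def softmax(scores):
    """Turn raw router scores into probabilities that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_scores, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    probs = softmax(router_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)  # renormalize over the chosen experts
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, experts, router_scores, k):
    """Run only the k selected experts and mix their outputs by router weight."""
    chosen = route_top_k(router_scores, k)
    output = sum(weight * experts[i](x) for i, weight in chosen)
    return output, [i for i, _ in chosen]


# Ten hypothetical experts; each scalar function stands in for a large sub-network.
experts = [lambda x, s=s: s * x for s in range(1, 11)]

# Hypothetical router scores for a single token (one score per expert).
router_scores = [0.1, 2.0, -1.0, 0.5, 3.0, 0.0, -0.5, 1.0, 0.2, -2.0]

# With k=1, only one of the ten experts does any work for this token.
output, used = moe_forward(2.0, experts, router_scores, k=1)
print("experts used:", used, "output:", output)

# Headline figures from the guide: 12B active out of 120B total parameters.
active_fraction = 12e9 / 120e9
print(f"active share of parameters per token: {active_fraction:.0%}")
```

In a real model the experts are neural network blocks and the router is learned during training, but the cost argument is the same: compute per token scales with the active parameters, not the total.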

The partnership web

NVIDIA has made itself indispensable by investing in and partnering with the companies that might otherwise be its biggest customers or competitors:

OpenAI: NVIDIA invested $30 billion in OpenAI's latest funding round and has a separate agreement to deploy 10 gigawatts of AI datacenter capacity together, with the first phase launching in 2026.
Anthropic: NVIDIA invested up to $10 billion and is co-designing future chip architectures specifically for Anthropic's AI workloads. Claude models run on NVIDIA Grace Blackwell and Vera Rubin systems.
Mistral AI: Mistral is a founding member of the NVIDIA Nemotron Coalition, a multi-lab initiative to advance open-source AI. Mistral and NVIDIA jointly released Mistral NeMo, and Mistral's models are available as NVIDIA NIM inference microservices, which are pre-packaged, easy-to-deploy containers.
Microsoft: Claude models (and by extension NVIDIA compute) are available across Microsoft's Copilot product family and Azure.

Using AI to design its own chips

One of the more striking recent developments: NVIDIA is using AI to design better NVIDIA chips. At GTC 2025, NVIDIA's chief scientist, Bill Dally, described several systems in active use:

NVCell uses reinforcement learning and genetic algorithms to redesign roughly 2,500 to 3,000 chip layout cells overnight, work that would otherwise take about ten engineer-months.
PrefixRL designs arithmetic circuits that are 20–30% better than human-designed equivalents.
ChipNeMo and BugNeMo are AI assistants, built on Llama 2 and fine-tuned on internal GPU documentation, that help engineers find and fix bugs.

This is a feedback loop: better chips train better AI, which helps design better chips.

The open-weights bet

NVIDIA announced a $26 billion, five-year investment in open-weights AI models. The stated reason is partly strategic: Chinese AI labs have been building capable models on non-NVIDIA hardware, and a thriving open-weights ecosystem that runs best on NVIDIA chips keeps the company central to global AI development — even in markets where its hardware faces export restrictions.

The geopolitical dimension is real. DeepSeek gave Huawei early access to its upcoming V4 model for hardware optimization while blocking NVIDIA and AMD — a signal that China is actively working to reduce dependence on NVIDIA's supply chain.

Where it's heading

Recent announcements point toward NVIDIA deepening its role on three fronts: as the default infrastructure for frontier AI training and inference, as a model publisher in its own right (especially for physical AI and robotics), and as a strategic investor whose chip roadmap is increasingly co-designed with the labs that use it most. The company is also pushing into enterprise software: NemoClaw, its agentic governance stack, launched with partners including Salesforce, Cisco, and CrowdStrike, which suggests ambitions well beyond selling GPUs.

Figure: NVIDIA's AI ecosystem (chips, models, and key partnerships)

FAQ

Do I need NVIDIA to run AI?

Not strictly — AMD GPUs, Google TPUs, and Amazon Trainium chips are all used for AI — but NVIDIA's hardware is by far the most widely deployed, and most AI software is optimized for it first.

Does NVIDIA make AI models, or just chips?

Both. Its Nemotron family covers language and multimodal tasks, and its Cosmos family targets robotics and physical-world AI — all released as open-weights models anyone can use.

Why is NVIDIA investing in AI companies like OpenAI and Anthropic?

The investments come with deep technical partnerships — co-designing future chips for specific AI workloads — ensuring NVIDIA's hardware stays central to how the next generation of AI is built and run.

What is the geopolitical risk to NVIDIA?

DeepSeek gave Huawei early access to its upcoming model for hardware optimization while blocking NVIDIA and AMD, signaling that China is actively working to reduce dependence on NVIDIA chips — a trend NVIDIA's open-weights investment strategy is partly designed to counter.

More on NVIDIA (6)

Mistral AI News · 1mo ago

Mistral AI joins NVIDIA Nemotron Coalition as founding member, co-developing open frontier models

Mistral AI has announced a strategic partnership with NVIDIA as a founding member of the newly formed NVIDIA Nemotron Coalition, a multi-lab initiative to advance open-source frontier foundation models. The collaboration will combine Mistral's model architectures, multimodal capabilities, and fine-tuning expertise with NVIDIA's DGX Cloud compute and synthetic data pipelines. The coalition's first deliverable is a base model trained on DGX Cloud that will underpin the upcoming NVIDIA Nemotron 4 model family, to be open-sourced. Coinciding with the announcement, Mistral is also releasing Mistral Small 4.

OpenAI Blog · 1mo ago

OpenAI and NVIDIA Announce Strategic Partnership to Deploy 10 Gigawatts of AI Datacenters

OpenAI and NVIDIA have announced a strategic partnership targeting deployment of 10 gigawatts of AI datacenter capacity powered by NVIDIA systems. The first phase of the buildout is scheduled to launch in 2026. This represents a major infrastructure commitment between two of the most prominent organizations in AI compute and model development.

The Batch · 19d ago

Nvidia's AI Systems Design Chip Circuits, Verify Designs, and Test New Layouts

Nvidia chief scientist Bill Dally described the company's use of AI across five stages of chip design at GTC 2025, including NVCell (a RL+genetic algorithm system that redesigns ~2,500-3,000 layout cells overnight vs. 10 engineer-months), PrefixRL (RL-designed arithmetic circuits 20-30% better than human designs), and ChipNeMo/BugNeMo (LLaMA 2-based LLMs fine-tuned on internal GPU documentation). The systems demonstrate measurable improvements over human and industry-standard designs, though Dally acknowledged that fully autonomous GPU design from a prompt remains a distant goal. The piece also references a 2025 Verkoran paper describing an agentic system that autonomously designed a RISC-V CPU from a 219-word specification.

Latent Space · 18d ago

NVIDIA Cosmos 3, Nemotron 3 Ultra, and RTX Spark

A Latent Space AI news digest covers three NVIDIA announcements: Cosmos 3 (a world model/simulation platform), Nemotron 3 Ultra (a large language model), and RTX Spark (likely a new hardware or inference product). The piece frames these as a significant win for Jensen Huang and NVIDIA's AI portfolio. Coverage is commentary-tier aggregation rather than primary technical reporting.

The Batch · 18d ago

Nvidia releases Nemotron 3 Super 120B-A12B open-weights model with hybrid Mamba-2/MoE architecture

Nvidia released Nemotron 3 Super 120B-A12B, an open-weights LLM with a hybrid Mamba-2/transformer/MoE architecture that activates only 12B parameters per token and supports up to 1 million token context. The model claims the fastest inference speed in its size class at 442 tokens/second and leads open-weights models on PinchBench agentic task evaluation, outperforming larger models including Kimi K2.5 (1T parameters). Nvidia is releasing weights, training data, and recipes under a permissive commercial license, and plans a $26B five-year investment in open-weights models — framed partly as a strategic response to Chinese labs building capable open-weights models on non-Nvidia hardware.

The Batch · 34h ago

Nvidia Nemotron 3 Ultra: hybrid Mamba-transformer open-weights model targeting agentic workloads

Nvidia released Nemotron 3 Ultra, a 550B parameter (55B active) hybrid Mamba-transformer mixture-of-experts model with a 1M token context window, publishing weights, training data, and RL environments under an open license. The model ranks as the highest-scoring U.S. open-weights model on the Artificial Analysis Intelligence Index (47.7-48.2) and is approximately three times faster than comparable open-weights rivals, though it trails leading Chinese models like Kimi K2.6 and DeepSeek V4 Pro on intelligence benchmarks. Nvidia used a novel Multi-Teacher On-Policy Distillation approach with 10+ specialized teacher models and trained using NVFP4 quantization. The release is strategically motivated by Nvidia's interest in a healthy open-weights ecosystem that drives AI semiconductor adoption.
