NVIDIA NIM

product · active · nvidia-nim-e06dd328 · 6 events · first seen May 18, 2026

Aliases: NVIDIA NIM

Co-occurring entities

More like this (12)

NVIDIA NeMo · NVIDIA Nemotron Coalition · NVIDIA · NVIDIA Nemotron 4 · NVIDIA Isaac · Optimum-NVIDIA · NVIDIA Labs · NVIDIA Nemotron 3 Ultra · Nvidia NeMo Retriever · NVIDIA Nemotron-3-Nano-30B-A3B · NVIDIA DGX Cloud · NVIDIA/skills

Recent events (6)

7 · Mistral AI News · Jun 1, 2026

Mistral Small 3.1: Multimodal, 128k Context, Apache 2.0 Open-Weight Model

Mistral AI has released Mistral Small 3.1, a roughly 24B-parameter model with multimodal understanding and a 128k-token context window. Mistral claims best-in-class performance among small models, saying it outperforms Gemma 3 and GPT-4o Mini on text, multimodal, and multilingual benchmarks. The model runs on a single RTX 4090 or a Mac with 32GB of RAM, with reported inference speeds of 150 tokens per second, and is released under the Apache 2.0 license with both base and instruct checkpoints. It is available on HuggingFace, Mistral's La Plateforme API, and Google Cloud Vertex AI, with NVIDIA NIM and Azure AI Foundry support coming soon. The release targets enterprise and on-device use cases including document verification, agentic workflows, and domain fine-tuning.

Long Context Evolution · Frontier Model Releases · Mistral AI · Mistral Small 4 · MT-Bench · +12 more

7 · Mistral AI News · Jun 1, 2026

Mistral NeMo: 12B Open-Weights Model with 128k Context, Built with NVIDIA

Mistral AI and NVIDIA have jointly released Mistral NeMo, a 12B-parameter model under the Apache 2.0 license with a 128k-token context window and a new tokenizer, Tekken, based on Tiktoken. The model is designed as a drop-in replacement for Mistral 7B, supports multilingual applications across 11+ languages, and was trained with quantization awareness, enabling FP8 inference without performance loss. Benchmark comparisons show competitive performance against Gemma 2 9B and Llama 3 8B. Weights are available on HuggingFace, and the model is also packaged as an NVIDIA NIM inference microservice.

Long Context Evolution · Frontier Model Releases · Mistral AI · Gemma 2 9B · Apache 2.0 · +9 more

7 · Mistral AI News · Jun 1, 2026

Mistral Medium 3: Frontier-Class Performance at 8x Lower Cost

Mistral AI has released Mistral Medium 3, a new enterprise-focused language model priced at $0.4/$2 per million input/output tokens. Mistral claims the model achieves 90%+ of Claude Sonnet 3.7's benchmark performance while undercutting cost leaders like DeepSeek v3 and outperforming open models including Llama 4 Maverick. It supports hybrid, on-premises, and in-VPC deployment on as few as four GPUs, and is available immediately on Mistral La Plateforme and Amazon SageMaker, with additional cloud platforms coming soon. The announcement also teases an upcoming large open-weights model release.

Frontier Model Releases · Open Weights Progress · Mistral AI · Amazon SageMaker · DeepSeek V4 · +11 more

5 · Hugging Face Blog · May 19, 2026

Serverless Inference with Hugging Face and NVIDIA NIM

Hugging Face and NVIDIA have partnered to offer serverless inference via NVIDIA NIM microservices on DGX Cloud infrastructure. The integration allows developers to run optimized model inference without managing GPU infrastructure, combining Hugging Face's model hub with NVIDIA's inference optimization stack. This represents an expansion of the existing Hugging Face–NVIDIA partnership into managed inference services.

Training Infrastructure · Inference Economics · NVIDIA NIM · NVIDIA DGX Cloud · +2 more

5 · Hugging Face Blog · May 19, 2026

Accelerate a World of LLMs on Hugging Face with NVIDIA NIM

NVIDIA NIM microservices are being integrated with Hugging Face to enable optimized inference deployment for a broad range of LLMs hosted on the Hub. The partnership allows developers to deploy Hugging Face models via NIM's containerized inference stack, leveraging NVIDIA's TensorRT-LLM and other optimizations. This expands the ecosystem of models accessible through NIM beyond NVIDIA's own catalog to the wider Hugging Face model repository.

Inference Economics · Enterprise Deployment Patterns · NVIDIA NIM · NVIDIA TensorRT-LLM · +2 more

8 · Mistral AI News · May 18, 2026

Mistral Releases Devstral 2 (123B) and Devstral Small 2 (24B) Coding Models Plus Vibe CLI Agent

Mistral AI has released Devstral 2, a 123B-parameter open-weight coding model scoring 72.2% on SWE-bench Verified, and Devstral Small 2, a 24B model scoring 68.0% on the same benchmark and deployable on consumer hardware. Both models support a 256K context window and are permissively licensed (modified MIT and Apache 2.0, respectively). Mistral also launched Vibe CLI, an open-source terminal-based coding agent powered by Devstral that supports multi-file orchestration, natural language code editing, and IDE integration via Agent Communication Protocol. Devstral 2 is currently free via the API; once the free period ends, pricing will be $0.40/$2.00 per million input/output tokens.

Long Context Evolution · Frontier Model Releases · Devstral 2 Small · Mistral AI · Kimi K2 · +13 more