Almanac
← Events
5Hugging Face Blog·16d ago

NVIDIA releases Nemotron 3.5 Content Safety, a customizable multimodal safety model for enterprise AI

NVIDIA has released Nemotron 3.5 Content Safety, a multimodal safety model designed for enterprise AI deployments with customization capabilities for global use cases. The model is announced via the Hugging Face blog, targeting content moderation and safety classification across modalities. This is relevant to the growing enterprise demand for controllable, deployable safety layers on top of foundation models.

Related guides (3)

Related events (8)

6Hugging Face Blog·1mo ago·source ↗

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

NVIDIA has released Nemotron 3 Nano Omni, a multimodal model targeting long-context understanding across documents, audio, and video modalities. The model is positioned for agentic use cases requiring cross-modal reasoning. It is published via the Hugging Face blog as part of NVIDIA's Nemotron model family. No detailed technical specifications or benchmark results are provided in the available body text.

7The Batch·35h ago·source ↗

Nvidia Nemotron 3 Ultra: hybrid Mamba-transformer open-weights model targeting agentic workloads

Nvidia released Nemotron 3 Ultra, a 550B parameter (55B active) hybrid Mamba-transformer mixture-of-experts model with a 1M token context window, publishing weights, training data, and RL environments under an open license. The model ranks as the highest-scoring U.S. open-weights model on the Artificial Analysis Intelligence Index (47.7-48.2) and is approximately three times faster than comparable open-weights rivals, though it trails leading Chinese models like Kimi K2.6 and DeepSeek V4 Pro on intelligence benchmarks. Nvidia used a novel Multi-Teacher On-Policy Distillation approach with 10+ specialized teacher models and trained using NVFP4 quantization. The release is strategically motivated by Nvidia's interest in a healthy open-weights ecosystem that drives AI semiconductor adoption.

7The Batch·18d ago·source ↗

Nvidia releases Nemotron 3 Super 120B-A12B open-weights model with hybrid Mamba-2/MoE architecture

Nvidia released Nemotron 3 Super 120B-A12B, an open-weights LLM with a hybrid Mamba-2/transformer/MoE architecture that activates only 12B parameters per token and supports up to 1 million token context. The model claims the fastest inference speed in its size class at 442 tokens/second and leads open-weights models on PinchBench agentic task evaluation, outperforming larger models including Kimi K2.5 (1T parameters). Nvidia is releasing weights, training data, and recipes under a permissive commercial license, and plans a $26B five-year investment in open-weights models — framed partly as a strategic response to Chinese labs building capable open-weights models on non-Nvidia hardware.

4Openai Blog·1mo ago·source ↗

SafetyKit scales risk agents with OpenAI's most capable models

SafetyKit, a content moderation and compliance platform, has integrated OpenAI's GPT-5 to power its risk-detection agents. The deployment targets content moderation accuracy and compliance enforcement, positioning itself as a replacement for legacy safety systems. This represents a production enterprise use case of GPT-5 in trust and safety workflows.

6The Batch·17d ago·source ↗

Data Points: NemoClaw enterprise stack, GPT-5.4 mini/nano, Nemotron 3 Nano 4B, Midjourney V8, and Mamba-3

A multi-item roundup covers several AI developments: Nvidia unveiled NemoClaw at GTC 2026, an enterprise software stack integrating with OpenClaw to add security and governance for agentic deployments, with launch partners including Salesforce, Cisco, and CrowdStrike. OpenAI released GPT-5.4 mini and nano, smaller variants optimized for speed with benchmark results on SWE-Bench Pro and OSWorld-Verified, priced at $0.75 and $0.20 per million input tokens respectively. Nvidia also released Nemotron 3 Nano 4B, a hybrid Mamba-Transformer 4B parameter on-device model. Additional items cover Midjourney V8 alpha (5x faster, diffusion-only) and Mamba-3, a 1.5B state space model from CMU and Together.AI with improved accuracy over Mamba-2.

5Hugging Face Blog·1mo ago·source ↗

NVIDIA Llama Nemotron Nano VLM Released on Hugging Face Hub

NVIDIA has released the Llama Nemotron Nano VLM on Hugging Face Hub, a compact vision-language model built on the Llama architecture. The model is part of NVIDIA's Nemotron family targeting efficient multimodal inference. This release makes the model accessible to the broader research and developer community through Hugging Face's model hosting infrastructure.

5Meta Llama·11d ago·source ↗

Meta releases Llama Guard 3 11B Vision for multimodal content safety classification

Meta released Llama Guard 3 11B Vision on Hugging Face, a multimodal safety classifier supporting image-text-to-text inputs built on the Llama 3 architecture. The model extends the Llama Guard safety classification family to handle visual content alongside text. This is relevant to AI safety tooling for multimodal deployments.

5Openai Blog·1mo ago·source ↗

OpenAI Upgrades Moderation API with GPT-4o-Based Multimodal Model

OpenAI has released an updated Moderation API powered by a new model built on GPT-4o, extending content moderation capabilities to both text and images. The update aims to improve accuracy in detecting harmful content, giving developers better tools for building moderation systems. This represents an expansion of OpenAI's safety infrastructure into multimodal domains.