5 · Meta Llama (HuggingFace model releases) · 11d ago

Meta releases Llama Prompt Guard 2 (22M) safety classifier on Hugging Face

Meta released Llama Prompt Guard 2-22M, a lightweight 22-million-parameter text classification model for prompt safety, on Hugging Face under the meta-llama organization. The model is based on the DeBERTa-v2 architecture and is tagged for safety use cases, including prompt injection and jailbreak detection. It is part of the Llama 4 safety tooling ecosystem and supports English and French.

Frontier Model Releases AI Safety Research Hugging Face Llama Prompt Guard 2-86M DeBERTa-v3 Meta

Related guides (3)

Hugging Face

Hugging Face: The Home of Open-Source AI

Frontier Model Releases · Topic guide

Frontier Model Releases: The Race From Language to Action

AI Safety Research · Topic guide

AI Safety Research: From Lab Policies to Real-World Flashpoints

Related events (8)

5 · Meta Llama · 11d ago

Meta releases Llama Prompt Guard 2 (86M) for prompt injection and jailbreak detection

Meta released Llama Prompt Guard 2-86M, a DeBERTa-v2-based text classification model on Hugging Face designed for safety filtering, specifically prompt injection and jailbreak detection. The model is tagged with llama4, suggesting it is part of the Llama 4 safety tooling ecosystem. With over 122K downloads, it has seen meaningful early adoption.

Frontier Model Releases AI Safety Research Hugging Face Llama Prompt Guard 2-86M DeBERTa-v3

5 · Meta Llama · 11d ago

Meta releases Llama Guard 3 1B safety classifier on Hugging Face

Meta released Llama Guard 3 1B, a compact 1-billion-parameter text-generation model designed for content safety classification, published on Hugging Face. The model is part of the Llama Guard 3 family and supports multiple languages including English, German, and French. Its small size makes it suitable for lightweight safety filtering in production deployments.

Open Weights Progress AI Safety Research Hugging Face Meta Llama Guard 3 1B

6 · Meta Llama · 11d ago

Meta releases Llama Guard 4 12B multimodal safety classifier on Hugging Face

Meta released Llama Guard 4 12B, a multimodal (image-text-to-text) safety classification model built on the Llama 4 architecture, published to Hugging Face. The model is designed for conversational safety filtering and supports both text and image inputs. With 143K downloads and 102 likes shortly after release, it is seeing meaningful early adoption.

Open Weights Progress AI Safety Research Hugging Face Llama Llama Guard 4

5 · Meta Llama · 11d ago

Meta releases Llama Guard 3 11B Vision for multimodal content safety classification

Meta released Llama Guard 3 11B Vision on Hugging Face, a multimodal (image-text-to-text) safety classifier built on the Llama 3 architecture. The model extends the Llama Guard safety classification family to handle visual content alongside text, which makes it relevant to AI safety tooling for multimodal deployments.

Open Weights Progress AI Safety Research Llama Guard 3 11B Vision Llama 3 Hugging Face

5 · Hugging Face Blog · 1mo ago

Llama Guard 4 Released on Hugging Face Hub

Meta's Llama Guard 4 safety classifier has been made available on the Hugging Face Hub. Llama Guard 4 is a content moderation model designed to detect unsafe inputs and outputs in LLM pipelines. The Hugging Face blog post announces its availability and integration into the Hub ecosystem, continuing the Llama Guard series of safety-focused models.

Open Weights Progress AI Safety Research Hugging Face Llama Guard 4 Meta

7 · Meta Llama · 11d ago

Meta releases Llama 4 Scout 17B-16E instruct model on Hugging Face

Meta released Llama 4 Scout, a mixture-of-experts instruct model with 17B active parameters and 16 experts and image-text-to-text (multimodal) capabilities, on Hugging Face under the meta-llama organization. The model supports multiple languages including Arabic, German, and English. With over 420K downloads and 1,300 likes shortly after release, it is seeing significant community uptake.

Frontier Model Releases Open Weights Progress Hugging Face Meta Llama 4 Scout 17B-16E

6 · Meta Llama · 11d ago

Meta releases Llama 3.2-3B open-weights text generation model

Meta released Llama 3.2-3B, a 3-billion-parameter open-weights language model, on Hugging Face under the meta-llama organization. The model supports multiple languages including English, German, French, and Italian, and uses the standard transformers/safetensors format. With over 900K downloads and 800+ likes, it has seen substantial community adoption.

Frontier Model Releases Open Weights Progress Llama 3.2 Meta

7 · Meta Llama · 11d ago

Meta releases Llama 3.2 11B Vision Instruct multimodal model

Meta released Llama 3.2 11B Vision Instruct on Hugging Face, an open-weights multimodal model supporting image-text-to-text tasks. The model is part of the Llama 3.2 family and supports English and German. With over 157K downloads and 1,600 likes, it has seen substantial community adoption.

Open Weights Progress Multimodal Progress Hugging Face Meta Llama 3.2 90B Vision-Instruct