Qwen3Guard: Real-time Safety Guardrail Model for Token Stream Classification
Alibaba's Qwen team has released Qwen3Guard, the first dedicated safety guardrail model in the Qwen family, built on Qwen3 foundation models and fine-tuned for safety classification. The model performs real-time safety detection on both prompts and responses, providing risk levels and categorized classifications for content moderation. Qwen3Guard claims state-of-the-art performance on major safety benchmarks across English, Chinese, and multilingual settings.
Related guides (3)
Related events (8)
Qwen releases Qwen3.5-2B multimodal model on Hugging Face
Alibaba's Qwen team released Qwen3.5-2B, a 2-billion-parameter image-text-to-text model, on Hugging Face. The model supports conversational use and is compatible with Azure deployment endpoints. With nearly 2 million downloads, it has seen substantial community uptake.
Qwen releases Qwen3.5-0.8B multimodal model on Hugging Face
Alibaba's Qwen team released Qwen3.5-0.8B, a small-scale image-text-to-text model, on Hugging Face. The model supports conversational use and is compatible with Azure deployment endpoints. With over 2.7 million downloads and 562 likes, it has seen substantial community uptake for a sub-1B parameter multimodal model.
Qwen2.5-Omni: Alibaba Releases End-to-End Multimodal Model with Real-Time Streaming
Alibaba's Qwen team releases Qwen2.5-Omni, a 7B-parameter end-to-end multimodal model capable of processing text, images, audio, and video simultaneously. The model delivers real-time streaming responses in both text and natural speech synthesis. It is openly available on Hugging Face, ModelScope, DashScope, and GitHub, accompanied by a technical paper.
Qwen2-VL: Alibaba Releases Latest Vision-Language Model with Extended Video Understanding
Alibaba's Qwen team has released Qwen2-VL, the latest iteration of their vision-language model series built on the Qwen2 foundation. The model claims state-of-the-art performance on visual understanding benchmarks including MathVista, DocVQA, RealWorldQA, and MTVQA. A notable capability is understanding videos exceeding 20 minutes in length for question answering, dialog, and content creation tasks.
Qwen releases Qwen3.5-4B-Base multimodal model on Hugging Face
Qwen has released Qwen3.5-4B-Base, a 4-billion parameter base model supporting image-text-to-text tasks, published on Hugging Face. The model is tagged as conversational and endpoints-compatible, using the safetensors format. With over 207,000 downloads, it represents a new entry in the Qwen3.5 model family with multimodal capabilities at a small parameter count.
Qwen 3.7 Preview Announced by Alibaba
Alibaba's Qwen team has announced a preview of Qwen 3.7, the next iteration in their Qwen 3 model series. The announcement appeared on Twitter/X and generated notable community discussion on Hacker News with 179 points and 67 comments. Specific capability details and model specifications are not available from this source alone.
Qwen releases Qwen-Image-Bench, a multimodal judge/evaluation model
Qwen has released Qwen-Image-Bench on Hugging Face, an image-text-to-text model tagged as a judge-model for evaluation and benchmarking purposes. The model supports both English and Chinese and appears designed to evaluate text-to-image outputs. With 8,572 downloads and 50 likes shortly after release, it has attracted modest early interest.
Qwen releases Qwen3.5-27B multimodal model on Hugging Face
Qwen has released Qwen3.5-27B, a 27-billion parameter image-text-to-text model, on Hugging Face. The model supports conversational use and is compatible with Azure deployment endpoints. With nearly 3 million downloads and 981 likes, it has seen substantial community uptake.


