Entity · model

QwQ-32B

modelactiveqwq-32b-9a24471d·2 events·first seen May 18, 2026

Aliases: QwQ-32B

Co-occurring entities

DeepSeek V4 Hugging Face Alibaba Qwen ModelScope Reinforcement Learning Alibaba Qwen Qwen3-30B-A3B Qwen3-4B OpenAI o3-mini Gemini-2.5-Pro Qwen3-235B OpenAI Grok-3

More like this (12)

QwQ-32B-Preview Qwen2-72B Qwen3-30B-A3B Qwen2-57B-A14B Qwen-7B QVQ-72B-Preview Qwen3-235B Qwen3-4B Qwen1.5-32B Qwen2.5-VL-32B-Instruct Qwen3-30B-A3B-Base Qwen3-8B-Base

Recent events (2)

7Qwen Research·May 18, 2026·source ↗

QwQ-32B: Scaling Reinforcement Learning for Enhanced Reasoning

Alibaba's Qwen team releases QwQ-32B, a 32-billion parameter model trained with scaled Reinforcement Learning to improve reasoning capabilities beyond conventional pretraining and post-training methods. The release draws explicit comparison to DeepSeek R1's cold-start and multi-stage RL training approach. The model is available via Qwen Chat, Hugging Face, ModelScope, and a demo interface. This represents Qwen's exploration of RL scalability as a path to enhanced LLM intelligence.

Frontier Model Releases Evaluation and Benchmarking DeepSeek V4 Alibaba Qwen +6 more

8Qwen Research·May 18, 2026·source ↗

Qwen3 Release: Flagship 235B MoE and Full Model Family Announced

Alibaba's Qwen team has released Qwen3, a new family of large language models including the flagship Qwen3-235B-A22B mixture-of-experts model. The flagship model claims competitive benchmark performance against DeepSeek-R1, OpenAI o1/o3-mini, Grok-3, and Gemini-2.5-Pro on coding, math, and general capabilities. A smaller MoE variant, Qwen3-30B-A3B, reportedly outperforms QwQ-32B despite using only one-tenth the activated parameters, and the 4B model is said to match Qwen2.5's larger models. Models are available across Hugging Face, ModelScope, and Kaggle.

Frontier Model Releases Evaluation and Benchmarking Alibaba Qwen DeepSeek V4 Qwen3-30B-A3B +10 more