8Qwen Research (via RSSHub)·1mo ago

Qwen3 Release: Flagship 235B MoE and Full Model Family Announced

Alibaba's Qwen team has released Qwen3, a new family of large language models including the flagship Qwen3-235B-A22B mixture-of-experts model. The flagship model claims competitive benchmark performance against DeepSeek-R1, OpenAI o1/o3-mini, Grok-3, and Gemini-2.5-Pro on coding, math, and general capabilities. A smaller MoE variant, Qwen3-30B-A3B, reportedly outperforms QwQ-32B despite using only one-tenth the activated parameters, and the 4B model is said to match Qwen2.5's larger models. Models are available across Hugging Face, ModelScope, and Kaggle.

Frontier Model Releases Evaluation and Benchmarking Open Weights Progress Inference Economics Alibaba Qwen DeepSeek V4 Qwen3-30B-A3B Qwen3-4B OpenAI o3-mini QwQ-32B Gemini-2.5-Pro Qwen3-235B Hugging Face OpenAI Grok-3

Related guides (4)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Read asBeginner

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

DeepSeek V4

DeepSeek V4: The Open-Weights Giant Reshaping AI Economics

Read asBeginner In-depth

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race from GPT-3 to Safety-Tiered Superintelligence

Read asIn-depth

Related events (8)

7The Batch·18d ago·source ↗

Alibaba releases Qwen3.5 open-weights vision-language model family with MoE architecture across eight sizes

Alibaba released the Qwen3.5 family of eight open-weights vision-language models ranging from 0.8B to 397B parameters, built on a mixture-of-experts architecture with mixed attention and Gated DeltaNet layers. The flagship Qwen3.5-397B-A17B outperforms GPT-5.2, Claude 4.5 Opus, and Gemini-3 Pro on 28 of 44 vision benchmarks, while the 9B model surpasses OpenAI's gpt-oss-120B on most language tasks. Open weights are available under Apache 2.0, with hosted agentic variants (Qwen3.5-Plus, Qwen3.5-Flash) available via Alibaba Cloud. The release is notable for strong small-model efficiency and comes amid reported team departures following the Qwen3 rollout.

Frontier Model Releases Open Weights Progress GPT-5.2 Alibaba Cloud Model Studio Claude Opus 4.6 +10 more

7Qwen Research·1mo ago·source ↗

Qwen2.5-Max: Large-Scale MoE Model Release by Alibaba's Qwen Team

Alibaba's Qwen team announces Qwen2.5-Max, a large-scale Mixture-of-Experts language model. The post acknowledges that scaling insights for very large MoE models have been limited, citing DeepSeek V3's recent disclosures as a reference point. The model is positioned as a frontier-scale MoE system developed concurrently with ongoing Qwen2 research.

Training Infrastructure Frontier Model Releases DeepSeek V4 Alibaba Qwen Team +3 more

8Qwen Research·1mo ago·source ↗

Qwen2 Model Family Released: Five Sizes, 128K Context, Multilingual

Alibaba's Qwen team has released Qwen2, an evolution from Qwen1.5, comprising five pretrained and instruction-tuned models ranging from 0.5B to 72B parameters, including a 57B mixture-of-experts variant (57B-A14B). The release highlights training on 27 additional languages beyond English and Chinese, significantly improved coding and mathematics performance, and extended context support up to 128K tokens for the 7B and 72B instruct variants. Benchmark results are claimed to be state-of-the-art across a large number of evaluations.

Long Context Evolution Frontier Model Releases Qwen2-72B Qwen2.5 Qwen2-57B-A14B +4 more

7Qwen Research·1mo ago·source ↗

Introducing Qwen1.5: Open-Source Models Across Eight Sizes Including MoE

Alibaba's Qwen team released Qwen1.5, open-sourcing both base and chat models in eight sizes ranging from 0.5B to 110B parameters, plus a Mixture-of-Experts (MoE) variant. The release emphasizes developer experience improvements alongside model quality. Models are available on GitHub, Hugging Face, and ModelScope.

Frontier Model Releases Open Weights Progress Qwen1.5 Mixture of Experts Hugging Face +3 more

6Qwen·15d ago·source ↗

Qwen releases Qwen3.5-35B-A3B-Base multimodal MoE model on Hugging Face

Qwen has released Qwen3.5-35B-A3B-Base, a 35B-parameter mixture-of-experts image-text-to-text base model on Hugging Face, activating approximately 3B parameters per forward pass. The model supports conversational use and is compatible with Azure deployment endpoints. With over 109K downloads, it represents a notable open-weights multimodal MoE release from the Qwen team.

Frontier Model Releases Open Weights Progress Qwen3.5-35B-A3B-Base Qwen Hugging Face +1 more

7Qwen·15d ago·source ↗

Qwen releases Qwen3.5-122B-A10B multimodal MoE model on Hugging Face

Qwen has released Qwen3.5-122B-A10B, a 122B-parameter mixture-of-experts image-text-to-text model with 10B active parameters, published on Hugging Face. The model supports conversational use and is compatible with Azure deployment endpoints. High download counts (840K) and likes (564) suggest rapid community uptake shortly after release.

Frontier Model Releases Open Weights Progress Microsoft Azure Qwen Qwen3.5-122B-A10B +2 more

7Qwen·15d ago·source ↗

Qwen releases Qwen3.5-35B-A3B multimodal MoE model on Hugging Face

Qwen has released Qwen3.5-35B-A3B, a 35B-parameter mixture-of-experts image-text-to-text model with approximately 3B active parameters, published on Hugging Face. The model supports conversational use and is compatible with Azure deployment endpoints. With over 2.8 million downloads and 1,400+ likes, it has seen substantial community uptake.

Frontier Model Releases Open Weights Progress Qwen3.5-35B-A3B Qwen +1 more

8Qwen Research·1mo ago·source ↗

Qwen2.5-VL: Alibaba's New Flagship Vision-Language Model Released in 3B/7B/72B Sizes

Alibaba's Qwen team has released Qwen2.5-VL, their new flagship vision-language model, representing a significant upgrade over Qwen2-VL. The release includes both base and instruct variants in three sizes (3B, 7B, 72B), all open-weighted and available on Hugging Face and ModelScope. The 72B instruct model is also accessible via Qwen Chat. Key capabilities highlighted include enhanced visual understanding, with the model positioned as a major step forward in multimodal performance.

Frontier Model Releases Open Weights Progress Qwen2.5-VL Qwen Chat Hugging Face +3 more