Entity · product

ModelScope

productactivemodelscope-28205287·21 events·first seen May 18, 2026

Aliases: ModelScope

Co-occurring entities

Alibaba Hugging Face Qwen HuggingFace Alibaba Qwen Team Qwen3 Qwen2.5 Alibaba Qwen Mixture of Experts CodeQwen1.5 Qwen Chat ms-swift GRPO DPO Llama FunASR OpenAI-compatible API Qwen-7B Qwen1.5 Mixtral

More like this (12)

TimeScope Model Spec SCOPE PlanetScope DashScope SCOPE-RL Llama Scope AgentScope Model Context Protocol Gemma Scope 2 OpenAI Model Spec OpenAI Microscope

Recent events (21)

5Github Trending·Jun 12, 2026·source ↗

ms-swift: ModelScope framework for fine-tuning 600+ LLMs and 300+ MLLMs

ms-swift is an open-source Python framework from ModelScope supporting PEFT and full-parameter fine-tuning methods (CPT, SFT, DPO, GRPO) across 600+ LLMs and 300+ multimodal LLMs, including Qwen3, DeepSeek, Llama4, and others. The project has accumulated 14,487 GitHub stars and was accepted at AAAI 2025. It serves as a broad-coverage training harness for the current generation of open-weights frontier models.

Open Weights Progress Agent and Tool Ecosystem ms-swift GRPO DPO +3 more

4Github Trending·May 26, 2026·source ↗

FunASR: Industrial-Grade Speech Recognition Toolkit with 170x Realtime Performance

FunASR is an open-source speech recognition toolkit from ModelScope supporting 50+ languages, speaker diarization, emotion detection, and streaming inference at 170x realtime speed. It exposes an OpenAI-compatible API, positioning it as a drop-in alternative for production ASR workloads. The repository has accumulated 16,317 stars with modest daily momentum (+42 today).

Open Weights Progress Agent and Tool Ecosystem FunASR ModelScope OpenAI-compatible API

4Qwen Research·May 18, 2026·source ↗

Introducing the Qwen Series: Overview of Alibaba's Open-Source LLM Journey

Alibaba's Qwen team published a retrospective introduction to the Qwen series of large language models, four months after the initial Qwen-7B open-source release. The post consolidates links to their paper, GitHub, Hugging Face, and ModelScope repositories, and outlines the team's objectives for the open-source LLM program. It serves as a canonical reference point for the Qwen model family's public positioning.

Frontier Model Releases Open Weights Progress Alibaba Qwen-7B Qwen +2 more

7Qwen Research·May 18, 2026·source ↗

Introducing Qwen1.5: Open-Source Models Across Eight Sizes Including MoE

Alibaba's Qwen team released Qwen1.5, open-sourcing both base and chat models in eight sizes ranging from 0.5B to 110B parameters, plus a Mixture-of-Experts (MoE) variant. The release emphasizes developer experience improvements alongside model quality. Models are available on GitHub, Hugging Face, and ModelScope.

Frontier Model Releases Open Weights Progress Qwen1.5 Mixture of Experts Hugging Face +3 more

6Qwen Research·May 18, 2026·source ↗

Qwen1.5-MoE: Matching 7B Model Performance with 1/3 Activated Parameters

Alibaba's Qwen team releases Qwen1.5-MoE-A2.7B, a mixture-of-experts model with only 2.7 billion activated parameters that claims performance parity with 7B dense models such as Mistral 7B and Qwen1.5-7B. The model activates roughly one-third of its total parameters during inference, offering significant compute efficiency gains. This release follows growing industry interest in MoE architectures sparked by Mixtral, and the model is available on GitHub, HuggingFace, and ModelScope.

Frontier Model Releases Open Weights Progress Mixtral Qwen1.5-MoE-A2.7B Qwen1.5-7B +6 more

6Qwen Research·May 18, 2026·source ↗

Qwen1.5-32B: Alibaba's 30B-Parameter Capstone for the Qwen1.5 Series

Alibaba's Qwen team released Qwen1.5-32B, a ~30 billion parameter open-weights language model positioned as the capstone of the Qwen1.5 series. The model targets the emerging consensus around 30B parameters as an optimal balance between performance, memory footprint, and inference efficiency. It is released alongside code on GitHub, weights on HuggingFace and ModelScope, and an interactive demo.

Frontier Model Releases Open Weights Progress Qwen1.5-72B DBRX Qwen1.5-32B +4 more

5Qwen Research·May 18, 2026·source ↗

CodeQwen1.5: Alibaba's Open-Source Code LLM Release

Alibaba's Qwen team released CodeQwen1.5, an open-source large language model specialized for code generation and programming assistance. The release is positioned as a transparent, accessible alternative to proprietary coding assistants like GitHub Copilot, addressing concerns around cost, privacy, security, and copyright. The model is available on GitHub, HuggingFace, and ModelScope.

Open Weights Progress Agent and Tool Ecosystem CodeQwen1.5 Alibaba Qwen +3 more

6Qwen Research·May 18, 2026·source ↗

Qwen2-Audio: Multimodal Audio-Language Model Release

Alibaba's Qwen team releases Qwen2-Audio, the successor to Qwen-Audio, capable of accepting both audio and text inputs and generating text outputs. The model is positioned as a step toward AGI by extending large language model capabilities to audio modalities. It is released with accompanying paper, GitHub repository, and model weights on Hugging Face and ModelScope.

Frontier Model Releases Open Weights Progress Alibaba Qwen Hugging Face +3 more

6Qwen Research·May 18, 2026·source ↗

Qwen2.5-Coder Released: Next-Generation Open-Source Coding Model

Alibaba's Qwen team has released Qwen2.5-Coder, the next generation of their open-source coding-specialized language model, succeeding CodeQwen1.5 which launched in April 2024. The release also marks a rebranding from CodeQwen to Qwen-Coder. The model is available on GitHub, Hugging Face, and ModelScope.

Open Weights Progress Agent and Tool Ecosystem CodeQwen1.5 Hugging Face ModelScope +2 more

8Qwen Research·May 18, 2026·source ↗

Qwen2.5-LLM: Alibaba releases open-weight language models from 0.5B to 72B

Alibaba's Qwen team releases the Qwen2.5 series of decoder-only dense language models, open-sourcing seven variants spanning 0.5B to 72B parameters. The release targets production use cases in the 10-30B range and mobile deployments at 3B scale. This represents a significant expansion of the open-weights frontier from a Tier 1 Chinese AI lab.

Frontier Model Releases Open Weights Progress Qwen2.5 Alibaba Qwen Team +4 more

8Qwen Research·May 18, 2026·source ↗

Qwen2.5: Large-Scale Open-Source Foundation Model Family Release

Alibaba's Qwen team has released Qwen2.5, described as potentially the largest open-source model release in history, following three months of development after Qwen2. The release encompasses a family of foundation models with improvements in knowledge and reasoning capabilities. The announcement targets developers who have been building on Qwen2 and incorporates feedback from that community.

Frontier Model Releases Open Weights Progress Qwen2.5 Alibaba Hugging Face +2 more

7Qwen Research·May 18, 2026·source ↗

Qwen2.5-Turbo Extends Context Length to 1M Tokens

Alibaba's Qwen team has released Qwen2.5-Turbo, extending the model's context window from 128K to 1 million tokens (approximately 1 million English words). The update includes optimizations for both model capabilities and inference performance at extreme context lengths. The model is available via API and through HuggingFace and ModelScope demos.

Long Context Evolution Frontier Model Releases Qwen2.5 Alibaba ModelScope +3 more

7Qwen Research·May 18, 2026·source ↗

QwQ-32B-Preview: Alibaba's Qwen Reasoning Model with Deep Reflection Capabilities

Alibaba's Qwen team has released QwQ-32B-Preview, a 32-billion parameter model designed for deep reasoning across mathematics, code, and general knowledge. The model is positioned as a reasoning-focused system that emphasizes uncertainty and iterative questioning as core design principles. It is available on GitHub, Hugging Face, ModelScope, and via a demo interface.

Frontier Model Releases Evaluation and Benchmarking Alibaba QwQ-32B-Preview Qwen +3 more

7Qwen Research·May 18, 2026·source ↗

QVQ-72B-Preview: Qwen Visual Reasoning Model Release

Alibaba's Qwen team has released QVQ-72B-Preview, a 72-billion parameter multimodal model designed to integrate visual understanding with advanced reasoning capabilities. The model is positioned as an extension of Qwen's language reasoning work into the visual domain. It is available on GitHub, Hugging Face, ModelScope, and Kaggle with a live demo.

Frontier Model Releases Open Weights Progress Alibaba Qwen QVQ-72B-Preview +3 more

6Qwen Research·May 18, 2026·source ↗

Qwen2.5-Math Process Reward Model for Mathematical Reasoning Supervision

Alibaba's Qwen team introduces a process reward model (PRM) aimed at improving the reliability of mathematical reasoning in LLMs by supervising intermediate reasoning steps rather than only final answers. The work addresses the problem of models producing plausible but flawed intermediate derivations even when reaching correct conclusions. The release includes model weights on HuggingFace and ModelScope alongside a GitHub repository.

Evaluation and Benchmarking Open Weights Progress Process Reward Model Alibaba Qwen +4 more

8Qwen Research·May 18, 2026·source ↗

Qwen2.5-VL: Alibaba's New Flagship Vision-Language Model Released in 3B/7B/72B Sizes

Alibaba's Qwen team has released Qwen2.5-VL, their new flagship vision-language model, representing a significant upgrade over Qwen2-VL. The release includes both base and instruct variants in three sizes (3B, 7B, 72B), all open-weighted and available on Hugging Face and ModelScope. The 72B instruct model is also accessible via Qwen Chat. Key capabilities highlighted include enhanced visual understanding, with the model positioned as a major step forward in multimodal performance.

Frontier Model Releases Open Weights Progress Qwen2.5-VL Qwen Chat Hugging Face +3 more

7Qwen Research·May 18, 2026·source ↗

QwQ-32B: Scaling Reinforcement Learning for Enhanced Reasoning

Alibaba's Qwen team releases QwQ-32B, a 32-billion parameter model trained with scaled Reinforcement Learning to improve reasoning capabilities beyond conventional pretraining and post-training methods. The release draws explicit comparison to DeepSeek R1's cold-start and multi-stage RL training approach. The model is available via Qwen Chat, Hugging Face, ModelScope, and a demo interface. This represents Qwen's exploration of RL scalability as a path to enhanced LLM intelligence.

Frontier Model Releases Evaluation and Benchmarking DeepSeek V4 Alibaba Qwen +6 more

7Qwen Research·May 18, 2026·source ↗

Qwen2.5-Omni: Alibaba Releases End-to-End Multimodal Model with Real-Time Streaming

Alibaba's Qwen team releases Qwen2.5-Omni, a 7B-parameter end-to-end multimodal model capable of processing text, images, audio, and video simultaneously. The model delivers real-time streaming responses in both text and natural speech synthesis. It is openly available on Hugging Face, ModelScope, DashScope, and GitHub, accompanied by a technical paper.

Frontier Model Releases Open Weights Progress Alibaba Qwen2.5-Omni Qwen +5 more

7Qwen Research·May 18, 2026·source ↗

Qwen3 Embedding: State-of-the-Art Text Embedding and Reranking Models Released

Alibaba's Qwen team has released the Qwen3 Embedding series, a set of open-weights text embedding and reranking models built on the Qwen3 foundation model. The models are designed for retrieval and reranking tasks and claim state-of-the-art performance across multiple benchmarks. They are released under the Apache 2.0 license and are available on Hugging Face and ModelScope.

Evaluation and Benchmarking Open Weights Progress Qwen3 Embedding Alibaba Qwen Apache 2.0 +5 more

7Qwen Research·May 18, 2026·source ↗

Qwen-Image: 20B MMDiT Image Foundation Model with Native Text Rendering

Alibaba's Qwen team has released Qwen-Image, a 20B parameter MMDiT (Multimodal Diffusion Transformer) image generation foundation model. The model claims significant advances in complex text rendering capabilities, including multi-line layouts, paragraph-level semantics, and fine-grained typographic details across alphabetic and other language scripts. It also features precise image editing capabilities and is accessible via Qwen Chat and open-weight repositories on HuggingFace and ModelScope.

Frontier Model Releases Open Weights Progress Alibaba Qwen Qwen-Image Qwen Chat +4 more

6Qwen Research·May 18, 2026·source ↗

Qwen3Guard: Real-time Safety Guardrail Model for Token Stream Classification

Alibaba's Qwen team has released Qwen3Guard, the first dedicated safety guardrail model in the Qwen family, built on Qwen3 foundation models and fine-tuned for safety classification. The model performs real-time safety detection on both prompts and responses, providing risk levels and categorized classifications for content moderation. Qwen3Guard claims state-of-the-art performance on major safety benchmarks across English, Chinese, and multilingual settings.

Frontier Model Releases AI Safety Research Qwen3Guard Alibaba Qwen Hugging Face +3 more