Almanac
Qwen1.5-7B

model · active · qwen1-5-7b-c2354669 · 1 event · first seen 1mo ago

Aliases: Qwen1.5-7B

Recent events (1)

Qwen Research · 1mo ago

Qwen1.5-MoE: Matching 7B Model Performance with 1/3 Activated Parameters

Alibaba's Qwen team has released Qwen1.5-MoE-A2.7B, a mixture-of-experts (MoE) model with 2.7 billion activated parameters. The team reports that it matches the performance of 7B dense models such as Mistral 7B and Qwen1.5-7B while activating roughly one-third as many parameters per token, which reduces inference compute. The release follows growing industry interest in MoE architectures sparked by Mixtral. The model is available on GitHub, Hugging Face, and ModelScope.