MMDiT

technique · active · mmdit-abc054df · 1 event · first seen 1mo ago

Aliases: MMDiT

Recent events (1)

7 · Qwen Research · 1mo ago

Qwen-Image: 20B MMDiT Image Foundation Model with Native Text Rendering

Alibaba's Qwen team has released Qwen-Image, a 20B-parameter MMDiT (Multimodal Diffusion Transformer) foundation model for image generation. The team reports significant advances in complex text rendering, including multi-line layouts, paragraph-level semantics, and fine-grained typographic detail, in both alphabetic and non-alphabetic scripts. The model also supports precise image editing. It is available through Qwen Chat and as open weights on Hugging Face and ModelScope.