Qwen-Image
qwen-image-6898d3ff·2 events·first seen 1mo agoAliases: Qwen-Image
Co-occurring entities
More like this (12)
Recent events (2)
Qwen-Image: 20B MMDiT Image Foundation Model with Native Text Rendering
Alibaba's Qwen team has released Qwen-Image, a 20B parameter MMDiT (Multimodal Diffusion Transformer) image generation foundation model. The model claims significant advances in complex text rendering capabilities, including multi-line layouts, paragraph-level semantics, and fine-grained typographic details across alphabetic and other language scripts. It also features precise image editing capabilities and is accessible via Qwen Chat and open-weight repositories on HuggingFace and ModelScope.
Qwen-Image-Edit: Image Editing Model with Text Rendering and Dual Visual Control
Alibaba's Qwen team has released Qwen-Image-Edit, a 20B-parameter image editing model built on the Qwen-Image foundation. The model extends Qwen-Image's text rendering capabilities to editing tasks, enabling precise in-image text modification. It uses a dual-path architecture that simultaneously feeds input images into Qwen2.5-VL for semantic control and a VAE Encoder for appearance control, enabling both semantic and appearance-level edits.