Entity · benchmark

RealWorldQA

benchmarkactiverealworldqa-9dacb720·1 events·first seen May 18, 2026

Aliases: RealWorldQA

Co-occurring entities

Qwen2.5-VL DocVQA MathVista Qwen2.5 Alibaba MTVQA

More like this (12)

StrategyQA SimpleQA TruthfulQA FreshQA ResearchQA TableQA OfficeQA Pro ChartQA PubMedQA IndQA Protocol QA CommonsenseQA

Recent events (1)

7Qwen Research·May 18, 2026·source ↗

Qwen2-VL: Alibaba Releases Latest Vision-Language Model with Extended Video Understanding

Alibaba's Qwen team has released Qwen2-VL, the latest iteration of their vision-language model series built on the Qwen2 foundation. The model claims state-of-the-art performance on visual understanding benchmarks including MathVista, DocVQA, RealWorldQA, and MTVQA. A notable capability is understanding videos exceeding 20 minutes in length for question answering, dialog, and content creation tasks.

Frontier Model Releases Evaluation and Benchmarking Qwen2.5-VL RealWorldQA DocVQA +6 more