Entity · benchmark

MTVQA

benchmarkactivemtvqa-87a25085·1 events·first seen May 18, 2026

Aliases: MTVQA

Co-occurring entities

Qwen2.5-VL RealWorldQA DocVQA MathVista Qwen2.5 Alibaba

More like this (12)

CXR-VQA VQ-VAE VQA-RAD ChartQA QVQ-Max DocVQA GQA IndQA PubMedQA MedQA MedMCQA SimpleQA

Recent events (1)

7Qwen Research·May 18, 2026·source ↗

Qwen2-VL: Alibaba Releases Latest Vision-Language Model with Extended Video Understanding

Alibaba's Qwen team has released Qwen2-VL, the latest iteration of their vision-language model series built on the Qwen2 foundation. The model claims state-of-the-art performance on visual understanding benchmarks including MathVista, DocVQA, RealWorldQA, and MTVQA. A notable capability is understanding videos exceeding 20 minutes in length for question answering, dialog, and content creation tasks.

Frontier Model Releases Evaluation and Benchmarking Qwen2.5-VL RealWorldQA DocVQA +6 more