benchmark
RealWorldQA
benchmarkactive
realworldqa-9dacb720·1 events·first seen 1mo agoAliases: RealWorldQA
Co-occurring entities
More like this (12)
Recent events (1)
Qwen2-VL: Alibaba Releases Latest Vision-Language Model with Extended Video Understanding
Alibaba's Qwen team has released Qwen2-VL, the latest iteration of their vision-language model series built on the Qwen2 foundation. The model claims state-of-the-art performance on visual understanding benchmarks including MathVista, DocVQA, RealWorldQA, and MTVQA. A notable capability is understanding videos exceeding 20 minutes in length for question answering, dialog, and content creation tasks.