Entity · benchmark

DocVQA

benchmarkactivedocvqa-58ab5199·2 events·first seen May 18, 2026

Aliases: DocVQA

Co-occurring entities

MathVista Google Cloud Mistral AI MT-Bench Claude 3.5 Sonnet Mistral Large 2 GPT-4o Mistral Research License Microsoft Azure Pixtral Large Gemini-2.5-Pro LMSys Vision Leaderboard ChartQA Mistral Large 24.11 Qwen2.5-VL RealWorldQA Qwen2.5 Alibaba MTVQA

More like this (12)

CiteVQA CXR-VQA VQ-VAE EG-VQA VQA-RAD Document Visual Question Answering OfficeQA Pro TableQA ChartQA SimpleQA VQ-Diffusion MTVQA

Recent events (2)

7Mistral Ai News·May 18, 2026·source ↗

Pixtral Large: Mistral AI's 124B Open-Weights Multimodal Model

Mistral AI released Pixtral Large, a 124B open-weights multimodal model built on Mistral Large 2, featuring a 1B parameter vision encoder and 128K context window supporting at least 30 high-resolution images. The model claims state-of-the-art results on MathVista, DocVQA, and ChartQA, outperforming GPT-4o and Gemini-1.5 Pro on several benchmarks, and leads the LMSys Vision Leaderboard among open-weights models by ~50 ELO points. Simultaneously, Mistral updated its text model to Mistral Large 24.11 with improvements in long-context understanding, function calling, and RAG/agentic workflows. Note: the model has since been deprecated and replaced by newer Mistral vision models.

Frontier Model Releases Evaluation and Benchmarking Google Cloud Mistral AI MT-Bench +15 more

7Qwen Research·May 18, 2026·source ↗

Qwen2-VL: Alibaba Releases Latest Vision-Language Model with Extended Video Understanding

Alibaba's Qwen team has released Qwen2-VL, the latest iteration of their vision-language model series built on the Qwen2 foundation. The model claims state-of-the-art performance on visual understanding benchmarks including MathVista, DocVQA, RealWorldQA, and MTVQA. A notable capability is understanding videos exceeding 20 minutes in length for question answering, dialog, and content creation tasks.

Frontier Model Releases Evaluation and Benchmarking Qwen2.5-VL RealWorldQA DocVQA +6 more