Entity · benchmark

TABVERSE

benchmarkactivetabverse-b5b6d378·1 events·first seen Jun 9, 2026

Aliases: TABVERSE

More like this (12)

Tabnine Text Aphasia Battery (TAB)BITEMBED ViT-Base NAVER LiTBench AdvBench TEVI ATE-Bench VCT Tribev2 VISTA

Recent events (1)

4arXiv · cs.CL·Jun 9, 2026·source ↗

TABVERSE benchmark isolates table representation effects across formats in LLMs and VLMs

TABVERSE is a new controlled multimodal benchmark that evaluates LLMs and VLMs on table understanding by holding table content fixed while varying representation format (HTML, Markdown, LaTeX, rendered images). Evaluation across three tasks—Question Answering, Structural Understanding, and Structure Reconstruction—shows that representation choice substantially affects performance, with structured text generally outperforming rendered images and HTML being the most robust text format. The benchmark addresses a gap in existing evaluations where content, format, and modality vary simultaneously, making it impossible to isolate representation effects.

Evaluation and Benchmarking Multimodal Progress TABVERSE