Entity · benchmark

SpaceNum

benchmarkactivespacenum-e182394c·1 events·first seen May 25, 2026

Aliases: SpaceNum

Co-occurring entities

Vision-Language Models Space2Num Num2Space

More like this (12)

Space2Num Num2Space Project Numina NuminaMath SpatialWorld SigNoz PageIndex SynMax SciSummNet NuScenes SimSD TabPFN

Recent events (1)

6arXiv · cs.AI·May 25, 2026·source ↗

SPACENUM: Revisiting Spatial Numerical Understanding in VLMs

SpaceNum is a new evaluation framework probing whether Vision-Language Models genuinely ground numerical outputs (coordinates, action magnitudes) in spatial perception, rather than relying on shallow cues. The benchmark defines two bidirectional tasks—Num2Space and Space2Num—across dynamic and static spatial settings. Results show current VLMs perform near random chance on spatial numerical grounding, with explicit reasoning providing only marginal improvement and fine-tuning offering partial gains.

Evaluation and Benchmarking Agent and Tool Ecosystem SpaceNum Vision-Language Models Space2Num +2 more