Almanac
dataset

NuScenes

datasetactiveprovisionalnuscenes-2a7f7339·1 events·first seen 8d ago

Aliases: NuScenes

Co-occurring entities

More like this (12)

Recent events (1)

5arXiv · cs.CL·8d ago·source ↗

Benchmark for view-level visual evidence identification in multi-view MLLMs for autonomous driving

A new arXiv preprint introduces a multi-view visual question answering benchmark targeting evidence-source identification in autonomous driving scenarios. Given six synchronized NuScenes camera views and a question, models must identify which camera view supports the answer — not just produce a correct answer. The 122-pair benchmark spans causality, counterfactual reasoning, and intent prediction, and exposes grounding failures that answer-only evaluation misses. The work addresses a meaningful gap between answer accuracy and correct visual grounding in safety-critical multimodal systems.