Almanac
technique

counterfactual chart generation

techniqueactiveprovisionalcounterfactual-chart-generation-7643db65·1 events·first seen 21d ago

Aliases: counterfactual chart generation

Co-occurring entities

More like this (12)

Recent events (1)

5arXiv · cs.CL·21d ago·source ↗

Chartographer: Counterfactual Chart Generation for Evaluating Vision-Language Models

Chartographer is a framework for generating counterfactual chart variants to rigorously evaluate visual reasoning in vision-language models (VLMs), addressing the problem of shortcut-taking and prior knowledge exploitation in chart QA benchmarks. The system reverse-engineers charts into executable code, generates seed-controlled variants, and derives new ground-truth answers via executable QA logic. Evaluation of proprietary and open-source VLMs reveals that models frequently fail to generalize to counterfactual charts even after correctly answering the original, with failures most common when novel visual reasoning pathways are required.