Almanac
technique

probing classifiers

technique · active · provisional · probing-classifiers-36daffab · 1 event · first seen 21d ago

Aliases: probing classifiers

Recent events (1)

5 · arXiv · cs.CL · 21d ago

Real Images, Worse Judgments: Evaluating VLMs on Concreteness and Imagery

This paper evaluates whether vision-language models (VLMs) benefit from real image context when making lexical judgments about word concreteness and imagery. The authors find that real-image contexts frequently hurt alignment with human ratings, especially when visual evidence is least relevant to the word being judged. Probing and canonical correlation analysis reveal that real images cause representational shifts and increased sensitivity to spurious visual cues. Instructing models to focus on text-only content at inference time partially mitigates this degradation.