Entity · benchmark

VisAnomBench

benchmarkactivevisanombench-c881cd74·1 events·first seen May 29, 2026

Aliases: VisAnomBench

Co-occurring entities

time-series anomaly detection Vision-Language Models TSB-AD-U VisAnomReasoner

More like this (12)

AdvBench SorryBench VerifierBench AdversaBench OmniaBench ProverBench SelectBench MemBench VR-Bench TriViewBench LiveBench EvoBench

Recent events (1)

5arXiv · cs.AI·May 29, 2026·source ↗

VisAnomReasoner: Efficient VLM for Time-Series Anomaly Detection via VisAnomBench

Researchers introduce VisAnomBench, a curated benchmark augmenting public time-series anomaly datasets with natural-language rationales generated and selected from multiple large VLMs using task-specific rewards. Fine-tuning on this benchmark produces VisAnomReasoner, a parameter-efficient vision-language model that outperforms all baselines by at least 21.23 and 23.87 percentage points in precision and F1 on VisAnomBench. Cross-benchmark evaluation on TSB-AD-U shows further generalization gains of 9.57 and 13.39 percentage points in precision and F1, respectively.

Evaluation and Benchmarking Agent and Tool Ecosystem time-series anomaly detection Vision-Language Models TSB-AD-U +3 more