OpenMedReason-Bench

benchmark · active · provisional · openmedreason-bench-c5a4d48e · 1 event · first seen 6d ago

Aliases: OpenMedReason-Bench

Recent events (1)

6 · arXiv · cs.CL · 6d ago

OpenMedReason: Large-scale multimodal medical reasoning corpus with 450K instances for clinical VLM training

Researchers introduce OpenMedReason, an open multimodal medical reasoning corpus of 450K instances whose reasoning traces are derived from human-authored biomedical literature rather than synthetic chains of thought. The dataset covers diverse medical imaging modalities and is paired with OpenMedReason-Bench, a held-out benchmark that evaluates large vision-language models (LVLMs) along three axes: perception, medical knowledge, and rationale. Training on OpenMedReason yields a 20% average improvement in VQA accuracy over base models and brings performance to within 4.2% of leading medical VLMs of comparable scale. Both the dataset and the code are publicly released.