model

RadGrounder

modelactiveprovisionalradgrounder-061e7a36·1 events·first seen 47h ago

Aliases: RadGrounder

Co-occurring entities

RefRad2D Slake VQA-RAD

More like this (12)

Graph RAG Grok-3 Grok AdvGRPO GRPO GRUFF Groq GR00T N1.5 Gopher Grok 4 Grok 4.3 RAG

Recent events (1)

5arXiv · cs.CL·47h ago·source ↗

RefRad2D dataset and RadGrounder model enable spatially grounded radiology VLMs without manual annotations

Researchers introduce RefRad2D, a 1.2M-pair bilingual (German/English) CT and MR image-text dataset generated automatically via LLM curation and automated segmentation, requiring no manual spatial annotations. The accompanying RadGrounder model jointly performs report generation, VQA, and spatial grounding via bounding-box or segmentation outputs. On external benchmarks Slake and VQA-RAD, RadGrounder matches specialized medical VLMs while adding grounding supervision without degrading language quality. The work demonstrates that large-scale automatically curated clinical data can transfer to downstream medical VQA tasks.

Evaluation and Benchmarking Multimodal Progress RefRad2D Slake RadGrounder +1 more