Almanac
paper

A Multi-Domain Benchmark for Detecting AI-Generated Text-Rich Images from GPT-Image-2

paperactiveprovisionala-multi-domain-benchmark-for-detecting-ai-generated-text-rich-images-from-gpt-image-2-545e818b·1 events·first seen 3d ago

Aliases: A Multi-Domain Benchmark for Detecting AI-Generated Text-Rich Images from GPT-Image-2

Co-occurring entities

More like this (12)

Recent events (1)

5arXiv · cs.AI·3d ago·source ↗

Multi-domain benchmark for detecting AI-generated text-rich images from GPT-Image-2

Researchers introduce a new benchmark of 8,602 images across six categories (commercial posters, infographics, academic posters, receipts, tables, UI screenshots) specifically for detecting AI-generated text-rich images produced by OpenAI's GPT-Image-2. Five zero-shot detectors are evaluated, revealing highly domain-dependent performance and severe sensitivity to JPEG compression even in the strongest conventional detector. A multimodal VLM is also explored as a detector, showing promise but limitations on structured formats. The work highlights a gap in existing benchmarks that focus on object-centric rather than text-layout-centric images.