paper

A Multi-Domain Benchmark for Detecting AI-Generated Text-Rich Images from GPT-Image-2

paperactiveprovisionala-multi-domain-benchmark-for-detecting-ai-generated-text-rich-images-from-gpt-image-2-545e818b·1 events·first seen 3d ago

Aliases: A Multi-Domain Benchmark for Detecting AI-Generated Text-Rich Images from GPT-Image-2

Co-occurring entities

GPT-Image-2 OpenAI

More like this (12)

Massive Text Embedding Benchmark Artificial Analysis Text to Image Artificial Analysis Text to Image Leaderboard GPT-4o Image Generation AI Reproducibility Benchmark AI image verification NAMESAKES: Probing Identity Memorization in Text-to-Image Models OpAI-Bench Subject-driven Image Generation third-party AI evaluations Adversarial Creation and Detection of AI-Generated Social Bot Content Image GPT

Recent events (1)

5arXiv · cs.AI·3d ago·source ↗

Multi-domain benchmark for detecting AI-generated text-rich images from GPT-Image-2

Researchers introduce a new benchmark of 8,602 images across six categories (commercial posters, infographics, academic posters, receipts, tables, UI screenshots) specifically for detecting AI-generated text-rich images produced by OpenAI's GPT-Image-2. Five zero-shot detectors are evaluated, revealing highly domain-dependent performance and severe sensitivity to JPEG compression even in the strongest conventional detector. A multimodal VLM is also explored as a detector, showing promise but limitations on structured formats. The work highlights a gap in existing benchmarks that focus on object-centric rather than text-layout-centric images.

Evaluation and Benchmarking Multimodal Progress GPT-Image-2 OpenAI A Multi-Domain Benchmark for Detecting AI-Generated Text-Rich Images from GPT-Image-2