4arXiv cs.CL (Computation and Language)·29d ago

Image-Semantic Guided Detection of AI-Generated Modern Chinese Poetry Using MLLMs

This paper proposes a multimodal detection method for identifying AI-generated modern Chinese poetry by incorporating images that reflect poetic content alongside text. The approach uses example-driven prompting to integrate meaning, imagery, and emotional cues from images as a complement to textual analysis. A Gemini-based detector using this method achieves 85.65% Macro-F1, outperforming both plain-text LLM baselines and the traditional RoBERTa detector. The work extends AI-generated content detection research into a domain—modern Chinese poetry—previously unaddressed by prior studies.

Evaluation and Benchmarking Multimodal Progress RoBERTa image-semantic guided poetry detection modern Chinese poetry AI detection Macro-F1 Gemini

Related guides (2)

Multimodal ProgressTopic guide

Multimodal Progress: How AI Learned to See, Hear, and Act

Read asBeginner In-depth

Evaluation and BenchmarkingTopic guide

Evaluation and Benchmarking: How We Measure AI — and Why It Keeps Getting Harder

Read asBeginner In-depth

Related events (8)

5arXiv · cs.AI·2d ago·source ↗

Multi-domain benchmark for detecting AI-generated text-rich images from GPT-Image-2

Researchers introduce a new benchmark of 8,602 images across six categories (commercial posters, infographics, academic posters, receipts, tables, UI screenshots) specifically for detecting AI-generated text-rich images produced by OpenAI's GPT-Image-2. Five zero-shot detectors are evaluated, revealing highly domain-dependent performance and severe sensitivity to JPEG compression even in the strongest conventional detector. A multimodal VLM is also explored as a detector, showing promise but limitations on structured formats. The work highlights a gap in existing benchmarks that focus on object-centric rather than text-layout-centric images.

Evaluation and Benchmarking Multimodal Progress GPT-Image-2 OpenAI A Multi-Domain Benchmark for Detecting AI-Generated Text-Rich Images from GPT-Image-2

3arXiv · cs.CL·9d ago·source ↗

PoetryQwen: LoRA-fine-tuned Qwen2.5-14B for classical Chinese poetry understanding with new 49K dataset

Researchers introduce CCPoetry-49K, a 49,404-pair instruction dataset for classical Chinese poetry appreciation, decomposed into term interpretation, semantic interpretation, and emotional inference subtasks. They fine-tune Qwen2.5-14B using LoRA to produce PoetryQwen, achieving a 9.7% improvement over the baseline on the CCL25-Eval Task 5 benchmark (0.757 vs 0.690). The work addresses a gap in domain-specific LLM adaptation for classical Chinese literary tasks.

Open Weights Progress CCPoetry-49K Qwen2.5-14B LoRA (Low-Rank Adaptation)+2 more

5arXiv · cs.CL·12d ago·source ↗

Adversarial methodology improves detection of AI-generated social bot content

Researchers introduce an adversarial framework that simulates malicious actors impersonating real social media users to generate training data for AI-content detection. The approach produces a multilingual, cross-platform dataset of paired human and AI-generated messages. Models trained on this adversarial data significantly outperform existing content-based bot detection systems on out-of-distribution real-world data.

Evaluation and Benchmarking AI Safety Research Adversarial Creation and Detection of AI-Generated Social Bot Content

5Openai Blog·1mo ago·source ↗

New AI classifier for indicating AI-written text

OpenAI launched a classifier designed to distinguish between AI-generated and human-written text. The tool was positioned as an aid for detecting content produced by large language models. OpenAI acknowledged limitations including unreliability on short texts and non-English content, and noted the classifier should not be used as a sole decision-making tool.

Evaluation and Benchmarking AI Safety Research OpenAI AI Text Classifier OpenAI

5arXiv · cs.AI·24d ago·source ↗

Social Gaze Consistency as a Semantic Cue for AI-Generated Image Detection

This paper introduces Social Gaze Consistency (SGC), a high-level semantic detection axis based on the mutual coherence of gaze direction, head-eye alignment, and pupil placement between interacting individuals in images. The authors construct a controlled diagnostic dataset with region-specific gaze perturbations and a Block-Compositional Caption Supervision scheme to train detectors without generator-fingerprint memorization shortcuts. Cross-architecture validation shows +3.7 pp improvement on the COCOAI Interaction subset when applied to FakeVLM, with gains transferring from a single inpainter (FLUX.1-Fill) to multi-generator suites. The work argues that diffusion models share a spectral weakness in periocular structure, making gaze coherence a robust, backbone-agnostic detection signal orthogonal to existing low-level artifact methods.

Evaluation and Benchmarking AI Safety Research Effort FLUX.1-Fill Social Gaze Consistency +5 more

5arXiv · cs.CL·12d ago·source ↗

SV-Detect: AI-generated text detection via steering vectors in representation space

SV-Detect proposes a method for detecting machine-generated text by extracting steering vectors from the hidden representations of a frozen language model, constructing layer-wise directions that separate human from AI-written text. A lightweight classifier trained on projection features achieves strong performance both in-distribution and under distribution shift across domains, source models, and editing attacks like polishing and rewriting. The approach reframes AI-text detection as a representation-space probing problem, with interpretation analyses showing the learned directions capture stylistic cues beyond surface features.

Evaluation and Benchmarking AI Safety Research SV-Detect steering vectors

6Hugging Face Blog·1mo ago·source ↗

Introducing SynthID Text

Hugging Face published a blog post introducing SynthID Text, Google DeepMind's watermarking technique for AI-generated text. The method embeds imperceptible signals into LLM outputs by modifying token sampling distributions, enabling detection of AI-generated content without degrading text quality. The post likely covers integration with Hugging Face's transformers library, making the technique accessible to the broader ML community.

Evaluation and Benchmarking AI Safety Research Hugging Face Transformers Google DeepMind Hugging Face +2 more

6Google Deepmind Blog·1mo ago·source ↗

DeepMind Brings AI Image Verification to the Gemini App

DeepMind is integrating AI image verification capabilities directly into the Gemini app, enabling users to assess the authenticity or provenance of images. The feature likely leverages content credentials or watermarking techniques to surface metadata about AI-generated or manipulated images. This represents a practical deployment of provenance and authenticity tooling within a major consumer AI product.

AI Safety Research Enterprise Deployment Patterns Google DeepMind Gemini App AI image verification +1 more