paper

Hallucination in World Models is Predictable and Preventable

paperactiveprovisionalhallucination-in-world-models-is-predictable-and-preventable-4df7c11b·1 events·first seen 4d ago

Aliases: Hallucination in World Models is Predictable and Preventable

Co-occurring entities

Nicklas Hansen MMBench2

More like this (12)

Vision-Default, Prior-Override: Causal Mechanisms of Perception-Knowledge Conflict in Vision-Language Models Looped World Models stable-worldmodel Predicting Future Behaviors in Reasoning Models Enables Better Steering Hallucinations Leaderboard hallucination (LLM)world model physical world model Recalling Too Well: Sycophancy Evaluation and Mitigation in Memory-Augmented Models When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models Language Modeling Loss A Causal Model of Theory of Mind in Conflict for Artificial Intelligence

Recent events (1)

6arXiv · cs.LG·4d ago·source ↗

MMBench2 paper: hallucination in world models is predictable and preventable via coverage signals

Researchers introduce MMBench2, a 427-hour, 210-task dataset for visual world modeling, and train a 350M-parameter world model to study hallucination in generative world models. The paper identifies three distinct hallucination modes (perceptual, action-marginalized, scene-diverging) and develops lightweight signals that predict where models will fail. A coverage-aware sampling technique and curiosity-reward-based data collection enable efficient finetuning to unseen environments with as few as 50 real trajectories. The central finding is that world model hallucination is fundamentally a data coverage problem, with the same signals serving both detection and mitigation.

Evaluation and Benchmarking Nicklas Hansen MMBench2 Hallucination in World Models is Predictable and Preventable