One Polluted Page Is Enough: Evaluating Web Content Pollution in Generative Recommenders

Recent events (1)

arXiv · cs.AI · 5d ago

FORGE benchmark reveals search-augmented LLMs vulnerable to fake product promotion via web content pollution

Researchers introduce FORGE, a benchmark that measures how often search-augmented LLMs recommend fake products when their retrieval results are polluted with fabricated reviews or promotional pages. Across 12 commercial and open-weights models, a single polluted page produces fooled rates of up to 27%, rising to 73.8% when all top-3 results are replaced. Notably, chain-of-thought reasoning does not mitigate the vulnerability and often generates spurious social proof to justify the false recommendations. The three defenses tested (skepticism prompting, model-prior filtering, and cross-document consensus) each carry significant drawbacks.
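
To make the "fooled rate" and the cross-document consensus idea concrete, here is a minimal Python sketch. It is not FORGE's code: the `Trial` record, the name-matching rule in `is_fooled`, and the `min_docs` threshold in `consensus_filter` are all assumptions made for illustration, and the benchmark's actual scoring and defense may differ.

```python
from __future__ import annotations

from collections import Counter
from dataclasses import dataclass


@dataclass
class Trial:
    """Hypothetical record of one benchmark query."""
    fake_product: str        # product planted via the polluted page(s)
    recommended: list[str]   # products the model ended up recommending


def is_fooled(trial: Trial) -> bool:
    # Assumption: the model is "fooled" if the fake product's name appears
    # among its recommendations (case-insensitive exact match).
    fake = trial.fake_product.casefold()
    return any(name.casefold() == fake for name in trial.recommended)


def fooled_rate(trials: list[Trial]) -> float:
    # Fraction of trials in which the fake product was recommended.
    if not trials:
        raise ValueError("need at least one trial")
    return sum(is_fooled(t) for t in trials) / len(trials)


def consensus_filter(products_per_doc: list[set[str]], min_docs: int = 2) -> set[str]:
    # Assumed form of a cross-document consensus rule: keep only products
    # mentioned by at least `min_docs` distinct retrieved documents.
    counts = Counter(p for doc in products_per_doc for p in set(doc))
    return {p for p, n in counts.items() if n >= min_docs}


if __name__ == "__main__":
    trials = [
        Trial("ZetaPhone", ["AlphaPhone", "zetaphone"]),
        Trial("ZetaPhone", ["AlphaPhone"]),
    ]
    print(fooled_rate(trials))

    # One polluted page out of three: the planted product is dropped.
    print(consensus_filter([{"AlphaPhone", "ZetaPhone"}, {"AlphaPhone"}, {"AlphaPhone"}]))
```

In this sketch, a product that appears on only one of three retrieved pages is filtered out, while a product planted on every retrieved page would pass the same rule unchanged. That is a property of this toy filter, not a result reported in the summary above.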