Entity · dataset

WildChat

datasetactivewildchat-ef48d630·1 events·first seen Jun 17, 2026

Aliases: WildChat

Co-occurring entities

Security and Privacy Prompts in the Wild: What Users Ask LLMs and How LLMs Respond Llama GPT-5.5

More like this (12)

HuggingChat WeChat LibreChat nanochat Qwen Chat LobeChat deepseek-chat StarChat-Alpha Q8-Chat Qwen1.5-110B-Chat chat-latest ChatGPT Plus

Recent events (1)

5arXiv · cs.CL·Jun 17, 2026·source ↗

Study of security and privacy prompts in the wild reveals LLM response quality gaps and inconsistency

Researchers analyzed 14,727 security and privacy (S&P) prompts drawn from WildChat's 3.2M real user-LLM conversations, categorizing them into nine topic areas and evaluating response quality across 270 advice-seeking prompts. Commercial models substantially outperformed open-weight models (GPT achieving 98% 'good enough' responses vs. Llama 4 at 47%), but even high-performing commercial models showed inconsistent responses across repeated runs of the same prompt. The study is the first to analyze real user S&P queries to LLMs rather than expert-authored test sets, surfacing both a capability gap and a reliability concern.

Evaluation and Benchmarking AI Safety Research Security and Privacy Prompts in the Wild: What Users Ask LLMs and How LLMs Respond WildChat Llama +1 more