The Shibboleth Effect: Auditing the Cross-Lingual Distributional Skew of Large Language Models
The Shibboleth Effect: Cross-lingual behavioral skew in frontier LLMs under adversarial geopolitical simulation
Researchers introduce the 'Shibboleth Effect', a systematic difference in how an LLM behaves depending on the language it operates in, and audit six frontier models (GPT-4o, Llama-4, Mistral-Large, Gemini-3.1-Pro, Qwen3.6-Plus, DeepSeek-R1) using a synthetic wargame built around a maritime territorial dispute, played in English and in Turkish. Results are heterogeneous: Llama-4 becomes significantly more coercive in Turkish, Gemini-3.1-Pro and DeepSeek-R1 become less coercive, and GPT-4o shows no detectable shift. The study identifies two candidate buffering mechanisms, chain-of-thought institutional anchoring and multilingual RLHF alignment. The findings bear directly on deploying LLMs in diplomatic or crisis-management contexts.