Apparent Psychological Profiles of Large Language Models are Largely a Measurement Artifact
apparent-psychological-profiles-of-large-language-models-are-largely-a-measurement-artifact-b6be45ae·1 events·first seen 47h agoAliases: Apparent Psychological Profiles of Large Language Models are Largely a Measurement Artifact
More like this (12)
Recent events (1)
LLM psychological profiles are largely measurement artifacts, not model properties
A new arXiv preprint administers a battery of personality and risk-preference instruments to 56 instruction-tuned LLMs alongside large human reference samples, finding that 81-90% of between-model variation is explained by directional response bias rather than the traits the instruments target. The authors introduce the concept of 'response orthogonality' to explain why some instruments appear more reliable than others, and show that apparent psychological profiles can be manufactured through item selection. The findings challenge the validity of using human-designed psychometric tools to characterize LLMs, with direct implications for safety assessment and the use of LLMs as proxies for human participants in research.