Entity · person

Jacob Hilton

personactivejacob-hilton-a8485d2d·1 events·first seen May 20, 2026

Aliases: Jacob Hilton

Co-occurring entities

TruthfulQA Stephanie Lin Owain Evans OpenAI

More like this (12)

Jake Cooper Jack Clark John Horton Nathan Lambert Harrison Edwards Christopher J. Kelly Simon Willison Harvey Jay Shim Adam Jared Kaplan JAT (Jack of All Trades)

Recent events (1)

6Openai Blog·May 20, 2026·source ↗

TruthfulQA: Measuring how models mimic human falsehoods

OpenAI introduced TruthfulQA, a benchmark designed to measure whether language models generate truthful answers or mimic common human misconceptions and falsehoods. The benchmark tests models on questions where humans frequently give wrong answers due to misconceptions, conspiracy theories, or false beliefs. Results showed that larger models were not necessarily more truthful, and in some cases performed worse, highlighting a key alignment challenge.

Evaluation and Benchmarking AI Safety Research TruthfulQA Stephanie Lin Jacob Hilton +3 more