Almanac
technique

AI-assisted human evaluation

techniqueactiveai-assisted-human-evaluation-334dc925·1 events·first seen 28d ago

Aliases: AI-assisted human evaluation

Co-occurring entities

More like this (12)

Recent events (1)

6Openai Blog·28d ago·source ↗

AI-Written Critiques Help Humans Notice Flaws in Summaries

OpenAI trained critique-writing models to identify flaws in AI-generated summaries, finding that human evaluators catch significantly more errors when assisted by model-generated critiques. A key finding is that scale improves critique-writing ability more than summary-writing ability. The work is framed as a step toward using AI to assist human oversight of AI systems on difficult tasks, relevant to scalable oversight research.