technique
AI-assisted human evaluation
technique · active
ai-assisted-human-evaluation-334dc925 · 1 event · first seen 28d ago
Aliases: AI-assisted human evaluation
Recent events (1)
AI-Written Critiques Help Humans Notice Flaws in Summaries
OpenAI trained critique-writing models to identify flaws in AI-generated summaries and found that human evaluators catch significantly more errors when assisted by model-generated critiques. A key finding is that scale improves critique-writing ability more than summary-writing ability. The work is framed as a step toward using AI to assist human oversight of AI systems on difficult tasks, which makes it relevant to scalable oversight research.