Almanac
critique-writing model

model · active · critique-writing-model-05b6f84f · 1 event · first seen 28d ago

Aliases: critique-writing model


Recent events (1)

OpenAI Blog · 28d ago

AI-Written Critiques Help Humans Notice Flaws in Summaries

OpenAI trained critique-writing models to identify flaws in AI-generated summaries and found that human evaluators catch significantly more errors when assisted by model-generated critiques. A key finding is that increasing model scale improves critique-writing ability more than summary-writing ability. The work is framed as a step toward using AI to assist human oversight of AI systems on difficult tasks, which makes it relevant to scalable oversight research.