Almanac
paper

Fine-tuning GPT-2 from Human Preferences

paperactivefine-tuning-gpt-2-from-human-preferences-bd26ab92·1 events·first seen 28d ago

Aliases: Fine-tuning GPT-2 from Human Preferences

Co-occurring entities

More like this (12)

Recent events (1)

6Openai Blog·28d ago·source ↗

Fine-tuning GPT-2 from Human Preferences

OpenAI fine-tuned the 774M parameter GPT-2 model using human feedback across summarization and style-continuation tasks, requiring 60k and 5k human labels respectively. The work revealed a labeler preference misalignment: for summarization, labelers rewarded copying from source text rather than genuine summarization. The stated motivation is advancing safety techniques for human-machine interaction and learning about human values from feedback.