paper
Fine-tuning GPT-2 from Human Preferences
paperactive
fine-tuning-gpt-2-from-human-preferences-bd26ab92·1 events·first seen 28d agoAliases: Fine-tuning GPT-2 from Human Preferences
Co-occurring entities
More like this (12)
Recent events (1)
Fine-tuning GPT-2 from Human Preferences
OpenAI fine-tuned the 774M parameter GPT-2 model using human feedback across summarization and style-continuation tasks, requiring 60k and 5k human labels respectively. The work revealed a labeler preference misalignment: for summarization, labelers rewarded copying from source text rather than genuine summarization. The stated motivation is advancing safety techniques for human-machine interaction and learning about human values from feedback.