paper
Language Models are Few-Shot Learners
paper · active
language-models-are-few-shot-learners-9455793a · 1 event · first seen 28d ago
Aliases: Language Models are Few-Shot Learners
Co-occurring entities
More like this (12)
few-shot learning · Reinforcement Learning for Language Models · One-Shot Imitation Learning · Language Modeling Loss · unsupervised language modeling · encoder-only language models · Multimodal Large Language Models · AnyLanguageModel · 1B-scale language models · Language Model Finetuning · Large Language Models (frontier) · multi-turn language models
Recent events (1)
Language models are few-shot learners
OpenAI published the GPT-3 paper, which introduced a 175-billion-parameter autoregressive language model with strong few-shot learning capabilities across a wide range of NLP tasks. The work showed that scaling up language models greatly improves task-agnostic, few-shot performance, in some cases becoming competitive with prior state-of-the-art fine-tuned models, with tasks specified purely through text prompts and no gradient updates. The paper became a foundational milestone in the development of large language models and the modern AI landscape.