technique
Character-level Language Model
techniqueactive
character-level-language-model-33666ebd·1 events·first seen 28d agoAliases: Character-level Language Model
Co-occurring entities
More like this (12)
AnyLanguageModelmRNA Language Modelprotein language modelslarge language model agentscontinuous diffusion language modelMultimodal Large Language ModelsRecursive Language Models (RLMs)large language models1B-scale language modelsReinforcement Learning for Language Modelsgenerative language modelingLatent Context Language Models
Recent events (1)
Unsupervised Sentiment Neuron
OpenAI researchers trained a character-level language model on Amazon reviews to predict the next character and discovered it spontaneously learned a single neuron encoding sentiment with high accuracy. The system achieved state-of-the-art sentiment classification with minimal labeled data, demonstrating that unsupervised language modeling can yield interpretable, task-relevant representations. This was an early result connecting unsupervised pretraining to downstream NLP tasks.