Entity · paper

Dravid et al., 2023

paperactivedravid-et-al-2023-ef7638b4·1 events·first seen Jun 3, 2026

Aliases: Dravid et al., 2023

Co-occurring entities

Rosetta Neurons Neuron Populations Exhibit Divergent Selectivity with Scale

More like this (12)

Wang et al. 2024 Mirzadeh et al. 2025 Huang et al. 2025 Eloundou et al.Bechiri and Lanasri [2026]Aditya (co-lead author)Aviral Kumar CVPR 2026 arXiv:2602.05394 Adithya S K Devika Verma Bradley-Terry-Davidson

Recent events (1)

6arXiv · cs.LG·Jun 3, 2026·source ↗

Rosetta Neurons follow sublinear power-law scaling with model size, becoming more monosemantic at scale

A new arXiv paper investigates how neuron populations evolve with scale in both language models (up to 30B parameters) and vision models (up to 5B parameters), focusing on 'Rosetta Neurons' — neurons with similar activation patterns across independently trained models. The authors find Rosetta Neurons grow in absolute count but shrink as a fraction of total neurons, and exhibit a 'Neuron Polarization Effect' where they become increasingly monosemantic while non-Rosetta neurons remain less selective. An analytical model explains the sublinear power-law scaling, and the paper demonstrates practical utility via a targeted data-filtering case study for continued pretraining. The results extend scaling laws to neuron-level interpretability structure, linking model size to systematic changes in universality and specialization.

Evaluation and Benchmarking AI Safety Research Rosetta Neurons Neuron Populations Exhibit Divergent Selectivity with Scale Dravid et al., 2023