paper
Deep Double Descent
paperactive
deep-double-descent-80b909c1·1 events·first seen 28d agoAliases: Deep Double Descent, double descent
Co-occurring entities
More like this (12)
Recent events (1)
Deep Double Descent: Universal Phenomenon in CNNs, ResNets, and Transformers
OpenAI researchers demonstrate that the double descent phenomenon—where model performance improves, degrades, then improves again—occurs universally across CNNs, ResNets, and transformers as a function of model size, data size, or training time. The effect can often be masked by careful regularization, which may explain why it has been underappreciated. The underlying mechanism remains poorly understood, and the authors identify it as an important open research direction.