Almanac
model

EMO

modelactiveemo-18d805b9·1 events·first seen 1mo ago

Aliases: EMO

Co-occurring entities

More like this (12)

Recent events (1)

5Hugging Face Blog·1mo ago·source ↗

EMO: Pretraining Mixture of Experts for Emergent Modularity

AllenAI introduces EMO, a pretraining approach for Mixture of Experts (MoE) models that aims to produce emergent modularity during training. The work explores how MoE architectures can develop specialized expert routing without explicit supervision. Published on the Hugging Face blog, this represents research-level work on improving MoE training dynamics and efficiency.