model
EMO
model · active
emo-18d805b9 · 1 event · first seen 1mo ago
Aliases: EMO
Recent events (1)
EMO: Pretraining Mixture of Experts for Emergent Modularity
AllenAI introduces EMO, a pretraining approach for Mixture of Experts (MoE) models designed so that modularity emerges during training. The work explores how experts in an MoE architecture can come to specialize through routing, without explicit supervision. It was published on the Hugging Face blog and is research-level work aimed at MoE training dynamics and efficiency.
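
For readers unfamiliar with MoE routing, the sketch below shows generic top-k gating and a simple per-expert load count, which is one rough way to see whether experts are being used unevenly. It is not EMO's method: the summary above does not describe the EMO algorithm, and every name here (`top_k_route`, `expert_load`, the toy logits) is a hypothetical chosen for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_route(router_logits, k=2):
    """Return (expert_index, weight) pairs for the k highest-scoring experts.

    Weights are renormalized so they sum to 1 over the selected experts.
    This is generic top-k gating, assumed here for illustration only.
    """
    probs = softmax(router_logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def expert_load(batch_logits, num_experts, k=2):
    """Fraction of routing slots assigned to each expert across a batch of tokens."""
    counts = [0] * num_experts
    for logits in batch_logits:
        for i, _ in top_k_route(logits, k):
            counts[i] += 1
    total = sum(counts)
    return [c / total for c in counts]


# Toy router logits: 3 tokens, 4 experts (made-up numbers).
batch = [
    [2.0, 0.1, -1.0, 0.5],
    [1.5, 0.2, -0.5, 1.0],
    [-1.0, 3.0, 0.0, 0.2],
]
routes = top_k_route(batch[0], k=2)
load = expert_load(batch, num_experts=4, k=2)
print(routes)
print(load)
```

In this toy batch, one expert is never selected and another receives half of all routing slots. A skew like this only shows uneven usage; whether it reflects meaningful specialization depends on which inputs each expert receives.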