Entity · model

MobileLLM-Pro

model · active · mobilellm-pro-82ac0c5a · 1 event · first seen May 27, 2026

Aliases: MobileLLM-Pro

Co-occurring entities

OLMoE-1B-7B · INT4 Quantization · Mixture of Experts · MobileMoE · on-device MoE scaling law · quantization-aware training

More like this (12)

dLLM-Prover-7B · LLMScan · StreamingLLM · EvalLLM · MyMentorLLM · MMLU-Pro · Dep-LLM · LLM CLI · LLM Wiki · SpeechLLM · RTLLM · LLM (CLI tool)

Recent events (1)

7 · arXiv · cs.CL · May 27, 2026

MobileMoE: Scaling Mixture-of-Experts for Sub-Billion Parameter On-Device Deployment

MobileMoE introduces a family of on-device MoE language models with 0.3–0.9B active parameters and 1.3–5.3B total parameters, targeting mobile deployment under memory and compute constraints. The authors derive an on-device MoE scaling law identifying a sweet spot of moderate sparsity with fine-grained and shared experts, then train models through a four-stage recipe including quantization-aware training on open-source data. Across 14 benchmarks, MobileMoE matches or exceeds leading dense on-device LLMs with 2–4× fewer inference FLOPs, and delivers 1.8–3.8× faster prefill and 2.2–3.4× faster decode than dense baselines on commodity smartphones at comparable INT4 memory.
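To make the parameter figures in the summary concrete, the following back-of-the-envelope sketch converts them into rough memory and compute numbers. It is not taken from the paper. It assumes INT4 weights cost exactly 0.5 bytes per parameter (ignoring quantization scales, zero points, activations, and the KV cache) and uses the common rule of thumb of about 2 FLOPs per active parameter per generated token. The function names are hypothetical, and the endpoints of each range are evaluated independently because the summary does not say which active-parameter count pairs with which total.

```python
# Rough estimates only; all constants below are assumptions, not paper results.

BYTES_PER_INT4_PARAM = 0.5   # 4 bits per weight, excluding scales/zero points
FLOPS_PER_ACTIVE_PARAM = 2.0  # rule-of-thumb forward-pass cost per token


def int4_weight_bytes(total_params: float) -> float:
    """Approximate weight storage in bytes when every parameter is INT4."""
    return total_params * BYTES_PER_INT4_PARAM


def approx_flops_per_token(active_params: float) -> float:
    """Approximate forward-pass FLOPs per token, driven by active parameters."""
    return active_params * FLOPS_PER_ACTIVE_PARAM


if __name__ == "__main__":
    # Total parameters set the memory footprint (all experts must be stored).
    for total in (1.3e9, 5.3e9):
        gb = int4_weight_bytes(total) / 1e9
        print(f"{total / 1e9:.1f}B total params -> ~{gb:.2f} GB of INT4 weights")

    # Active parameters set the per-token compute (only routed experts run).
    for active in (0.3e9, 0.9e9):
        gflops = approx_flops_per_token(active) / 1e9
        print(f"{active / 1e9:.1f}B active params -> ~{gflops:.1f} GFLOPs per token")
```

The split illustrates why an MoE can be compared with a dense model "at comparable INT4 memory" while using fewer inference FLOPs: storage scales with total parameters, whereas per-token compute scales with the active subset.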

Training Infrastructure · Frontier Model Releases · MobileLLM-Pro · OLMoE-1B-7B · INT4 Quantization