Expert-as-a-Service: Towards Efficient, Scalable, and Robust Large-scale MoE Serving
Published in arXiv preprint, 2025
Expert-as-a-Service is a serving system for efficient, scalable, and robust large-scale Mixture-of-Experts deployment.
Recommended citation: Ziming Liu, Boyu Tian, Guoteng Wang, Zhen Jiang, Peng Sun, Zhenhua Han, Tian Tang, Xiaohe Hu, Yanmin Jia, Yan Zhang, He Liu, Mingjun Zhang, Yiqi Zhang, Qiaoling Chen, Shenggan Cheng, Mingyu Gao, Yang You, and Siyuan Feng. "Expert-as-a-Service: Towards Efficient, Scalable, and Robust Large-scale MoE Serving." arXiv preprint, 2025.
Download Paper
