AI RESEARCH
MoEless: Efficient MoE LLM Serving via Serverless Computing
arXiv CS.AI
•
ArXi:2603.06350v1 Announce Type: cross Large Language Models (LLMs) have become a cornerstone of AI, driving progress across diverse domains such as content creation, search and recommendation systems, and AI-assisted workflows. To alleviate extreme