AI RESEARCH
A Systematic Evaluation of On-Device LLMs: Quantization, Performance, and Resources
arXiv CS.LG
arXiv:2505.15030v4
Deploying Large Language Models (LLMs) on edge devices enhances privacy but faces performance hurdles due to limited resources. We