AI RESEARCH
Prune, Update and Trim: Robust Structured Pruning for Large Language Models
arXiv CS.LG
•
ArXi:2605.18331v1 Announce Type: new Large Language Models (LLMs) have experienced significant growth and development in recent years. However, performing inference on LLMs remains costly, especially for long-context inference or in resource-constrained devices. This motivates the development of new post-