ADMM-Q: An Improved Hessian-based Weight Quantizer for Post-Training Quantization of Large Language Models
arXiv cs.LG
arXiv:2605.11222v1 Announce Type: new Quantization is an effective strategy to reduce the storage and computation footprint of large language models (LLMs). Post-