AI RESEARCH

ADMM-Q: An Improved Hessian-based Weight Quantizer for Post-Training Quantization of Large Language Models

arXiv CS.LG

ArXi:2605.11222v1 Announce Type: new Quantization is an effective strategy to reduce the storage and computation footprint of large language models (LLMs). Post-