AI RESEARCH
A Hardware-Aware, Per-Layer Methodology for Post-Training Quantization of Large Language Models
arXiv cs.LG
arXiv:2605.14929v1 Announce Type: new Scaled Outer Product (SOP) is a post-