AI RESEARCH
High-Rate Quantized Matrix Multiplication II
arXiv CS.AI
•
ArXi:2605.13768v1 Announce Type: cross This is the second part of the work investigating quantized matrix multiplication (MatMul). In part I we considered the case of calibration-free quantization, whereas here we discuss the setting where covariance matrix $\Sigma_X$ of the columns of the second factor is available. This setting arises in the ubiquitous task of weight-only post-