OSAQ: Outlier Self-Absorption for Accurate Low-bit LLM Quantization

arXiv:2605.04738v1 Announce Type: new Large Language Models (LLMs) have demonstrated remarkable capabilities. However, their massive parameter scale leads to significant resource consumption and latency during inference. Post-