AI RESEARCH

BATQuant: Outlier-resilient MXFP4 Quantization via Learnable Block-wise Optimization

arXiv cs.AI

arXiv:2603.16590v1 (announce type: cross)

Microscaling floating-point (MXFP) formats have emerged as a promising standard for deploying Multi-modal Large Language Models (MLLMs) and Large Language Models (LLMs) on modern accelerator architectures. However, existing Post-