AI RESEARCH
BATQuant: Outlier-resilient MXFP4 Quantization via Learnable Block-wise Optimization
arXiv CS.AI
•
ArXi:2603.16590v1 Announce Type: cross Microscaling floating-point (MXFP) formats have emerged as a promising standard for deploying Multi-modal Large Language Models (MLLMs) and Large Language Models (LLMs) on modern accelerator architectures. However, existing Post-