AI RESEARCH

E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring

arXiv CS.CL

ArXi:2605.16882v1 Announce Type: new Low-resource deployment constraints have made model quantization essential for deploying neural networks while preserving performance. Meanwhile, model merging has become an increasingly practical low-resource strategy for integrating multiple task- or domain-specialized experts into a single model without joint