PQuantML: A Tool for End-to-End Hardware-aware Model Compression

ArXi:2603.26595v1 Announce Type: new PQuantML is a new open-source, hardware-aware neural network model compression library tailored to end-to-end workflows. Motivated by the need to deploy performant models to environments with strict latency constraints, PQuantML simplifies