AI RESEARCH
AscendOptimizer: Episodic Agent for Ascend NPU Operator Optimization
arXiv CS.LG
•
ArXi:2603.23566v1 Announce Type: new AscendC (Ascend C) operator optimization on Huawei Ascend neural processing units (NPUs) faces a two-fold knowledge bottleneck: unlike the CUDA ecosystem, there are few public reference implementations to learn from, and performance hinges on a coupled two-part artifact - a host-side tiling program that orchestrates data movement and a kernel program that schedules and pipelines instructions. We present AscendOptimizer, an episodic agent that bootstraps this missing expertise by turning execution into experience.