AI RESEARCH
DVM: Real-Time Kernel Generation for Dynamic AI Models
arXiv CS.LG
•
ArXi:2603.24239v1 Announce Type: cross Dynamism is common in AI computation, e.g., the dynamic tensor shapes and the dynamic control flows in models. Due to the long compilation time, existing runtime compilation damages the model efficiency, while the offline compilers either suffer from the long compilation time and device memory footprint to cover all the possible execution instances of a dynamic model, or sacrifice optimization opportunities for usability.