AI RESEARCH

DAPA: Distribution Aware Piecewise Activation Functions for On-Device Transformer Inference and Training

arXiv CS.LG

arXiv:2603.19338v1 Announce Type: new Non-linear activation functions play a pivotal role in on-device inference and