AI RESEARCH
DAPA: Distribution Aware Piecewise Activation Functions for On-Device Transformer Inference and Training
arXiv CS.LG
•
ArXi:2603.19338v1 Announce Type: new Non-linear activation functions play a pivotal role in on-device inference and