AI RESEARCH
Two-dimensional early exit optimisation of LLM inference
arXiv cs.AI
arXiv:2604.18592v1 Announce Type: cross