AI RESEARCH

Two-dimensional early exit optimisation of LLM inference

arXiv cs.AI

arXiv:2604.18592v1 Announce Type: cross