AI RESEARCH

TIDE: Token-Informed Depth Execution for Per-Token Early Exit in LLM Inference

arXiv CS.LG

ArXi:2603.21365v1 Announce Type: new Large language models run every token through every layer, regardless of difficulty. We present TIDE, a post-