AI RESEARCH
TIDE: Token-Informed Depth Execution for Per-Token Early Exit in LLM Inference
arXiv CS.LG
•
ArXi:2603.21365v1 Announce Type: new Large language models run every token through every layer, regardless of difficulty. We present TIDE, a post-