AI RESEARCH

Task-Specific Knowledge Distillation via Intermediate Probes

arXiv CS.AI

ArXi:2603.12270v1 Announce Type: cross Knowledge distillation from large language models (LLMs) assumes that the teacher's output distribution is a high-quality