AI RESEARCH

Improving Diversity in Black-box Few-shot Knowledge Distillation

arXiv CS.LG

ArXi:2604.25795v1 Announce Type: cross Knowledge distillation (KD) is a well-known technique to effectively compress a large network (teacher) to a smaller network (student) with little sacrifice in performance. However, most KD methods require a large