AI RESEARCH
Improving Diversity in Black-box Few-shot Knowledge Distillation
arXiv cs.LG
arXiv:2604.25795v1 Announce Type: cross Knowledge distillation (KD) is a well-known technique for compressing a large network (teacher) into a smaller network (student) with little sacrifice in performance. However, most KD methods require a large