AI RESEARCH
ADMEDTAGGER: an annotation framework for distillation of expert knowledge for the Polish medical language
arXiv CS.CL
•
ArXi:2601.09722v2 Announce Type: replace In this work, we present an annotation framework that nstrates how a multilingual LLM pretrained on a large corpus can be used as a teacher model to distill the expert knowledge needed for tagging medical texts in Polish. This work is part of a larger project called ADMEDVOICE, within which we collected an extensive corpus of medical texts representing five clinical categories - Radiology, Oncology, Cardiology, Hypertension, and Pathology.