AI RESEARCH

ADMEDTAGGER: an annotation framework for distillation of expert knowledge for the Polish medical language

arXiv CS.CL

ArXi:2601.09722v2 Announce Type: replace In this work, we present an annotation framework that nstrates how a multilingual LLM pretrained on a large corpus can be used as a teacher model to distill the expert knowledge needed for tagging medical texts in Polish. This work is part of a larger project called ADMEDVOICE, within which we collected an extensive corpus of medical texts representing five clinical categories - Radiology, Oncology, Cardiology, Hypertension, and Pathology.