AI RESEARCH
PARHAF, a human-authored corpus of clinical reports for fictitious patients in French
arXiv CS.CL
•
ArXi:2603.20494v1 Announce Type: new The development of clinical natural language processing (NLP) systems is severely hampered by the sensitive nature of medical records, which restricts data sharing under stringent privacy regulations, particularly in France and the broader European Union. To address this gap, we The corpus contains 7394 clinical reports covering 5009 patient cases across a wide range of medical and surgical specialties.