AI RESEARCH

FeatEHR-LLM: Leveraging Large Language Models for Feature Engineering in Electronic Health Records

arXiv CS.AI

ArXi:2604.22534v1 Announce Type: cross Feature engineering for Electronic Health Records (EHR) is complicated by irregular observation intervals, variable measurement frequencies, and structural sparsity inherent to clinical time series. Existing automated methods either lack clinical domain awareness or assume clean, regularly sampled inputs, limiting their applicability to real-world EHR data. We present \textbf{FeatEHR-LLM}, a framework that leverages Large Language Models (LLMs) to generate clinically meaningful tabular features from irregularly sampled EHR time series.