AI RESEARCH

Feature Weighting Improves Pool-Based Sequential Active Learning for Regression

arXiv CS.LG

ArXi:2604.02019v1 Announce Type: new Pool-based sequential active learning for regression (ALR) optimally selects a small number of samples sequentially from a large pool of unlabeled samples to label, so that a accurate regression model can be constructed under a given labeling budget. Representativeness and diversity, which involve computing the distances among different samples, are important considerations in ALR. However, previous ALR approaches do not incorporate the importance of different features in inter-sample distance computation, resulting in sub-optimal sample selection.