AI RESEARCH

UniPROT: Uniform Prototype Selection via Partial Optimal Transport with Submodular Guarantees

arXiv CS.LG

ArXi:2604.10952v1 Announce Type: new Selecting prototypical examples from a source distribution to represent a target data distribution is a fundamental problem in machine learning. Existing subset selection methods often rely on implicit importance scores, which can be skewed towards majority classes and lead to low-quality prototypes for minority classes. We present $\methodprop$, a novel subset selection framework that minimizes the optimal transport (OT) distance between a uniformly weighted prototypical distribution and the target distribution.