AI RESEARCH

Partial Feedback Online Learning

arXiv CS.LG

ArXi:2601.21462v3 Announce Type: replace We study a new learning protocol, termed partial-feedback online learning, where each instance admits a set of acceptable labels, but the learner observes only one acceptable label per round. We highlight that, while classical version space is widely used for online learnability, it does not directly extend to this setting. We address this obstacle by