AI RESEARCH
Randomized Antipodal Search Done Right for Data Pareto Improvement of LLM Unlearning
arXiv CS.LG
•
ArXi:2604.16591v1 Announce Type: new Large language models (LLMs) sometimes memorize undesirable knowledge, which must be removed after deployment. Prior work on machine unlearning has focused largely on optimization methods that adjust parameters to enforce forgetting while preserving retention. However, these approaches assume that the forget and retain sets are readily available, which rarely holds in practice. Unlearning is typically triggered by an undesired generation at inference time, making the retrieval of relevant data the central challenge.