AI RESEARCH
Reinforcement Unlearning via Group Relative Policy Optimization
arXiv CS.LG
•
ArXi:2601.20568v3 Announce Type: replace