AI RESEARCH
Learn Where Outcomes Diverge: Efficient VLA RL via Probabilistic Chunk Masking
arXiv CS.LG
arXiv:2605.16154v1 Announce Type: new Reinforcement learning (RL) allows vision-language-action (VLA) policies to generalize beyond their