Privacy Preserving Reinforcement Learning with One-Sided Feedback

ArXi:2605.18246v1 Announce Type: new We study reinforcement learning (RL) in multi-dimensional continuous state and action spaces with one-sided feedback, where the agent receives partial observations of the state and obtains reward information for only a subset of the state-action space at each time step. This setting