AI RESEARCH
Identifying Latent Actions and Dynamics from Offline Data via Demonstrator Diversity
arXiv CS.LG
arXiv:2603.17577v1 Announce Type: new
Can latent actions and environment dynamics be recovered from offline trajectories when actions are never observed? We study this question in a setting where trajectories are action-free but tagged with demonstrator identity. We assume that each demonstrator follows a distinct policy, while the environment dynamics are shared across demonstrators and identity affects the next observation only through the chosen action.
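The structural assumption in the abstract can be illustrated with a small tabular sketch. It assumes discrete latent actions and a single fixed current observation; the function name `marginal_next_obs`, the demonstrator labels, and all numbers are hypothetical and not taken from the paper. Under these assumptions, the next-observation distribution seen for each demonstrator is a mixture of the shared dynamics rows, weighted by that demonstrator's policy.

```python
# Hypothetical tabular illustration; names and numbers are assumptions, not from the paper.

def marginal_next_obs(policy, dynamics):
    """Return P(o' | o, d) = sum_a pi_d(a | o) * T(o' | o, a) for one fixed observation o.

    policy:   list of K probabilities, pi_d(a | o) for demonstrator d
    dynamics: K rows, each a distribution over next observations, T(. | o, a)
    """
    n_next = len(dynamics[0])
    return [
        sum(policy[a] * dynamics[a][j] for a in range(len(policy)))
        for j in range(n_next)
    ]

# Shared dynamics at one observation o: 2 latent actions, 3 possible next observations.
shared_dynamics = [
    [0.8, 0.1, 0.1],  # T(. | o, a=0)
    [0.1, 0.1, 0.8],  # T(. | o, a=1)
]

# Demonstrator-specific policies at the same observation o.
policies = {
    "demo_A": [0.9, 0.1],
    "demo_B": [0.5, 0.5],
    "demo_C": [0.2, 0.8],
}

# Identity enters only through the policy weights; the dynamics rows are shared.
observed = {d: marginal_next_obs(pi, shared_dynamics) for d, pi in policies.items()}
for d, dist in observed.items():
    print(d, [round(p, 3) for p in dist])
```

In the action-free setting, only the per-demonstrator distributions in `observed` would be available. The sketch shows how they differ across demonstrators while being built from the same shared rows.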