AI RESEARCH
Lyapunov-Guided Self-Alignment: Test-Time Adaptation for Offline Safe Reinforcement Learning
arXiv CS.AI
•
ArXi:2604.26516v1 Announce Type: cross Offline reinforcement learning (RL) agents often fail when deployed, as the gap between