AI RESEARCH

Lyapunov-Guided Self-Alignment: Test-Time Adaptation for Offline Safe Reinforcement Learning

arXiv CS.AI

arXiv:2604.26516v1 Announce Type: cross Offline reinforcement learning (RL) agents often fail when deployed, as the gap between