AI RESEARCH

LexiSafe: Offline Safe Reinforcement Learning with Lexicographic Safety-Reward Hierarchy

arXiv CS.LG

ArXi:2602.17312v2 Announce Type: replace Offline safe reinforcement learning (RL) is increasingly important for cyber-physical systems (CPS), where safety violations during