AI RESEARCH
LexiSafe: Offline Safe Reinforcement Learning with Lexicographic Safety-Reward Hierarchy
arXiv CS.LG
•
ArXi:2602.17312v2 Announce Type: replace Offline safe reinforcement learning (RL) is increasingly important for cyber-physical systems (CPS), where safety violations during