AI RESEARCH

Worst-Case Discovery and Runtime Protection for RL-Based Network Controllers

arXiv CS.AI

ArXi:2605.04373v1 Announce Type: cross RL-based controllers achieve strong average-case performance in networking tasks such as congestion control and adaptive bitrate streaming. Yet their performance can degrade severely under network conditions where strong performance is still achievable. Identifying such conditions and quantifying the resulting performance gap is intractable by enumeration, while the sequential and closed-loop nature of RL controllers makes formal verification methods impractical.