AI RESEARCH

STAR: A Stage-attributed Triage and Repair framework for RCA Agents in Microservices

arXiv cs.AI

arXiv:2605.15581v1

LLM-based root cause analysis (RCA) agents have recently emerged as a promising paradigm for incident diagnosis in microservice AIOps. However, their reliability remains fragile: an error in early evidence collection, hypothesis formulation, or causal analysis can propagate through the reasoning trace and eventually corrupt the final diagnosis. In this paper, we present STAR, a Stage-attributed Triage and Repair framework for repairing erroneous RCA traces.