AI RESEARCH
A Physical Agentic Loop for Language-Guided Grasping with Execution-State Monitoring
arXiv CS.CV
•
ArXi:2604.07395v1 Announce Type: cross Robotic manipulation systems that follow language instructions often execute grasp primitives in a largely single-shot manner: a model proposes an action, the robot executes it, and failures such as empty grasps, slips, stalls, timeouts, or semantically wrong grasps are not surfaced to the decision layer in a structured way. Inspired by agentic loops in digital tool-using agents, we reformulate language-guided grasping as a bounded embodied agent operating over grounded execution states, where physical actions expose an explicit tool-state stream. We.