Beating the Style Detector: Three Hours of Agentic Research on the AI-Text Arms Race

arXiv:2605.02620v1 Announce Type: cross Reproducing an empirical NLP study used to take weeks. Given the released data and a modern agentic-research harness, we redo every experiment of a recent ACL 2026 study on personal-style post-editing of LLM drafts -- and add three new ones -- with the human investigator acting only as a reviewer-in-the-loop. We reproduce all seven preregistered hypotheses and recover the paper's headline correlation between perceived self-similarity and embedding-measured self-similarity to three decimal places ($r{=}0.244$, $p{<}10^{-8}$, $n{=}648$).