Finding Duplicates in 1.1M BDD Steps: cukereuse, a Paraphrase-Robust Static Detector for Cucumber and Gherkin

arXiv CS.CL

arXiv:2604.20462v1 Announce Type: cross Behaviour-Driven Development (BDD) suites accumulate step-text duplication whose maintenance cost is established in prior work. Existing detection techniques require running the tests (Binamungu, 2018-2023) or are confined to a single organisation (Irshad, 2020-2022), leaving a gap: a purely static, paraphrase-robust, step-level detector usable on any repository.