ReCUBE: Evaluating Repository-Level Context Utilization in Code Generation

ArXi:2603.25770v1 Announce Type: cross Large Language Models (LLMs) have recently emerged as capable coding assistants that operate over large codebases through either agentic exploration or full-context generation. Existing benchmarks capture a broad range of coding capabilities, such as resolving GitHub issues, but none of them directly isolate and measure how effectively LLMs leverage repository-level context during code generation. To address this, we