Reminder that Anthropic reported memorization on some SWE-Bench Pro problems

r/singularity
Generative AI

"SWE-bench Verified, Pro, and Multilingual: Our memorization screens flag a subset of problems in these SWE-bench evals."

Submitted by /u/RideOrDieRemember