AdamBench - a benchmark for local LLMs for agentic coding (on RTX5080 16Gb + 64Gb RAM)

So. I was looking for the best local models for myself to use them in agentic coding workflows. And this is how this benchmark idea was born. And even though it's very "me-specific", I think that it might be useful for others as well, so I decided to document and publish it. The full benchmark results, methodology, visalisations etc. can be found here: README (+ prompt files in review_outputs) should provide all necessary info to replicate exactly the same benchmark flow if you want to compare the results or test other models against the ones that I tested.