(Rant ;)) Make your benchmarks realistic

r/LocalLLaMA

Everybody here is posting their optimizations for running different models. That's good, but make these benchmarks realistic, since speed is not the only factor in running an LLM effectively.