Since FastFlowLM added support for Linux, I decided to benchmark all the models they support. Here are some results.

Tested on an HP ZBook Ultra G1a with a Ryzen AI Max+ 395. I attempted to test at context depths of 0, 10k, 40k, and 70k tokens. If a result is missing, the test failed. I increased the context size for gpt-oss-20b and qwen3.5 to their maximum and did not touch the rest of the config, which explains why many of the other models don't have results at deep contexts.