New "major breakthrough?" architecture SubQ

r/LocalLLaMA
Generative AI

While reading through papers and news today i came across this post/blog, claiming major architectural breakthrough, having 12M tokens context window, better than opus, gemini and other models and whopping less than 5% of the cost and it processes token 52X faster than flashattention, yep you read that number right, Fifty two times, at this point i instantly called BS and was ready to move one tbh, there is zero code, paper, api or anything to either test it out or reproduce it.