Taalas rumoured to etch Qwen 3.5 27B into silicon. Which price would you buy their PCIe card for?

r/singularity
Generative AI Open Source AI

I posted about them before because of their incredible 17.000 tokens/second for Llama 3.1 8B. With production costs rumoured to be $300 to $400, would you buy a PCIe card for $600 to $800 enabling you to get 10.000 tokens/s of Qwen 3.5 27B intelligence with LORA? I myself feel torn. I would probably just go for an API anyway (albeit one with that speed, though). submitted by /u/elemental-mind [link] [comments]