Forcing Flash Attention onto a TPU and Learning the Hard Way

Hacker News (AI)
AI Hardware

Article URL: Comments URL: Points: 3 # Comments: 0