Executing programs inside transformers with exponentially faster inference

r/LocalLLaMA
Generative AI

Submitted by /u/liquiddandruff