Executing programs inside transformers with exponentially faster inference
r/LocalLLaMA
•
Generative AI
Submitted by /u/liquiddandruff [link] [comments]