ggml: backend-agnostic tensor parallelism by JohannesGaessler · Pull Request #19378 · ggml-org/llama.cpp
r/LocalLLaMA
•
Generative AI
Open Source AI
Gregano approved the tensor parallelism PR! submitted by /u/FullstackSensei [link] [comments]