ggml: backend-agnostic tensor parallelism by JohannesGaessler · Pull Request #19378 · ggml-org/llama.cpp

r/LocalLLaMA
Generative AI Open Source AI

Gregano approved the tensor parallelism PR! submitted by /u/FullstackSensei [link] [comments]