AI RESEARCH

TAH-QUANT: Effective Activation Quantization in Pipeline Parallelism over Slow Network

arXiv cs.LG

arXiv:2506.01352v2 Announce Type: replace