AI RESEARCH
TAH-QUANT: Effective Activation Quantization in Pipeline Parallelism over Slow Network
arXiv CS.LG
•
ArXi:2506.01352v2 Announce Type: replace