AI RESEARCH
SparseBalance: Load-Balanced Long Context Training with Dynamic Sparse Attention
arXiv CS.AI
•
ArXi:2604.13847v1 Announce Type: cross While sparse attention mitigates the computational bottleneck of long-context LLM