AI RESEARCH
Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention [P]
r/MachineLearning
•
AI/ML research on KV Sharing, mHC, and Compressed Attention [P