AI RESEARCH
QV May Be Enough: Toward the Essence of Attention in LLMs
arXiv CS.AI
•
ArXi:2603.15665v1 Announce Type: new Starting from first principles and a linguistic perspective centered on part-of-speech (POS) and syntactic analysis, this paper explores and derives the underlying essence of the Query-Key-Value (QKV) mechanism within the Transformer architecture. Based on this theoretical foundation, we provide a unified explanatory framework for the efficacy of contemporary architectures, including MQA, GQA, and MLA, while identifying their inherent trade-offs and potential optimization trajectories. We.