AI RESEARCH
Query-focused and Memory-aware Reranker for Long Context Processing
arXiv CS.CL
•
ArXi:2602.12192v2 Announce Type: replace Built upon the existing analysis of retrieval heads in large language models, we propose an alternative reranking framework that trains models to estimate passage-query relevance using the attention scores of selected heads. This approach provides a listwise solution that leverages holistic information within the entire candidate shortlist during ranking. At the same time, it naturally produces continuous relevance scores, enabling