AI RESEARCH

Query-focused and Memory-aware Reranker for Long Context Processing

arXiv CS.CL

ArXi:2602.12192v2 Announce Type: replace Built upon the existing analysis of retrieval heads in large language models, we propose an alternative reranking framework that trains models to estimate passage-query relevance using the attention scores of selected heads. This approach provides a listwise solution that leverages holistic information within the entire candidate shortlist during ranking. At the same time, it naturally produces continuous relevance scores, enabling