AI RESEARCH
Cardinality Estimation for High Dimensional Similarity Queries with Adaptive Bucket Probing
arXiv CS.AI
•
ArXi:2604.04603v1 Announce Type: cross In this work, we address the problem of cardinality estimation for similarity search in high-dimensional spaces. Our goal is to design a framework that is lightweight, easy to construct, and capable of providing accurate estimates with satisfying online efficiency. We leverage locality-sensitive hashing (LSH) to partition the vector space while preserving distance proximity. Building on this, we adopt the principles of classical multi-probe LSH to adaptively explore neighboring buckets, accounting for distance thresholds of varying magnitudes.