菜单

关于 🐙 GitHub
arXiv 提交日期: 2025-12-15
📄 Abstract - Reveal Hidden Pitfalls and Navigate Next Generation of Vector Similarity Search from Task-Centric Views

Vector Similarity Search (VSS) in high-dimensional spaces is rapidly emerging as core functionality in next-generation database systems for numerous data-intensive services -- from embedding lookups in large language models (LLMs), to semantic information retrieval and recommendation engines. Current benchmarks, however, evaluate VSS primarily on the recall-latency trade-off against a ground truth defined solely by distance metrics, neglecting how retrieval quality ultimately impacts downstream tasks. This disconnect can mislead both academic research and industrial practice. We present Iceberg, a holistic benchmark suite for end-to-end evaluation of VSS methods in realistic application contexts. From a task-centric view, Iceberg uncovers the Information Loss Funnel, which identifies three principal sources of end-to-end performance degradation: (1) Embedding Loss during feature extraction; (2) Metric Misuse, where distances poorly reflect task relevance; (3) Data Distribution Sensitivity, highlighting index robustness across skews and modalities. For a more comprehensive assessment, Iceberg spans eight diverse datasets across key domains such as image classification, face recognition, text retrieval, and recommendation systems. Each dataset, ranging from 1M to 100M vectors, includes rich, task-specific labels and evaluation metrics, enabling assessment of retrieval algorithms within the full application pipeline rather than in isolation. Iceberg benchmarks 13 state-of-the-art VSS methods and re-ranks them based on application-level metrics, revealing substantial deviations from traditional rankings derived purely from recall-latency evaluations. Building on these insights, we define a set of task-centric meta-features and derive an interpretable decision tree to guide practitioners in selecting and tuning VSS methods for their specific workloads.

顶级标签: systems data benchmark
详细标签: vector similarity search evaluation framework information retrieval embedding quality task-centric evaluation 或 搜索:

从任务中心视角揭示向量相似性搜索的隐藏陷阱并导航其下一代发展 / Reveal Hidden Pitfalls and Navigate Next Generation of Vector Similarity Search from Task-Centric Views


1️⃣ 一句话总结

这篇论文提出了一个名为Iceberg的综合性评测框架,它从实际应用任务的角度出发,揭示了传统向量搜索评测的局限,并帮助开发者为特定任务选择最合适的搜索方法。


源自 arXiv: 2512.12980