菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-11
📄 Abstract - A Universal Nearest-Neighbor Estimator for Intrinsic Dimensionality

Estimating the intrinsic dimensionality (ID) of data is a fundamental problem in machine learning and computer vision, providing insight into the true degrees of freedom underlying high-dimensional observations. Existing methods often rely on geometric or distributional assumptions and can significantly fail when these assumptions are violated. In this paper, we introduce a novel ID estimator based on nearest-neighbor distance ratios that involves simple calculations and achieves state-of-the-art results. Most importantly, we provide a theoretical analysis proving that our estimator is \emph{universal}, namely, it converges to the true ID independently of the distribution generating the data. We present experimental results on benchmark manifolds and real-world datasets to demonstrate the performance of our estimator.

顶级标签: machine learning theory model evaluation
详细标签: intrinsic dimensionality nearest neighbor dimensionality estimation nonparametric estimation convergence analysis 或 搜索:

一种通用的内在维度最近邻估计器 / A Universal Nearest-Neighbor Estimator for Intrinsic Dimensionality


1️⃣ 一句话总结

这篇论文提出了一种基于最近邻距离比率的全新方法,能够简单高效且无需依赖数据分布假设地准确估计出高维数据的真实内在维度。

源自 arXiv: 2603.10493