菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-01-25
📄 Abstract - IPBC: An Interactive Projection-Based Framework for Human-in-the-Loop Semi-Supervised Clustering of High-Dimensional Data

High-dimensional datasets are increasingly common across scientific and industrial domains, yet they remain difficult to cluster effectively due to the diminishing usefulness of distance metrics and the tendency of clusters to collapse or overlap when projected into lower dimensions. Traditional dimensionality reduction techniques generate static 2D or 3D embeddings that provide limited interpretability and do not offer a mechanism to leverage the analyst's intuition during exploration. To address this gap, we propose Interactive Project-Based Clustering (IPBC), a framework that reframes clustering as an iterative human-guided visual analysis process. IPBC integrates a nonlinear projection module with a feedback loop that allows users to modify the embedding by adjusting viewing angles and supplying simple constraints such as must-link or cannot-link relationships. These constraints reshape the objective of the projection model, gradually pulling semantically related points closer together and pushing unrelated points further apart. As the projection becomes more structured and expressive through user interaction, a conventional clustering algorithm operating on the optimized 2D layout can more reliably identify distinct groups. An additional explainability component then maps each discovered cluster back to the original feature space, producing interpretable rules or feature rankings that highlight what distinguishes each cluster. Experiments on various benchmark datasets show that only a small number of interactive refinement steps can substantially improve cluster quality. Overall, IPBC turns clustering into a collaborative discovery process in which machine representation and human insight reinforce one another.

顶级标签: systems model training data
详细标签: interactive clustering human-in-the-loop dimensionality reduction visual analytics semi-supervised learning 或 搜索:

IPBC:一种基于交互式投影的人机协同半监督高维数据聚类框架 / IPBC: An Interactive Projection-Based Framework for Human-in-the-Loop Semi-Supervised Clustering of High-Dimensional Data


1️⃣ 一句话总结

这篇论文提出了一种名为IPBC的交互式框架,通过让用户在二维投影图上调整视角和添加简单约束(如必须关联或不能关联),来引导机器学习模型优化数据布局,从而将高维数据的聚类过程转变为人机协作的探索任务,最终在少量人工干预下显著提升聚类效果并生成可解释的结果。

源自 arXiv: 2601.18828