在数据集中寻找多种解释 / Finding Multiple Interpretations in Datasets
1️⃣ 一句话总结
本文提出一种方法,能够在保持模型性能(如准确率)相近的前提下,自动找到多个在内在特征(如基因表达模式)上截然不同的模型,从而帮助研究者从不同角度理解数据背后的现象,避免单一解释的偏见。
In this paper, we propose an approach to finding sets of similar-performing models (in terms of loss/accuracy measurements) with highly different context-aware characteristics. Through experiments on the METABRIC dataset, we show that the proposed method finds multiple models with highly different gene expressions than those found by the control methodology without performance penalties. We argue that the proposed methodology is important whenever one aims to analyze any global characteristic of a model to extract insight into the underlying phenomenon being studied.
在数据集中寻找多种解释 / Finding Multiple Interpretations in Datasets
本文提出一种方法,能够在保持模型性能(如准确率)相近的前提下,自动找到多个在内在特征(如基因表达模式)上截然不同的模型,从而帮助研究者从不同角度理解数据背后的现象,避免单一解释的偏见。
源自 arXiv: 2606.12277