针对XGBoost模型与网络入侵数据集的机器遗忘技术 / Machine Unlearning for the XGBoost Model with Network Intrusion Datasets
1️⃣ 一句话总结
本文提出了一种名为XGBoost-Forget的方法,使XGBoost模型能够高效地忘记特定的训练数据点,而无需重新训练整个模型,并在网络入侵检测的表格数据集上验证了该方法在保持原有预测精度的同时大幅提升了遗忘速度。
Machine Unlearning (MU) has emerged as an important technique for removing specific data points from trained models without requiring full retraining. However, most existing MU research focuses on deep learning and image data, leaving a gap in the domain of network intrusion detection, which relies heavily on tabular data. This work introduces XGBoost-Forget, an unlearning approach for the XGBoost model, to address this gap. The approach is evaluated on two tabular Network Intrusion (NI) datasets, IoT-23 and GeNIS, using multiple metrics to assess model performance, unlearning efficiency, and forgetting quality. The results show that XGBoost-Forget maintains predictive performance close to the original model while providing significantly faster unlearning, demonstrating its potential for MU in tabular NI settings.
针对XGBoost模型与网络入侵数据集的机器遗忘技术 / Machine Unlearning for the XGBoost Model with Network Intrusion Datasets
本文提出了一种名为XGBoost-Forget的方法,使XGBoost模型能够高效地忘记特定的训练数据点,而无需重新训练整个模型,并在网络入侵检测的表格数据集上验证了该方法在保持原有预测精度的同时大幅提升了遗忘速度。
源自 arXiv: 2606.19220