菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-21
📄 Abstract - From CVE to CWE: Syscall-Based HIDS Generalisation

Host intrusion detection systems (HIDS) based on system-call traces are typically trained and evaluated against individual Common Vulnerabilities and Exposures (CVE) instances. In operational settings, however, defenders need to recognise new exploits of an already known type of weakness. We empirically examine whether a one-class anomaly detector trained on the normal behaviour of a set of CVEs that share a Common Weakness Enumeration (CWE) class generalises to a different, unseen CVE inside the same class. Using six scenarios drawn from LID-DS-2021 and grouped into three CWE families (CWE-307 broken authentication, CWE-89 SQL injection, CWE-434 unrestricted file upload), we extract a 66-dimensional Peng-Guo-style feature vector per sliding window and train Isolation Forest and SGD One-Class SVM detectors with normal-only thresholds calibrated to fixed target false positive rates. We define and answer four research questions covering self-detection, asymmetric cross-CVE transfer, the value of a combined CWE-level normal profile, and the effect of feature filtering on transferability. The combined CWE-307 detector reaches F1 = 0.6976 at calibration target FPR = 0.05 (precision = 0.8994, recall = 0.5698), whereas CWE-89 and CWE-434 collapse to F1 <= 0.21 under the same protocol. Cross-CVE transfer turns out to be strongly direction-dependent and dominated by the breadth of the source normal profile rather than by the CWE label. We conclude that CWE-level generalisation in HIDS is empirically attainable for some but not all weakness families with current syscall features, and we argue that calibrated FPR is a methodological prerequisite for honest reporting in this setting.

顶级标签: machine learning systems
详细标签: anomaly detection host intrusion detection cve cwe system calls 或 搜索:

从CVE到CWE:基于系统调用的主机入侵检测系统泛化能力研究 / From CVE to CWE: Syscall-Based HIDS Generalisation


1️⃣ 一句话总结

本研究通过实验发现,基于系统调用轨迹的主机入侵检测系统(HIDS)在某些漏洞类别(如认证绕过)中能够从已知漏洞实例泛化到同类别未知漏洞,但在其他类别(如SQL注入)中效果不佳,提示仅靠当前系统调用特征无法在所有漏洞类型上实现可靠的跨实例泛化。

源自 arXiv: 2606.22581