菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-24
📄 Abstract - Quality-Driven Selective Mutation for Deep Learning

Mutants support testing and debugging in two roles: (i) as test goals and (ii) as substitutes for real faults. Hard-to-kill mutants provide better guidance for test improvement, while realism is essential when mutants are used to simulate real bugs. Building on these roles, selective mutation for deep learning (DL) aims to reduce the cost of mutant generation and execution by choosing operator configurations that yield resistant and realistic mutants. However, the DL literature lacks a unified measure that captures both aspects. This study presents a probabilistic framework to quantify mutant quality along two complementary axes: resistance and realism. Resistance adapts the classical notion of hard-to-kill mutants to the DL setting using statistical killing probabilities, while realism is measured via the generalized Jaccard similarity between mutant and real-fault detectability patterns. The framework enables ranking and filtering of low-quality mutation-operator configurations without assuming a specific use case. We empirically evaluate the approach on four datasets of real DL faults. Three datasets (CleanML, DeepFD, and DeepLocalize) are used to estimate and select high-quality operator configurations, and the held-out defect4ML dataset is used for validation. Results show that quality-driven selection reduces the number of generated mutants by up to 55.6% while preserving typical levels of resistance and realism under baseline-aligned selection thresholds. These findings confirm that dual-objective selection can lower cost without compromising the usefulness of mutants for either role.

顶级标签: machine learning model evaluation
详细标签: deep learning mutation testing selective mutation fault detection resistance 或 搜索:

面向深度学习的质量驱动选择性变异 / Quality-Driven Selective Mutation for Deep Learning


1️⃣ 一句话总结

本文提出一种基于抗杀性和真实性的双重指标,通过概率框架筛选高质量变异算子配置,从而在减少深度学习模型变异生成成本的同时,保持变异体用于测试优化和模拟真实缺陷的有效性。

源自 arXiv: 2604.22640