PrismaDV:自动化任务感知的数据单元测试生成 / PrismaDV: Automated Task-Aware Data Unit Test Generation
1️⃣ 一句话总结
PrismaDV 是一个结合下游任务代码分析和数据分布特性的AI系统,能够自动生成与具体业务任务相关的数据单元测试,并通过一种名为 SIFTA 的自适应优化方法持续改进测试质量,从而比传统通用测试方法更有效地发现对实际应用有影响的数据错误。
Data is a central resource for modern enterprises, and data validation is essential for ensuring the reliability of downstream applications. However, existing automated data unit testing frameworks are largely task-agnostic: they validate datasets without considering the semantics and requirements of the code that consumes the data. We present PrismaDV, a compound AI system that analyzes downstream task code together with dataset profiles to identify data access patterns, infer implicit data assumptions, and generate task-aware executable data unit tests. To further adapt the data unit tests over time to specific datasets and downstream tasks, we propose "Selective Informative Feedback for Task Adaptation" (SIFTA), a prompt-optimization framework that leverages the scarce outcomes from the execution of data unit tests and downstream tasks. We evaluate PrismaDV on two new benchmarks spanning 60 tasks across five datasets, where it consistently outperforms both task-agnostic and task-aware baselines in generating unit tests that reflect the end-to-end impact of data errors. Furthermore, we show that with SIFTA, we can automatically learn prompts for PrismaDV's modules that outperform prompts written by hand or generated from a generic prompt optimizer. We publicly release our benchmarks and prototype implementation.
PrismaDV:自动化任务感知的数据单元测试生成 / PrismaDV: Automated Task-Aware Data Unit Test Generation
PrismaDV 是一个结合下游任务代码分析和数据分布特性的AI系统,能够自动生成与具体业务任务相关的数据单元测试,并通过一种名为 SIFTA 的自适应优化方法持续改进测试质量,从而比传统通用测试方法更有效地发现对实际应用有影响的数据错误。
源自 arXiv: 2604.21765