菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-21
📄 Abstract - AblateCell: A Reproduce-then-Ablate Agent for Virtual Cell Repositories

Systematic ablations are essential to attribute performance gains in AI Virtual Cells, yet they are rarely performed because biological repositories are under-standardized and tightly coupled to domain-specific data and formats. While recent coding agents can translate ideas into implementations, they typically stop at producing code and lack a verifier that can reproduce strong baselines and rigorously test which components truly matter. We introduce AblateCell, a reproduce-then-ablate agent for virtual cell repositories that closes this verification gap. AblateCell first reproduces reported baselines end-to-end by auto-configuring environments, resolving dependency and data issues, and rerunning official evaluations while emitting verifiable artifacts. It then conducts closed-loop ablation by generating a graph of isolated repository mutations and adaptively selecting experiments under a reward that trades off performance impact and execution cost. Evaluated on three single-cell perturbation prediction repositories (CPA, GEARS, BioLORD), AblateCell achieves 88.9% (+29.9% to human expert) end-to-end workflow success and 93.3% (+53.3% to heuristic) accuracy in recovering ground-truth critical components. These results enable scalable, repository-grounded verification and attribution directly on biological codebases.

顶级标签: agents machine learning biology
详细标签: virtual cell reproducibility ablation study code agent benchmark 或 搜索:

AblateCell:面向虚拟细胞仓库的“先复现再消融”智能体 / AblateCell: A Reproduce-then-Ablate Agent for Virtual Cell Repositories


1️⃣ 一句话总结

本文提出了一个名为AblateCell的AI智能体,它能自动复现生物模型仓库中的结果,然后通过系统性地移除(消融)模型的不同组件,来精确判断哪些部分真正贡献了性能提升,从而帮助研究人员在缺乏统一标准的生物代码库中可靠地验证和归因模型效果。

源自 arXiv: 2604.19606