菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-19
📄 Abstract - Does Code Cleanliness Affect Coding Agents? A Controlled Minimal-Pair Study

As autonomous coding agents see rapid adoption, their evaluation has primarily focused on task completion rates holding the target codebase fixed. This leaves a critical question unanswered: does the structural and stylistic quality, or ``cleanliness'' of the underlying code affect an agent's ability to navigate and modify it? To isolate the effect of code cleanliness from agent capability, we introduce an evaluation protocol built around minimal pairs: repositories that match on architecture, dependencies, and external behaviour, but differ on static-analysis rule violations and cognitive complexity. The pairs are constructed in both directions, by agent pipelines that either degrade a clean repository or clean a messy one. We author 33 tasks across six such pairs, evaluated through hidden tests at the application's public surface. Across 660 trials with Claude Code, code cleanliness does not change the agent's pass rate. However, it substantially alters the agent's operational footprint: agents working on cleaner code use 7 to 8% fewer tokens and reduce file revisitations by 34%. Our findings suggest that traditional maintainability principles remain highly relevant in the era of AI-driven development, shaping the computational cost and navigational efficiency of coding agents. Code cleanliness joins model choice, harness, and prompting as a factor that materially affects agent behaviours.

顶级标签: agents llm software engineering
详细标签: coding agent code cleanliness evaluation minimal pairs agent behavior 或 搜索:

代码整洁度会影响编码智能体吗?一项基于最小配对的控制性研究 / Does Code Cleanliness Affect Coding Agents? A Controlled Minimal-Pair Study


1️⃣ 一句话总结

这项研究通过构建功能相同但整洁度不同的代码仓库,测试了AI编码智能体在其中的表现,发现代码整洁度虽然不改变任务完成成功率,但能显著降低计算成本和提升导航效率,因此代码整洁性在AI时代依然重要。

源自 arXiv: 2605.20049