基于大语言模型的软件工程问题解决:进展与前沿综合综述 / Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey
1️⃣ 一句话总结
这篇论文系统性地综述了如何利用大语言模型自动解决软件开发中的实际问题,涵盖了从数据构建、方法技术到实际应用与未来挑战的全面分析。
Issue resolution, a complex Software Engineering (SWE) task integral to real-world development, has emerged as a compelling challenge for artificial intelligence. The establishment of benchmarks like SWE-bench revealed this task as profoundly difficult for large language models, thereby significantly accelerating the evolution of autonomous coding agents. This paper presents a systematic survey of this emerging domain. We begin by examining data construction pipelines, covering automated collection and synthesis approaches. We then provide a comprehensive analysis of methodologies, spanning training-free frameworks with their modular components to training-based techniques, including supervised fine-tuning and reinforcement learning. Subsequently, we discuss critical analyses of data quality and agent behavior, alongside practical applications. Finally, we identify key challenges and outline promising directions for future research. An open-source repository is maintained at this https URL to serve as a dynamic resource in this field.
基于大语言模型的软件工程问题解决:进展与前沿综合综述 / Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey
这篇论文系统性地综述了如何利用大语言模型自动解决软件开发中的实际问题,涵盖了从数据构建、方法技术到实际应用与未来挑战的全面分析。
源自 arXiv: 2601.11655