菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-16
📄 Abstract - PCodeTrans: Translate Decompiled Pseudocode to Compilable and Executable Equivalent

Decompilation is foundational to binary analysis, yet conventional tools prioritize human readability over strict recompilability and verifiable runtime correctness. While recent LLM-based approaches attempt to refine decompiled pseudocode, they typically either optimize solely for readability or rely on static analysis for evaluation. This makes them prone to "semantic hallucinations" that compromise accuracy and fail to resolve actual runtime failures. For critical tasks like software modernization and vulnerability remediation, recovered code must not only compile but replicate the original binary's behavior. We present PCodeTrans, a feedback-driven framework that bridges the gap between decompilation, recompilation, and rigorous function-level dynamic validation. After extracting a minimal yet coherent context to guarantee recompilability, PCodeTrans employs an in situ substitutable engine to hot-swap the compiled function directly into the unmodified binary, natively preserving its authentic execution context and global dependencies. Guided by fine-grained differential tracing, PCodeTrans generates precise runtime feedback to iteratively guide an LLM in repairing semantic discrepancies. Evaluated on Coreutils and Binutils, PCodeTrans achieves unprecedented recovery performance when rectifying raw Hex-Rays outputs, attaining 100% function-level compilability on unstripped binaries alongside 99.55% and 99.89% test-validated behavioral consistency, respectively. In doing so, it resolves 76.56% and 79.74% of logic errors exposed by official test suites. Exhibiting exceptional resilience, PCodeTrans maintains over 96% behavioral consistency even on fully stripped binaries. By significantly outperforming all existing baselines, PCodeTrans paves a practical path to reliably translate decompiled pseudocode into compilable and executable equivalents.

顶级标签: llm systems model evaluation
详细标签: decompilation binary analysis code translation runtime validation feedback-driven 或 搜索:

PCodeTrans:将反编译得到的伪代码转换为可编译和可执行的等价代码 / PCodeTrans: Translate Decompiled Pseudocode to Compilable and Executable Equivalent


1️⃣ 一句话总结

这篇论文提出了一个名为PCodeTrans的智能框架,它能够自动修复反编译工具产生的伪代码,不仅确保修复后的代码可以成功编译,还能在运行时精确地复制原始程序的行为,从而为软件更新和安全漏洞修复等关键任务提供了可靠的技术支持。

源自 arXiv: 2603.14855