DeGPT: Optimizing Decompiler Output with LLM
Peiwei Hu
Network and Distributed System Security (NDSS) Symposium 2024 · Day 3 · Reverse Engineering
Reverse engineering stands as a cornerstone in cybersecurity, indispensable for critical tasks such as malware analysis, vulnerability discovery, and understanding closed-source software. At its heart, this discipline seeks to unravel the intricate program logic and algorithms embedded within target binaries. Decompilers, tools like Hex-Rays and Ghidra, are pivotal in this process, transforming low-level assembly code into more human-readable, high-level programming languages. Despite their widespread use and significant advancements, state-of-the-art decompilers grapple with a fundamental challenge: the profound loss of semantic information during compilation. Crucial details, including meaningful variable names and descriptive comments, are typically stripped away, leaving reverse engineers with outputs that are difficult to parse and comprehend.