SrcMarker: Dual-Channel Source Code Watermarking via Scalable Code Transformations

Borui Yang, Wei Li, Liyao Xiang, Bo Li

IEEE Symposium on Security and Privacy 2024 · Day 3 · Continental Ballroom 4

As the developer community grows and large language models (LLMs) are increasingly used for code generation, protecting intellectual property and verifying code authorship have become pressing concerns. Borui Yang, with co-authors Wei Li, Liyao Xiang, and Bo Li, presented SrcMarker at IEEE S&P: a **dual-channel source code watermarking system** designed to address both. The talk describes a method for embedding an invisible bit string into a source code snippet, where it can serve as evidence of ownership or as a marker that distinguishes human-written code from machine-generated code.
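To make the idea of hiding bits in code concrete, here is a minimal toy sketch in Python. It is not SrcMarker's implementation. It assumes, purely for illustration, that one channel is the choice of a variable name and the other is the choice between two equivalent ways of writing an increment. The codebook `NAME_FOR_BIT`, the `TEMPLATE` snippet, and the functions `embed` and `extract` are all hypothetical names invented for this example.

```python
import re

# Hypothetical toy codebook (an assumption for illustration only).
# Channel 1 (identifier): the name chosen for the counter encodes one bit.
NAME_FOR_BIT = {0: "count", 1: "total"}
# Channel 2 (syntax): the form of the increment encodes one bit.
#   bit 0 -> "x += 1"      bit 1 -> "x = x + 1"

# A fixed snippet with placeholders for the two watermark-carrying choices.
TEMPLATE = "VAR = 0\nfor item in items:\n    INCREMENT\n"


def embed(bits):
    """Return a code snippet that carries the 2-bit watermark (name_bit, form_bit)."""
    name_bit, form_bit = bits
    name = NAME_FOR_BIT[name_bit]
    if form_bit == 0:
        increment = f"{name} += 1"
    else:
        increment = f"{name} = {name} + 1"
    return TEMPLATE.replace("VAR", name).replace("INCREMENT", increment)


def extract(code):
    """Recover (name_bit, form_bit) from a snippet produced by embed()."""
    # The counter name is the identifier initialised to 0 on the first line.
    name = re.match(r"(\w+) = 0", code).group(1)
    name_bit = next(bit for bit, n in NAME_FOR_BIT.items() if n == name)
    # The augmented-assignment form marks bit 0; the expanded form marks bit 1.
    form_bit = 0 if "+= 1" in code else 1
    return (name_bit, form_bit)


if __name__ == "__main__":
    marked = embed((1, 0))
    print(marked)
    print(extract(marked))
```

All four variants produced by `embed` compute the same result at run time, so the watermark changes only how the code is written, not what it does. A real system would have to choose transformations for arbitrary input code and survive edits by an adversary, which this sketch does not attempt.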

AI review

SrcMarker is a dual-channel source code watermarking system that addresses IP protection and code provenance in the age of LLMs. Its 'Feature Approximation Module' bridges the gap between discrete code edits and the continuous, differentiable signals that neural network training requires, which allows end-to-end learning of semantic-preserving transformations. This is a significant technical advance with direct practical use.
