Exploiting Task-Level Vulnerabilities: An Automatic Jailbreak Attack and Defense Benchmarking for LLMs

Lan Zhang

34th USENIX Security Symposium (USENIX Security '25) · Day 2 · LLM Security 2: Jailbreaking and Prompt Stealing