Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity

Guang Yan, Yuhui Zhang, Zimu Guo, Lutan Zhao, Xiaojun Chen, Chen Wang

IEEE Symposium on Security and Privacy 2025 · Day 2 · Secure Data Processing II