Optimized layerwise approximation for efficient private inference on fully homomorphic encryption

Lee, Junghyun; Lee, Joon-Woo; Lee, Eunsang; Kim, Young-Sik; Lee, Yongwoo; Kim, Yongjune; No, Jong-Seon

doi:10.1016/j.neucom.2026.134016

상세 보기

Optimized layerwise approximation for efficient private inference on fully homomorphic encryption

Lee, Junghyun;
Lee, Joon-Woo;
Lee, Eunsang;
Kim, Young-Sik;
Lee, Yongwoo;
외 2명

Citations

SCOPUS

0

초록

Recent studies have explored the deployment of privacy-preserving deep neural networks utilizing homomorphic encryption (HE), especially for private inference (PI). Many works have attempted the approximation-aware training (AAT) approach in PI, changing the activation functions of a model to low-degree polynomials that are easier to compute on HE by allowing model retraining. However, due to constraints in the training environment, it is often necessary to consider post-training approximation (PTA), using the pre-trained parameters of the existing plaintext model without retraining. Existing PTA studies have uniformly approximated the activation function in all layers to a high degree to mitigate accuracy loss from approximation, leading to significant time consumption. This study proposes an optimized layerwise approximation (OLA), a systematic framework that optimizes both accuracy loss and time consumption by using different approximation polynomials for each layer in the PTA scenario. For efficient approximation, we reflect the layerwise impact on the classification accuracy by considering the actual input distribution of each activation function while constructing the optimization problem. Additionally, we provide a dynamic programming technique to solve the optimization problem and achieve the optimized layerwise degrees in polynomial time. As a result, we successfully approximated the ReLU and GELU functions, significantly reducing time latency while maintaining classification performance. Especially, the OLA method reduces inference times for the ResNet-20 model and the ResNet-32 model by 3.02 times and 2.82 times, respectively, compared to prior state-of-the-art implementations employing uniform degree polynomials. © 2026 Elsevier B.V.

키워드

Cloud computing; Dynamic programming; Homomorphic encryption; Private inference; RNS-CKKS; Weighted least squares

제목: Optimized layerwise approximation for efficient private inference on fully homomorphic encryption

저자: Lee, Junghyun; Lee, Joon-Woo; Lee, Eunsang; Kim, Young-Sik; Lee, Yongwoo; Kim, Yongjune; No, Jong-Seon

DOI: 10.1016/j.neucom.2026.134016

발행일: 2026-09

유형: Article

저널명: Neurocomputing

권: 695