数字水印
计算机科学
水印
人工神经网络
加权
人工智能
许可
嵌入
深度学习
数据挖掘
计算机安全
机器学习
图像(数学)
放射科
医学
法学
政治学
作者
Ryota Namba,Jun Sakuma
标识
DOI:10.1145/3321705.3329808
摘要
Deep learning has been achieving top levels of performance in many tasks. However, since it is costly to train a deep learning model, neural network models must be treated as valuable intellectual properties. One concern arising from our current situation is that malicious users might redistribute proprietary models or provide prediction services using such models without permission. One promising solution to this problem is digital watermarking, which works by embedding a mechanism into the model so that the model owners can verify their ownership of the model externally. In this study, we present a novel attack method against such watermarks known as query modification and demonstrate that all currently existing watermarking methods are vulnerable to either query modification or other existing attack methods (such as model modification). To overcome these vulnerabilities, we then present a novel watermarking method that we have named exponential weighting and experimentally show that our watermarking method achieves high watermark verification performance even under malicious invalidation processing attempts by unauthorized service providers (such as model modification and query modification) without sacrificing the predictive performance of the neural network model itself.
科研通智能强力驱动
Strongly Powered by AbleSci AI