ProTegO: Protect Text Content against OCR Extraction Attack

计算机科学 光学字符识别 情报检索 对抗制 文本识别 阅读(过程) 人工智能 图像(数学) 法学 政治学
作者
Yanru He,Kejiang Chen,Guoqiang Chen,Zehua Ma,Kui Zhang,Jie Zhang,Huanyu Bian,Han Fang,Weiming Zhang,Nenghai Yu
标识
DOI:10.1145/3581783.3612076
摘要

Online documents greatly improve the efficiency of information interaction but also cause potential security hazards, such as the ability to copy and reuse text content without authorization readily. To address copyright concerns, recent works have proposed converting reproducible text content into non-reproducible formats, making digital text content observable but not duplicable. However, as the Optical Character Recognition (OCR) technology develops, adversaries can still take screenshots of the target text region and use OCR to extract the text content. None of the existing methods can be well adapted to this kind of OCR extraction attack. In this paper, we propose "ProTegO'', a novel text content protection method against the OCR extraction attack, which generates adversarial underpaintings that do not affect human reading but can interfere with OCR after taking screenshots. Specifically, we design a text-style universal adversarial underpaintings generation framework, which can mislead both text recognition models and commercial OCR services. For invisibility, we take full advantage of the fusion property of human eyes and create complementary underpaintings to display alternatively on the screen. Experimental results demonstrate that ProTegO is a one-size-fits-all method that can ensure good visual quality while simultaneously achieving a high protection success rate on text recognition models with different architectures, outperforming the state-of-the-art methods. Furthermore, we validate the feasibility of ProTegO on a wide range of popular commercial OCR services, including Microsoft, Tencent, Alibaba, Huawei, Baidu, Apple, and Xiaomi. Codes will be available at https://github.com/Ruby-He/ProTegO.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
所所应助qizhixu采纳,获得10
3秒前
4秒前
YY完成签到,获得积分10
5秒前
webweb发布了新的文献求助10
5秒前
ping777755完成签到,获得积分10
5秒前
6秒前
6秒前
7秒前
7秒前
Buduan完成签到,获得积分10
8秒前
不会搞科研完成签到,获得积分10
9秒前
陈晨发布了新的文献求助10
9秒前
9秒前
小酒的lyj完成签到,获得积分10
10秒前
11秒前
11秒前
林一发布了新的文献求助10
12秒前
不安青牛应助老木虫采纳,获得10
12秒前
断罪残影完成签到 ,获得积分10
13秒前
13秒前
平常山河发布了新的文献求助10
13秒前
13秒前
14秒前
伊莱恩关注了科研通微信公众号
14秒前
个性的紫菜应助里新采纳,获得30
14秒前
bkagyin应助Hohowinnie采纳,获得10
15秒前
xx发布了新的文献求助10
16秒前
阿宝发布了新的文献求助10
16秒前
奇奇完成签到,获得积分10
16秒前
情怀应助haohao342采纳,获得10
17秒前
。墨殇发布了新的文献求助10
17秒前
和尚哥完成签到,获得积分10
17秒前
Du发布了新的文献求助10
18秒前
11完成签到,获得积分10
19秒前
wuyf发布了新的文献求助10
19秒前
Man发布了新的文献求助10
19秒前
高兴的老黑完成签到,获得积分10
20秒前
dxz完成签到,获得积分10
20秒前
张烤明完成签到,获得积分10
20秒前
高分求助中
One Man Talking: Selected Essays of Shao Xunmei, 1929–1939 1000
Yuwu Song, Biographical Dictionary of the People's Republic of China 800
Herman Melville: A Biography (Volume 1, 1819-1851) 600
Multifunctional Agriculture, A New Paradigm for European Agriculture and Rural Development 600
The Illustrated History of Gymnastics 500
Division and square root. Digit-recurrence algorithms and implementations 500
Hemerologies of Assyrian and Babylonian Scholars 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2496579
求助须知:如何正确求助?哪些是违规求助? 2153205
关于积分的说明 5503719
捐赠科研通 1874029
什么是DOI,文献DOI怎么找? 931969
版权声明 563605
科研通“疑难数据库(出版商)”最低求助积分说明 498116