计算机科学
有效载荷(计算)
信息隐藏
语义学(计算机科学)
隐写术
嵌入
自然语言处理
Unicode码
人工智能
语义相似性
编码(社会科学)
语音识别
理论计算机科学
程序设计语言
数学
计算机网络
统计
网络数据包
作者
Tianyu Yang,Hanzhou Wu,Biao Yi,Guorui Feng,Xinpeng Zhang
标识
DOI:10.1109/tdsc.2023.3247493
摘要
Linguistic steganography (LS) aims to embed secret information into a highly encoded text for covert communication. It can be roughly divided to two main categories, i.e., modification based LS (MLS) and generation based LS (GLS). MLS embeds secret data by slightly modifying a given text without impairing the meaning of the text, whereas GLS uses a well trained language model to directly generate a text carrying secret data. A common disadvantage for MLS methods is that the embedding payload is very small, whose return is well preserving the semantic quality of the text. In contrast, GLS enables the data hider to embed a large payload, which has to pay the high price of uncontrollable semantics. In this article, we propose a novel LS method to modify a given text by pivoting it between two different languages and embed secret data using a semantic-aware information encoding strategy. Our purpose is to alter the expression of the given text, enabling a large payload to be embedded while keeping the semantic information unchanged. Experiments have shown that the proposed work not only achieves a large embedding payload, but also shows superior performance in maintaining the semantic consistency and resisting linguistic steganalysis.
科研通智能强力驱动
Strongly Powered by AbleSci AI