计算机科学
帧(网络)
特征(语言学)
对数
人工智能
领域(数学分析)
点(几何)
计算机视觉
分解
理论(学习稳定性)
图像(数学)
模式识别(心理学)
机器学习
数学
数学分析
电信
哲学
语言学
生态学
几何学
生物
作者
Chenfeng Zhu,Bolin Zhang,Qilin Yin,Changchun Yin,Wei Lu
标识
DOI:10.1016/j.patcog.2023.110077
摘要
Due to the remarkable progress in face manipulation technology, malicious applications of these technologies may pose a great threat to the social stability. Therefore, it is essential to carry out the research of deepfake detection. In this paper, we assumed that the illumination on frames that skip a certain space is basically consistent in real videos, but tends to be inconsistent in fake videos. From this point, a network which contains a learnable Image Decomposition Module (IDM) and multi-level feature enhancement is proposed. IDM decomposes frames into illumination and reflection, and frame recomposition is followed to highlight the frame-level illumination inconsistency. Multi-level feature enhancement is proposed to enhance the illumination inconsistency at feature level. In addition, considering the computational complexity and human vision perception mechanism, we train the network in logarithm domain. Experimental results show that the proposed method is effective and superior compared with other state-of-the-art deepfake detection methods on mainstream deepfake datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI