计算机科学
面部识别系统
人工智能
网(多面体)
模式识别(心理学)
语音识别
数学
几何学
作者
Qinghui Zhang,Xiaofeng Wang,Mengya Zhang,Lihua Lu,Pengtao Lv
出处
期刊:Electronics
[Multidisciplinary Digital Publishing Institute]
日期:2024-03-01
卷期号:13 (5): 958-958
标识
DOI:10.3390/electronics13050958
摘要
Face recognition in general scenarios has been saturated in recent years, but there is still room to enhance model performance in extreme scenarios and fairness situations. Inspired by the successful application of Transformer and ConvNet in computer vision, we propose a FIN-Block, which gives a more flexible composition paradigm for building a novel pure convolution model and provides a foundation for constructing a new framework for general face recognition in both extreme scenarios and fairness situations. FIN-Block-A uses a combination of stacked large-size convolution kernels and parallel convolution branches to ensure a large spatial receptive field while improving the module’s deep feature embedding and extraction capabilities. FIN-Block-B takes advantage of stacked orthogonal convolution kernels and parallel branches to balance model size and performance. By applying FIN-Block with an adapted convolution kernel size in different stages, we built a reasonable and novel framework Face-Inception-Net, and the performance of the model is highly competitive with ConvNeXt and InceptionNeXt. The models were trained on CASIA-WebFace and MS-wo-RFW databases and evaluated on 14 mainstream benchmarks, including LFW, extreme scene, and fairness test sets. The proposed Face-Inception-Net achieved the highest average TAR@FAR0.001 of 95.9% in all used benchmarks, fully demonstrating effectiveness and generality in various scenarios.
科研通智能强力驱动
Strongly Powered by AbleSci AI