凝视
人工智能
计算机科学
计算机视觉
稳健性(进化)
融合机制
眼动
特征(语言学)
模式识别(心理学)
特征提取
融合
生物化学
脂质双层融合
基因
哲学
语言学
化学
作者
Zhangfang Hu,Yanling Xia,Yuan Luo,Lan Wang
摘要
The variable head pose and low-quality eye images in natural scenes can lead to low accuracy of gaze estimation. In this paper, we propose a multi-feature fusion gaze estimation model based on the attention mechanism. First, face and eye feature extractors based on the group convolution channel and spatial attention mechanism (GCCSAM) are designed to use channel and spatial information to adaptively select and enhance important features in face images and two eye images, and suppress information irrelevant to gaze estimation. Then we design two feature fusion networks to fuse the features of face, two eyes and pupil center position, thus avoiding the effects of two-eye asymmetry and inaccurate head pose estimation on gaze estimation. The average angular error of the proposed method is 4.1° on MPIIGaze and 5.2° on EyeDiap. Compared with the current mainstream methods, our method effectively improves the accuracy and robustness of gaze estimation in natural scenes.
科研通智能强力驱动
Strongly Powered by AbleSci AI