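Gaze
Computer science
Leverage (statistics)
Artificial intelligence
Generalization
Inference
Computer vision
Pattern recognition (psychology)
Regression
Machine learning
Mathematics
Statistics
Mathematical analysis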
Authors
Xinming Wang,Hanlin Zhang,Zhiyong Wang,Wei Nie,Zhihao Yang,Weihong Ren,Qiong Xu,Xiu Xu,Honghai Liu
Identifier
DOI:10.1109/tcyb.2023.3244269
Abstract
Gaze is a vital feature in analyzing natural human behavior and social interaction. Existing gaze target detection studies use a neural network to learn gaze from gaze orientations and scene cues, modeling gaze in unconstrained scenes. Although they achieve decent accuracy, these studies either employ complex model architectures or rely on additional depth information, which limits their applicability. This article proposes a simple and effective gaze target detection model that employs dual regression to improve detection accuracy while keeping model complexity low. Specifically, in the training phase, the model parameters are optimized under the supervision of both coordinate labels and the corresponding Gaussian-smoothed heatmap labels. In the inference phase, the model outputs the gaze target directly as coordinates rather than as a heatmap. Extensive within-dataset and cross-dataset evaluations on public datasets, together with experiments on clinical autism-screening data, demonstrate that the model achieves high accuracy and inference speed with solid generalization capability.
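The dual supervision described in the abstract can be illustrated with a minimal NumPy sketch. Everything here is an assumption made for illustration: the function names `gaussian_heatmap` and `dual_regression_loss`, the heatmap size, the value of `sigma`, the use of mean squared error for both terms, and the `hm_weight` balance factor are hypothetical and are not taken from the article, which does not specify its loss functions or architecture in the abstract.

```python
import numpy as np


def gaussian_heatmap(xy, size=65, sigma=3.0):
    """Render a Gaussian-smoothed heatmap label from a normalized (x, y) point.

    xy holds coordinates in [0, 1]; the peak (value 1.0) sits at the
    corresponding pixel position. size and sigma are illustrative choices.
    """
    cx = xy[0] * (size - 1)
    cy = xy[1] * (size - 1)
    xs = np.arange(size)[None, :]  # column indices, shape (1, size)
    ys = np.arange(size)[:, None]  # row indices, shape (size, 1)
    return np.exp(-((xs - cx) ** 2 + (ys - cy) ** 2) / (2.0 * sigma ** 2))


def dual_regression_loss(pred_xy, pred_hm, gt_xy, sigma=3.0, hm_weight=1.0):
    """Sum of a coordinate loss and a heatmap loss (both MSE, by assumption)."""
    pred_xy = np.asarray(pred_xy, dtype=float)
    gt_xy = np.asarray(gt_xy, dtype=float)
    pred_hm = np.asarray(pred_hm, dtype=float)
    # Build the heatmap label at the same resolution as the predicted heatmap.
    gt_hm = gaussian_heatmap(gt_xy, size=pred_hm.shape[0], sigma=sigma)
    coord_loss = np.mean((pred_xy - gt_xy) ** 2)
    heatmap_loss = np.mean((pred_hm - gt_hm) ** 2)
    return coord_loss + hm_weight * heatmap_loss


if __name__ == "__main__":
    gt = (0.5, 0.25)
    # A stand-in "prediction": slightly off coordinates and a shifted heatmap.
    loss = dual_regression_loss((0.55, 0.30), gaussian_heatmap((0.55, 0.30)), gt)
    print(f"training loss: {loss:.6f}")
```

Under this sketch, only the coordinate output would be read at inference time, which matches the abstract's statement that the model predicts coordinates rather than heatmaps; the heatmap term acts purely as an auxiliary training signal.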