计算机科学
人工智能
模式识别(心理学)
特征(语言学)
模态(人机交互)
字错误率
任务(项目管理)
心率变异性
传感器融合
面部表情
光容积图
编码
语音识别
计算机视觉
心率
工程类
医学
哲学
语言学
生物化学
化学
系统工程
滤波器(信号处理)
血压
基因
放射科
作者
Shuai Ding,Zhen Ke,Zijie Yue,Cheng Song,Lu Lu
标识
DOI:10.1109/tim.2022.3209750
摘要
Heart rate (HR), heart rate variability (HRV), and respiratory rate (RR) are vital physiological signals that can reveal the physiological and psychological states of human beings. Recent research studies have demonstrated that these physiological signals can be estimated in a non-contact way from visible or infrared facial videos based on remote Photoplethysmography (rPPG). However, most existing methods only use only one data modality and output a single type of physiological signal at a time. To overcome these restrictions, we propose a multi-task framework named the SMP-Net based on multimodal fusion to realize the non-contact multi-physiological signals estimation. First, a joint attention feature fusion (JAFF) module is designed to encode and fuse features from visible and infrared videos. The JAFF module considers different modality-wise, spatial, and channel-wise information on features comprehensively. Then, a task-oriented feature refinement (TOFR) module is developed to extract task-oriented features by refining shared features to improve the estimation performance. Finally, the proposed SMP-Net is validated on the MMVS, VIPL-HR, and UBFC-rPPG datasets and outperforms state-of-the-art methods. Our proposed SMP-Net achieves 1.12 bpm mean absolute error (MAE) for HR estimation, 0.58 ρ for rPPG estimation, and 2.08 MAE for RR estimation on the MMVS dataset. On the VIPL-HR and UBFC-rPPG datasets, the MAE of HR estimation is 2.03 and 0.59 bpm, respectively. The proposed SMP-Net is of great significance for continuous non-contact estimation of multi-physiological signals.
科研通智能强力驱动
Strongly Powered by AbleSci AI