卷积神经网络
计算机科学
特征提取
分散注意力
变压器
人工智能
深度学习
特征学习
实时计算
机器学习
模式识别(心理学)
工程类
电压
生物
电气工程
神经科学
作者
Haitao Wang,Jie Chen,Zhixiang Huang,Bing Li,Jianming Lv,JingMin Xi,Bocai Wu,Jun Zhang,Zhen Wu
出处
期刊:IEEE Transactions on Intelligent Transportation Systems
[Institute of Electrical and Electronics Engineers]
日期:2022-01-01
卷期号:: 1-15
标识
DOI:10.1109/tits.2022.3219676
摘要
According to the surveys of the World Health Organization, distracted driving is one of main causes of road traffic accidents. To improve road traffic safety, real-time detection of drivers’ driving behavior is very important for the development of highly reliable Advanced Driver Assistance System (ADAS). At present, the deep learning architecture based on a Convolutional Neural Network (CNN) has disadvantages such as large number of parameters and weak global feature extraction ability. Therefore, this paper proposes an innovative driver distraction detection model based on the fusion of a transformer and a CNN, referred to as FPT, which is the first exploration in the field of driver distraction detection. First, we introduce the latest Twins transformer as a benchmark. Then, we design residual embedding to replace block embedding, which can further integrate the convolutional neural network with Transformer and improve the feature extraction ability. In addition, the Multilayer Perceptron (MLP) module with a large parameter occupancy rate in the original transformer structure is replaced with a lightweight group convolution module to reduce computational complexity. Finally, a cross-entropy loss function for label smoothing is designed to guide network learning with significantly differentiated features. Comparison results on two large-scale driver distraction detection datasets show that the proposed FPT offers a better compromise between computational cost and performance compared to the state-of-the-art CNN and Transformer architectures.
科研通智能强力驱动
Strongly Powered by AbleSci AI