计算机科学
变压器
人工智能
软件可移植性
嵌入
卷积神经网络
计算机视觉
卷积(计算机科学)
特征提取
特征(语言学)
人工神经网络
工程类
语言学
哲学
电压
电气工程
程序设计语言
作者
Zhang Guiqiang,Yi Wang,She X. Xing
摘要
Camera surveillance plays an important role in maintaining the stability and safety of the social and public environment, and there are further requirements for the role of camera surveillance in building a smart city. This paper proposes a convolutional neural network based on the combination of the convolution module and the Transformer module. The network is applied to the tracking of pedestrian targets in infrared surveillance cameras to fill the shortcomings of surveillance cameras in the night environment. In this paper, the local features of the convolution module and the global features of the Transformer are combined into a comprehensive feature map. The feature information is used to solve the problem of less target feature information in infrared images, and the advantages of codec network structure design are used to ensure effective target features. At the same time, considering the embedding and portability of the network model, this paper adopts the method of grouping shared convolution kernels and Transformer nested segmentation in the design of the convolution module and the Transformer module, so as to achieve the purpose of light weight. After several sets of control experiments, the network designed in this paper has a certain improvement in tracking speed and tracking performance, and effectively solves the problem that infrared weak and small targets are not easy to track.
科研通智能强力驱动
Strongly Powered by AbleSci AI