计算机科学
视频处理
硬件加速
边缘计算
数据处理
数据流图
延迟(音频)
加速度
实时计算
人工智能
GSM演进的增强数据速率
嵌入式系统
现场可编程门阵列
数据库
电信
物理
经典力学
作者
Ningzhou Li,Zheming Yang,Mingxuan Li,Wen Ji
标识
DOI:10.1109/smc53992.2023.10394353
摘要
In visual intelligent scenarios, large amounts of real-time video data are generated at the end. During the optimization process for different video tasks, frequent data copying between devices and hosts can be limited by data bandwidth, resulting in high system latency. We investigate computing bottlenecks in online video processing to reduce processing latency and improve efficiency. In this paper, we propose a joint video acceleration processing (JVAP) architecture for online edge systems. First, video-compressed streams are transmitted to the GPU for decoding and conversion of data content. Second, we design data pre-processing and post-processing modules to achieve specific functional operators and separately complete operator combinations and stitching. Different computing tasks can reuse the implemented operator library. Third, we modify the data interface of the inference task model to maintain the consistent flow of data in the GPU. We conduct experiments using videos of different qualities and model frameworks of varying scales. The results indicate that the proposed method enhances the average processing efficiency over 11% with respect to existing representative acceleration frameworks and extends the potential application of online intelligent inference algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI