计算机科学
人工神经网络
建筑
人工智能
实时计算
艺术
视觉艺术
作者
Ling Yi,Yu Wu,Amr Tolba,Tengfei Li,Shiqi Ren,Jinliang Ding
标识
DOI:10.1109/jiot.2024.3359662
摘要
Image recognition techniques have become the mainstream solution for indoor unmanned aerial vehicle (UAV) localization and navigation due to the absence of global positioning system. However, unlike autonomous vehicles that enjoy the dividends of the "big model" era, UAVs fail to deploy such models due to the hardware limitations. To this end, this paper proposes a compact all multilayer perceptron (MLP) deep neural network structure that offers a paradigm for theory-guided prompt structural compression of large-scale MLP models. First, we propose a gradient-based sensitivity analysis (GB-SA) method. Unlike existing SA methods, GB-SA obtains nodes' sensitivity indices with gradient information, openning the possibility of efficient SA for large models. We start by integrating GB-SA with MLP and then extend the mode to MLP-Mixer, which is a promising all-MLP deep neural network. By deeply combining GB-SA and MLP-Mixer, SA-MLP-Mixer emerges as a compact model without reducing the model precision. Finally, we evaluate the effectiveness of the proposed model on the benchmark. The experimental results show that SA-MLP-Mixer has an airborne level model scale and accurate localization capability.
科研通智能强力驱动
Strongly Powered by AbleSci AI