联营
计算机科学
人工智能
卷积神经网络
模式识别(心理学)
特征提取
小波
小波变换
深度学习
特征(语言学)
离散小波变换
像素
计算机视觉
语言学
哲学
作者
Said E. El‐Khamy,Ahmad Al-Kabbany,Shimaa El-Bana
标识
DOI:10.1109/itc-egypt52936.2021.9513885
摘要
Aerial scene classification using multi-label remote sensing (MLRS) is a remote sensing challenge task. Conventional techniques in this research area have mainly focused either on the simplified single-label case or on pixel-based approaches, which cannot efficiently handle high-resolution images. Deep learning (DL) and convolutional neural networks (CNNs) have defined the state-of-the-art in many vision problems in recent years. CNNs often adopt pooling layers to enlarge the receptive field, which can lower computational complexity. On the other hand, Conventional pooling methods can result in data loss, degrading subsequent operations such as feature extraction, image retrieval, and scene analysis. Inspired by this drawback, we propose a new CNN model by investigating the impact of discrete wavelet transform pooling (DWTPL) on the performance of this model. Wavelet pooling allows us to utilize spectral information, which is crucial in multi-label remote sensing tasks. We show consistent improvements in precision and F1-score on a widely adopted AID dataset compared to other models from the recent literature.
科研通智能强力驱动
Strongly Powered by AbleSci AI