计算机科学
人工智能
分割
计算机视觉
背景(考古学)
航空影像
目标检测
深度学习
航空影像
杂乱
解析
图像分割
聚类分析
模式识别(心理学)
图像(数学)
雷达
古生物学
生物
电信
作者
Prateek Garg,Anirudh Chakravarthy,Murari Mandal,Pratik Narang,Vinay Chamola,Mohsen Guizani
摘要
Aerial scenes captured by UAVs have immense potential in IoT applications related to urban surveillance, road and building segmentation, land cover classification, and so on, which are necessary for the evolution of smart cities. The advancements in deep learning have greatly enhanced visual understanding, but the domain of aerial vision remains largely unexplored. Aerial images pose many unique challenges for performing proper scene parsing such as high-resolution data, small-scaled objects, a large number of objects in the camera view, dense clustering of objects, background clutter, and so on, which greatly hinder the performance of the existing deep learning methods. In this work, we propose ISDNet (Instance Segmentation and Detection Network), a novel network to perform instance segmentation and object detection on visual data captured by UAVs. This work enables aerial image analytics for various needs in a smart city. In particular, we use dilated convolutions to generate improved spatial context, leading to better discrimination between foreground and background features. The proposed network efficiently reuses the segment-mask features by propagating them from early stages using residual connections. Furthermore, ISDNet makes use of effective anchors to accommodate varying object scales and sizes. The proposed method obtains state-of-the-art results in the aerial context.
科研通智能强力驱动
Strongly Powered by AbleSci AI