地理定位
计算机科学
人工智能
卷积神经网络
特征提取
模式识别(心理学)
图形
人工神经网络
数据挖掘
理论计算机科学
万维网
作者
Yu Zhang,Shuhui Bu,Boni Hu,Pengcheng Han,Leihua Weng,S. Xue
出处
期刊:IEEE Transactions on Geoscience and Remote Sensing
[Institute of Electrical and Electronics Engineers]
日期:2023-01-01
卷期号:61: 1-13
标识
DOI:10.1109/tgrs.2023.3293832
摘要
Large-scale visual geolocation is a meaningful task that involves locating a query image by comparing it with images in a database and predicting the most similar image. However, the widely-used training framework based on contrastive learning cannot fully utilize all data and is difficult to adapt to larger scales. At the same time, the traditional Convolutional Neural Networks (CNNs) and Vector of Locally Aggregated Descriptors (VLAD) using aggregated features cannot fully reflect the relationship between the local features of the image. Therefore, a Graph Neural Network (GNN) is designed as the feature extraction network, and then a training framework based on image classification is constructed. Specifically, a data grouping strategy and special loss function are designed for better training results. After training, we adopt an image retrieval strategy based on kNN for position. In addition, considering that existing datasets cannot be adapted to our requirements, two datasets are constructed for experiments, that one contains large-scale satellite images and the other fuses satellite and Unmanned Aerial Vehicle (UAV) images. Results demonstrate that our method outperforms other common methods on both the two datasets. The results demonstrate the effectiveness of our approach for UAV visual geolocation and provide ideas for future research in this field.
科研通智能强力驱动
Strongly Powered by AbleSci AI