计算机科学
代表(政治)
人工智能
光学(聚焦)
观点
特征学习
模式识别(心理学)
可视化
特征(语言学)
领域(数学)
数学
艺术
语言学
哲学
物理
光学
政治
政治学
纯数学
法学
视觉艺术
作者
Hwanjong Song,Zhen Wang,Yi Lei,Dianxi Shi,Xiaochong Tong,Lei Yuan,Chunping Qiu
标识
DOI:10.1109/lgrs.2023.3326005
摘要
Cross-view geo-location is a crucial research field that determines the geographic location from images taken from different viewpoints. It is often studied as a retrieval task, where the query images are with unknown locations, and the database includes images with geo-tags from a different platform. Learning image representations by neural networks is an important step, and one typical training method is using a classification loss, where cross-view images of the same locations are considered the same category. However, existing methods only focus on pushing the representation distances of different categories while ignoring the intra-category representation distances of samples from different platforms. Considering that controlling the intra-category distance can help to guide the model to extract compact category-sharing representations from cross-view images, we propose a categorized cluster loss to learn separate and compact representation clusters. Categorized cluster loss can supervise the network to learn invariant information from samples of different platforms by constraining both the inter-category and intra-category feature distances. Meanwhile, we design a category-view-stratified sampling strategy, which samples balanced inputs in terms of both category and view in each batch during the learning process. We implemented our approach with a lightweight OSNet-based network and achieved higher accuracy with fewer parameters on a typical and challenging cross-view geo-location dataset than most state-of-the-art (SOTA) methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI