Deep-learning generation of POI data with scene images

计算机科学 人工智能 分割 兴趣点 深度学习 感兴趣区域 情态动词 模式识别(心理学) 计算机视觉 化学 高分子化学
作者
Jinbao Zhang,Xiaojuan Liu,Weilin Liao,Xia Li
出处
期刊:Isprs Journal of Photogrammetry and Remote Sensing 卷期号:188: 201-219 被引量:10
标识
DOI:10.1016/j.isprsjprs.2022.04.004
摘要

Point of interest (POI) is essential to urban scene understanding and location-based services. However, most of the POI data sets are collected manually on the spot, which is time-consuming and laborious. In this study, we propose a deep learning-based three-stage framework to automatically generate POI data sets from scene images by integrating instance segmentation, scene text recognition (STR), and multimodal technology. Firstly, we utilize an instance segmentation model to extract the region of interest (ROI) that contains POI text information from the scene images. Secondly, a STR method is used to locate and identify the text lines from the ROI. Thirdly, we develop a novel visual-linguistic multi-task classification model (VLMC) to classify ROIs and text lines through fusing text and image information. It is the first deep learning-based framework that allows generating POI information with different attributes (such as title, address, and tag) from the text lines of scene images and updating with high-performance models in the three-stage technique. In the experiments, we employ multiple STR data sets and annotated street view images for model training. The result shows that the deep learning-based framework can generate POI records from scene images with high accuracy (F1-score = 52.62%). Moreover, we find that the multi-modal VLMC model integrating the linguistic and visual embeddings has a higher accuracy in POI-generation than single-modal methods. We further use a trained framework to generate POI from Baidu Street View (BSV) images and Tencent Street View (TSV) images in Shenzhen, China, and ultimately obtain a long-term POI data set during 2013 – 2020 with 2,699,895 street view images. Of 815,616 records in the generated POI data set in 2020, 70.94% are covered by the existing Baidu POI data set of Shenzhen in 2013. This confirms the validity of the newly generated POI data set. These results demonstrate that the proposed deep-learning POI-generation framework and dataset can provide new insights for geographic data updating and urban scene understanding for fast growing cities. To facilitate future research, an implementation is made available at https://github.com/KampauCheung/scene-image-poi-generation.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
sss发布了新的文献求助10
刚刚
binban128完成签到,获得积分10
刚刚
柏林寒冬应助Lion采纳,获得10
1秒前
Kkxx完成签到 ,获得积分10
2秒前
3秒前
3秒前
3秒前
米线完成签到,获得积分10
4秒前
哦哦哦完成签到,获得积分10
4秒前
火星上凌雪完成签到 ,获得积分10
4秒前
5秒前
6秒前
WWW发布了新的文献求助10
6秒前
zzz完成签到,获得积分10
6秒前
Moonpie应助wonder采纳,获得10
7秒前
8秒前
科研通AI2S应助高高碧玉采纳,获得10
8秒前
博士伦666发布了新的文献求助10
9秒前
Aline发布了新的文献求助10
9秒前
9秒前
yourenpkma123完成签到,获得积分10
10秒前
10秒前
10秒前
赘婿应助sss采纳,获得10
10秒前
四姑娘发布了新的文献求助10
11秒前
11秒前
ZhouZhoukkk完成签到,获得积分10
11秒前
蓝天发布了新的文献求助10
11秒前
66发布了新的文献求助10
12秒前
舜瞬应助SHIT采纳,获得10
12秒前
butterflycat完成签到,获得积分10
13秒前
Lion完成签到,获得积分20
13秒前
good慧发布了新的文献求助30
13秒前
傲娇蜻蜓完成签到,获得积分10
14秒前
15秒前
852应助mmm采纳,获得10
15秒前
17秒前
小羊打嗝发布了新的文献求助10
17秒前
18秒前
Robin发布了新的文献求助10
20秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Les Mantodea de Guyane Insecta, Polyneoptera 2000
Emmy Noether's Wonderful Theorem 1200
Leading Academic-Practice Partnerships in Nursing and Healthcare: A Paradigm for Change 800
基于非线性光纤环形镜的全保偏锁模激光器研究-上海科技大学 800
Signals, Systems, and Signal Processing 610
Research Methods for Business: A Skill Building Approach, 9th Edition 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6411049
求助须知:如何正确求助?哪些是违规求助? 8230264
关于积分的说明 17465501
捐赠科研通 5463990
什么是DOI,文献DOI怎么找? 2887100
邀请新用户注册赠送积分活动 1863669
关于科研通互助平台的介绍 1702596