计算机科学
人工智能
计算机视觉
遥感
模式识别(心理学)
地质学
作者
Han Yue,Ville Lehtola,Hangbin Wu,George Vosselman,Jincheng Li,Chun Liu
标识
DOI:10.1109/tgrs.2024.3387556
摘要
Scene recognition is a fundamental task in 3D scene understanding. It answers the question 'What is this place?'. In an indoor environment, the answer can be an office, kitchen, lobby, and so on. As the number of point clouds increases, using embedded point information in scene recognition becomes computationally heavy to process. To achieve computational efficiency and accurate classification, our idea is to use indoor scene graph that represents the 3D spatial structures via object instances. The proposed method comprises two parts, namely, (i) construction of indoor scene graphs leveraging object instances and their spatial relationships and (ii) classification of these graphs using a deep learning network. Specifically, each indoor scene is represented by a graph, where each node represents either a structural element (like a ceiling, a wall, or a floor) or a piece of furniture (like a chair or a table) and each edge encodes the spatial relationship between these elements. Then these graphs are used as input for our proposed graph classification network to learn different scene representations. The public indoor dataset, ScanNet v2, with 625.53 million points is selected to test our method. Experiments yield good results with up to 88.00% accuracy and 82.30% F1-score in the fixed validation dataset, and 90.46% accuracy and 81.45% F1-score in 10-fold cross validation method. Moreover, if some indoor objects can't be successfully identified, the scene classification accuracy depends sub-linearly on the rate of missing objects in the scene.
科研通智能强力驱动
Strongly Powered by AbleSci AI