点云
计算机科学
分割
人工智能
特征(语言学)
八叉树
棱锥(几何)
计算机视觉
模式识别(心理学)
数据挖掘
语言学
光学
物理
哲学
作者
Hao Bai,Xiongwei Li,Qing Meng,Shulong Zhuo,Lili Yan
出处
期刊:IEEE Access
[Institute of Electrical and Electronics Engineers]
日期:2025-01-01
卷期号:13: 9462-9472
被引量:3
标识
DOI:10.1109/access.2025.3528245
摘要
Intelligent analysis of 3D point clouds has become a frontier in emerging fields such as autonomous driving, digital twins, and the metaverse. Precise segmentation of 3D point clouds is particularly important within these domains; however, it faces several challenges: (1) point cloud data inherently lacks structured topological information; (2) point cloud shapes are complex and highly variable, making it difficult to utilize semantic priors; and (3) the sampling process of point clouds may result in sparse and uneven data. To address these issues, this paper proposes a novel Point Cloud Segmentation Network based on multi-scale feature fusion and Transformer architecture (MFFTNet). MFFTNet enhances the performance of existing segmentation methods by globally modeling the overall point cloud shape and embedding local point cloud details. Specifically, MFFTNet divides the segmentation task into encoding and decoding stages. The encoder is designed as a hierarchical pyramid structure that extracts relatively sparse local center points and fuses local features during progressive downsampling. It also utilizes a Transformer for global feature modeling to establish multi-scale topological and semantic information of the point cloud. Subsequently, multi-scale feature fusion further enhances the network’s perception of local features and global structure. The decoder progressively upsamples to restore the original point cloud and injects multi-scale feature information to achieve precise segmentation. Based on the aforementioned encoding-decoding structure and multi-scale feature fusion, MFFTNet outperforms existing methods on the point cloud semantic segmentation datasets ShapeNetPart and S3DIS.
科研通智能强力驱动
Strongly Powered by AbleSci AI