计算机科学
人工智能
特征(语言学)
计算机视觉
RGB颜色模型
对象(语法)
融合
目标检测
模式识别(心理学)
哲学
语言学
作者
Li Zhu,Tuanjie Li,Yuming Ning,Yan Zhang
标识
DOI:10.1177/17298806241283373
摘要
Transparent objects are ubiquitous in everyday life, but how to detect them is full of challenges. Transparent objects hardly reflect light, and they usually transmit the appearance of their surroundings, making it difficult to distinguish them from their surroundings. Existing methods usually use only RGB (Red Green Blue) images as input, ignoring the role of depth maps in transparent object detection. In this article, we try to improve the detection performance of transparent objects by fusing RGB and depth information. Specifically, we propose a multimodal fusion network that fuses RGB and depth modalities in a complementary way. Moreover, extensive experiments and ablation studies on the RGB-D (RGB-Depth) transparent object dataset demonstrate the excellent performance of our method.
科研通智能强力驱动
Strongly Powered by AbleSci AI