轮廓仪
变压器
计算机科学
高动态范围
投影(关系代数)
航程(航空)
人工智能
计算机视觉
动态范围
材料科学
算法
电气工程
工程类
电压
复合材料
表面粗糙度
作者
Jianbin Cao,Xingzhao Wang,Xu Zhang,Dawei Tu
标识
DOI:10.1088/1361-6501/ae00e3
摘要
Abstract 3D reconstruction of high-dynamic-range (HDR) scenes using a minimal number of images is a critical challenge for fringe projection profilometry. Deep learning (DL) has demonstrated significant potential, but previous approaches have never considered a balance between accuracy and speed, resulting in a slow inference latency for networks. A lightweight U-shaped transformer-CNN architecture is proposed that integrates dual attention mechanisms (channel and pixel attention) which enable the network to focus more attention on HDR regions. In addition, an efficient transformer (ET) layer establishes a longer dependence that enables compensation for the HDR regions by utilizing their surrounding pixels’ phase information. To address the data scarcity challenge, the network is first pre-trained on a synthetic dataset and subsequently fine-tuned using a real-world dataset that contains 720 HDR scenes. Experiments have proven that the proposed network achieves a phase error as low as 5.32 × 10 − 4 rad with it only containing 0.55 million parameters and a real-time processing speed of 16.56 frames per second (FPS) on an RTX 4060 GPU. Compared with other networks, the proposed network provides an effective phase recovery performance with faster inference speed for HDR scene reconstruction, which is crucial for actual industry applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI