计算机科学
人工智能
计算机视觉
光学(聚焦)
会聚(光学)
块(置换群论)
人工神经网络
投影机
结构光
深度知觉
端到端原则
解码方法
立体视
内窥镜
感知
算法
光学
数学
医学
物理
几何学
放射科
神经科学
生物
作者
Chi‐Sheng Shih,Yu Shian Lin,Kai Ju Cheng,Chin Kang Chang
标识
DOI:10.1145/3555776.3577691
摘要
Three-dimensional endoscopes have been widely studied and developed for video-guided mini-invasive surgery to improve depth perception. Modern 3D endoscopes still cannot provide accurate depth readings. Moreover, the users may suffer from symptoms such as dizziness and nausea resulting from the vergence-accommodation conflict on stereo endoscopes. To resolve the aforementioned challenges, this work takes advantage of both structured light techniques and neural-network-based methods to reconstruct depth information on endoscopic images. The developed method includes SLResNet, a neural network model for end-to-end structure light pattern decoding, and a coordinate refinement algorithm. To focus on algorithm design, this work evaluated the algorithms on the projector-camera system. Using a metal gauge block as the targeted object, the maximum relative depth error is 0.396mm. This method can reconstruct at steepness up to 70 degrees stably. The errors in reconstructing the human upper jaw are less than 1mm in depth.
科研通智能强力驱动
Strongly Powered by AbleSci AI