自动对焦
计算机科学
光学(聚焦)
人工智能
计算机视觉
特征(语言学)
核(代数)
单眼
推论
匹配(统计)
立体视觉
机器视觉
仿真
钥匙(锁)
编码
机器人
重射误差
特征提取
作者
Pan Fu,Zhen Li,Mingyang Zhang,Yu-Peng Zhai,Junzheng Wang,Wenhao He,Gui-Bin Bian
标识
DOI:10.1109/iros60139.2025.11246648
摘要
Creating an intelligent surgical environment requires not only advanced robotic systems but also optimized microscopic imaging. However, autofocus remains a fundamental challenge, with current methods suffering from slow iterative processes or directional ambiguity, which compromises real-time performance. This paper presents an implicit disparity-blur alignment approach for robotic microsurgical autofocus, integrating stereo geometry’s monotonic depth cues with de-focus characteristics for rapid convergence. A novel physics-guided dual-stream network is developed to encode implicit depth representations through hierarchical cross-pathway feature fusion, enabling reliable focus prediction without explicit stereo matching in blur-degraded regions. An ROI-aware attention module is proposed to dynamically optimize focus-critical regions, coupled with learnable physics-guided kernel learning for precise Z-offset estimation. The approach achieves a top directional accuracy of 94.85% and a single-pass focus error of 0.20 mm with an inference time of 53 ms on a surgical dataset, which outperforms state-of-the-art methods in reducing iteration count by 22.8% and inference time by 51.8%. An intelligent robotic microscope prototype is developed, with validation through ex vivo tests demonstrating its ability to enable fast and precise multi-region focusing for microsurgeries.
科研通智能强力驱动
Strongly Powered by AbleSci AI