计算机科学
粒度
变压器
工作流程
人工智能
模式识别(心理学)
数据挖掘
电压
工程类
数据库
电气工程
操作系统
作者
Huabin Chen,Zhen Li,Pan Fu,Zhen-Liang Ni,Gui‐Bin Bian
标识
DOI:10.1109/embc48229.2022.9871004
摘要
Automatic surgical phase recognition plays a key role in surgical workflow analysis and overall optimization in clinical work. In the complicated surgical procedures, similar inter-class appearance and drastic variability in phase duration make this still a challenging task. In this paper, a spatio-temporal transformer is proposed for online surgical phase recognition with different granularity. To extract rich spatial information, a spatial transformer is used to model global spatial dependencies of each time index. To overcome the variability in phase duration, a temporal transformer captures the multi-scale temporal context of different time indexes with a dual pyramid pattern. Our method is thoroughly validated on the public Cholec80 dataset with 7 coarse-grained phases and the CATARACTS2020 dataset with 19 fine-grained phases, outperforming state-of-the-art approaches with 91.4% and 84.2% accuracy, taking only 24.5M parameters.
科研通智能强力驱动
Strongly Powered by AbleSci AI