Instrument-tissue Interaction Detection Framework for Surgical Video Understanding

计算机科学 代码段 最小边界框 帧(网络) 跳跃式监视 背景(考古学) 特征(语言学) 人工智能 班级(哲学) 计算机视觉 利用 人机交互 任务(项目管理) 特征提取 情报检索 图像(数学) 哲学 古生物学 经济 生物 管理 电信 语言学 计算机安全
作者
Wenjun Lin,Yan Hu,Huazhu Fu,Mingming Yang,Chin-Boon Chng,Ryo Kawasaki,Chee‐Kong Chui,Jiang Liu
出处
期刊:IEEE Transactions on Medical Imaging [Institute of Electrical and Electronics Engineers]
卷期号:43 (8): 2803-2813
标识
DOI:10.1109/tmi.2024.3381209
摘要

Instrument-tissue interaction detection task, which helps understand surgical activities, is vital for constructing computer-assisted surgery systems but with many challenges. Firstly, most models represent instrument-tissue interaction in a coarse-grained way which only focuses on classification and lacks the ability to automatically detect instruments and tissues. Secondly, existing works do not fully consider relations between intra-and inter-frame of instruments and tissues. In the paper, we propose to represent instrument-tissue interaction as ⟨instrument class, instrument bounding box, tissue class, tissue bounding box, action class⟩ quintuple and present an Instrument-Tissue Interaction Detection Network (ITIDNet) to detect the quintuple for surgery videos understanding. Specifically, we propose a Snippet Consecutive Feature (SCF) Layer to enhance features by modeling relationships of proposals in the current frame using global context information in the video snippet. We also propose a Spatial Corresponding Attention (SCA) Layer to incorporate features of proposals between adjacent frames through spatial encoding. To reason relationships between instruments and tissues, a Temporal Graph (TG) Layer is proposed with intra-frame connections to exploit relationships between instruments and tissues in the same frame and inter-frame connections to model the temporal information for the same instance. For evaluation, we build a cataract surgery video (PhacoQ) dataset and a cholecystectomy surgery video (CholecQ) dataset. Experimental results demonstrate the promising performance of our model, which outperforms other state-of-the-art models on both datasets.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
zyl发布了新的文献求助10
刚刚
刚刚
1秒前
OKAY完成签到,获得积分10
1秒前
3秒前
4秒前
要减肥白桃完成签到,获得积分10
4秒前
云峰发布了新的文献求助10
4秒前
一只小鸮发布了新的文献求助10
5秒前
OKAY发布了新的文献求助10
5秒前
汉堡包应助susu采纳,获得10
6秒前
万能图书馆应助xhstky采纳,获得10
6秒前
大力的灵雁应助zsl采纳,获得10
8秒前
lishihao发布了新的文献求助10
8秒前
我是老大应助能干的访梦采纳,获得10
11秒前
Disguise发布了新的文献求助10
12秒前
思源应助lialia采纳,获得10
13秒前
15秒前
Anna完成签到,获得积分10
15秒前
笑一笑完成签到,获得积分10
15秒前
风中樱桃完成签到,获得积分10
15秒前
欧米伽发布了新的文献求助10
16秒前
木槿完成签到,获得积分10
16秒前
16秒前
tangyuan完成签到 ,获得积分10
18秒前
20秒前
20秒前
21秒前
hutingting完成签到 ,获得积分10
21秒前
yyyyyy发布了新的文献求助10
22秒前
lishihao完成签到,获得积分10
23秒前
晴天发布了新的文献求助10
23秒前
空空1213发布了新的文献求助200
24秒前
西一兮发布了新的文献求助10
25秒前
科研通AI6.1应助自然白安采纳,获得10
26秒前
11发布了新的文献求助10
26秒前
26秒前
小黎给小黎的求助进行了留言
28秒前
寻度完成签到 ,获得积分10
28秒前
高分求助中
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
天津市智库成果选编 600
Climate change and sports: Statistics report on climate change and sports 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
Organic Reactions Volume 118 400
A Foreign Missionary on the Long March: The Unpublished Memoirs of Arnolis Hayman of the China Inland Mission 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6466799
求助须知:如何正确求助?哪些是违规求助? 8273127
关于积分的说明 17639885
捐赠科研通 5541883
什么是DOI,文献DOI怎么找? 2908026
邀请新用户注册赠送积分活动 1884980
关于科研通互助平台的介绍 1733225