亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Enhancing temporal action localization in an end-to-end network through estimation error incorporation

端到端原则 动作(物理) 估计 计算机科学 人工智能 算法 工程类 物理 系统工程 量子力学
作者
M.E. Mokari,Khosrow Hajsadeghi
出处
期刊:Image and Vision Computing [Elsevier BV]
卷期号:145: 104994-104994
标识
DOI:10.1016/j.imavis.2024.104994
摘要

Temporal action localization presents a significant challenge in computer vision, as the development of an efficient method for this task remains elusive. The objective is to identify human activities within untrimmed videos, determining when and which actions occur in each video. While using trimmed videos could potentially resolve the localization problem and enhance classification accuracy, it is impractical for real-world applications as the trimming process itself requires human intervention. This highlights the importance of temporal localization. Due to the availability of several successful approaches for action recognition in trimmed video, conventional multi-stage methods for untrimmed video, commonly employ a network to generate activity proposals, followed by a separate network for classification. These disjoint networks are optimized individually and thus usually vary from the global optimum, leading to less precise candidate action proposals. To address this challenge, we propose a novel end-to-end neural network that utilizes error estimation for precise action localization and recognition in untrimmed videos. The proposed method performs the localization and classification of action instances simultaneously, thereby optimizing the corresponding networks concurrently. To increase the precision of the action proposal boundaries, the Regression module is innovatively utilized as part of the proposed end-to-end network, along with the Evaluation and Classification modules. This module estimates the potential error in proposal time boundaries and enhances the result accuracy. We have conducted experiments on THUMOS 14 and ActivityNet-1.3, which are considered the most challenging datasets for temporal action localization. The novel, yet fairly simple, proposed network achieves remarkable performance improvement compared to the other state-of-the-art methods. This improvement, which is more pronounced in the cases of high temporal intersection with ground truth, is accomplished without requiring extra data or complicated architecture. By incorporating error estimation, we achieved improvement in mean Average Precision (mAP). The proposed approach particularly shines for the localization of challenging activities in the complex and diverse dataset ActivityNet-1.3. For instance, for the "drinking coffee" activity, the mean Average Precision (mAP) was enhanced fivefold compared to the best-reported results.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
禄禄完成签到 ,获得积分10
3秒前
yy完成签到,获得积分10
6秒前
13秒前
15秒前
17秒前
kkkkkk完成签到,获得积分10
17秒前
现代海发布了新的文献求助10
19秒前
kkkkkk发布了新的文献求助10
21秒前
吴家良完成签到,获得积分10
26秒前
27秒前
希望天下0贩的0应助kkkkkk采纳,获得10
28秒前
二中所长完成签到,获得积分10
29秒前
30秒前
心行完成签到 ,获得积分10
31秒前
monk完成签到 ,获得积分10
33秒前
万能图书馆应助zzl采纳,获得10
35秒前
36秒前
Lyx122524完成签到,获得积分10
37秒前
37秒前
神勇马里奥完成签到 ,获得积分10
38秒前
平淡的书白完成签到,获得积分20
40秒前
lishuang发布了新的文献求助10
41秒前
银小鱼发布了新的文献求助10
41秒前
catherine完成签到,获得积分10
45秒前
MiRoRo完成签到 ,获得积分10
45秒前
颜林林完成签到,获得积分10
48秒前
CX_Liu完成签到,获得积分10
50秒前
海绵宝宝与派大星完成签到 ,获得积分10
53秒前
57秒前
57秒前
Ethan应助科研通管家采纳,获得10
58秒前
小蘑菇应助科研通管家采纳,获得10
58秒前
Akim应助科研通管家采纳,获得10
58秒前
思源应助科研通管家采纳,获得10
58秒前
Ethan应助科研通管家采纳,获得10
58秒前
嘻嘻哈哈应助科研通管家采纳,获得10
58秒前
桐桐应助科研通管家采纳,获得10
58秒前
Ethan应助科研通管家采纳,获得10
58秒前
obedVL完成签到,获得积分10
58秒前
晴朗泥泞完成签到,获得积分10
59秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Introduction to Helicopter and Tiltrotor Flight Simulation, Second Edition 2500
Developing Genetic Editing Tools for Lysobacter 2000
卤化钙钛矿人工突触的研究 2000
Моделирование процессов самоорганизации в кристаллообразующих системах 1000
History of U.S. Space Surveillance and Satellite Cataloging 1000
Malcolm Fraser : a biography 700
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6512036
求助须知:如何正确求助?哪些是违规求助? 8305486
关于积分的说明 17740957
捐赠科研通 5613548
什么是DOI,文献DOI怎么找? 2923593
邀请新用户注册赠送积分活动 1900830
关于科研通互助平台的介绍 1762526