计算机科学
可解释性
人工智能
机器学习
多样性(控制论)
分割
钥匙(锁)
动作(物理)
软件部署
特征(语言学)
人机交互
语言学
哲学
物理
计算机安全
量子力学
操作系统
作者
Harshala Gammulle,David Ahmedt‐Aristizabal,Simon Denman,Lachlan Tychsen-Smith,Lars Petersson,Clinton Fookes
出处
期刊:ACM Computing Surveys
[Association for Computing Machinery]
日期:2023-03-14
卷期号:55 (13s): 1-38
被引量:19
摘要
With advances in data-driven machine learning research, a wide variety of prediction models have been proposed to capture spatio-temporal features for the analysis of video streams. Recognising actions and detecting action transitions within an input video are challenging but necessary tasks for applications that require real-time human-machine interaction. By reviewing a large body of recent related work in the literature, we thoroughly analyse, explain, and compare action segmentation methods and provide details on the feature extraction and learning strategies that are used on most state-of-the-art methods. We cover the impact of the performance of object detection and tracking techniques on human action segmentation methodologies. We investigate the application of such models to real-world scenarios and discuss several limitations and key research directions towards improving interpretability, generalisation, optimisation, and deployment.
科研通智能强力驱动
Strongly Powered by AbleSci AI