SLR-YOLO: An improved YOLOv8 network for real-time sign language recognition

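Computer science, Sign language, Artificial intelligence, Generalization, Feature (linguistics), Clutter, Pattern recognition (psychology), Computer vision, Radar, Mathematical analysis, Telecommunications, Philosophy, Linguistics, Mathematics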
Authors
Wei Jia,Changyong Li
Source
Journal: Journal of Intelligent and Fuzzy Systems [IOS Press]
Volume/Issue: : 1-18
Identifier
DOI:10.3233/jifs-235132
Abstract

This study proposes a computer-vision method to help people with varying degrees of hearing impairment integrate into society more easily and to make human-to-human and human-to-robot sign language interaction more convenient. Traditional sign language recognition methods struggle to achieve good results in scenes with backgrounds close to skin color, background clutter, or partial occlusion. To achieve faster real-time display, we compare standard single-target recognition algorithms, select the best-performing one, YOLOv8, and build on it to propose SLR-YOLO, a lighter and more accurate network model. First, the SPPF module in the backbone network is replaced with an RFB module to enhance the network's feature extraction capability. Second, BiFPN is used in the neck to enhance feature fusion, and the Ghost module is added to make the network lighter. Finally, to introduce partial occlusion during training and improve generalization, three data augmentation methods (Mixup, Random Erasing, and Cutout) are compared, and Cutout is selected. The improved SLR-YOLO model reaches accuracies of 90.6% and 98.5% on the validation sets of the American Sign Language Letters Dataset and the Bengali Sign Language Alphabet Dataset, respectively. Compared with the original YOLOv8, accuracy on both datasets improves by 1.3 percentage points, the number of parameters is reduced by 11.31%, and FLOPs are reduced by 11.58%.
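
The abstract names Cutout as the augmentation chosen to simulate partial occlusion during training. As an illustration only, the sketch below shows the general Cutout idea in NumPy: one square patch at a random position is overwritten with a constant value. The function name `cutout`, the `patch_size` and `fill_value` parameters, and the choice of a single zero-filled patch are assumptions made for this example; the abstract does not state the settings the authors used, and this is not their implementation.

```python
import numpy as np


def cutout(image, patch_size, rng=None, fill_value=0):
    """Return a copy of `image` (H x W or H x W x C) with one square patch masked.

    Hypothetical helper for illustration; the patch size and fill value are
    assumptions, not values taken from the paper.
    """
    rng = np.random.default_rng() if rng is None else rng
    height, width = image.shape[:2]

    # Choose the patch centre uniformly over the image. The patch may extend
    # past the border, in which case it is clipped to the image bounds.
    cy = int(rng.integers(0, height))
    cx = int(rng.integers(0, width))
    half = patch_size // 2

    y0 = max(cy - half, 0)
    y1 = min(cy - half + patch_size, height)
    x0 = max(cx - half, 0)
    x1 = min(cx - half + patch_size, width)

    out = image.copy()  # leave the caller's array untouched
    out[y0:y1, x0:x1] = fill_value
    return out


# Example: mask a 16 x 16 region of a dummy 64 x 64 RGB image.
dummy = np.full((64, 64, 3), 255, dtype=np.uint8)
augmented = cutout(dummy, patch_size=16, rng=np.random.default_rng(0))
```

In a detection pipeline such a transform would typically be applied per training image before batching; whether bounding-box labels need adjusting when a patch covers part of a hand is a design choice the abstract does not address.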