Lightweight and computationally faster Hypermetropic Convolutional Neural Network for small size object detection

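Convolutional neural network, Computer science, Object detection, Object (grammar), Artificial intelligence, Deep learning, Artificial neural network, Cognitive neuroscience of visual object recognition, Pattern recognition (psychology), Computer vision
Authors
Amudhan A.N., Sudheer A.P.
Source
Journal: Image and Vision Computing [Elsevier BV]
Volume/Issue: 119: 104396-104396; Cited by: 31
Identifier
DOI:10.1016/j.imavis.2022.104396
Abstract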

Object detection has been an active area of research over the past two decades. Detecting an object becomes harder as object speed increases and object size decreases. Such scenarios arise in sports video analysis, robot vision systems, driverless cars and many other applications, which creates the need for an efficient neural network that can detect small objects. Further, most real-time applications run on single-board computers such as the Jetson Nano, TX2, Xavier and Raspberry Pi. State-of-the-art deep learning models such as YOLOv4, YOLOv3, YOLOR, YOLOX and SSD show poor run-time performance on these devices. Their lighter versions, YOLOv3-tiny, YOLOv4-tiny and YOLOX-nano, run at nearly 24 frames per second (fps) on Jetson Nano; however, their detection accuracy on small objects is unsatisfactory. This paper focuses on developing a computationally lighter Convolutional Neural Network (CNN) to detect small objects efficiently. A novel hypermetropic CNN was developed to meet these requirements. Detection is improved by extracting more features from the shallow layers and transferring low-level features to the deeper layers. The network is called hypermetropic because it performs well on distant objects and lags on nearby objects. The proposed model's performance is compared with the state-of-the-art models on various public datasets, namely the VEDAI dataset, the Visdrone dataset, and a few classes from the MS COCO and OID datasets. The proposed model shows impressive improvements in detecting small objects, and a 32% increase in fps is observed on Jetson Nano.

• A novel CNN architecture to detect small-sized objects is proposed.
• Validation is carried out on various public datasets.
• Results show impressive improvements in detection accuracy and real-time performance.
• It is lighter and smaller than the state-of-the-art models and requires less training time.
• It is suitable for use in any single-board computer and platforms devoid of GPUs.
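
The abstract states that low-level features from shallow layers are transferred to deeper layers, but it does not say how. One common mechanism, assumed here purely for illustration, is a skip connection: the shallow feature map is downsampled to the deeper map's resolution and concatenated with it along the channel axis. The sketch below shows that idea in plain Python on toy nested lists. The function names, the 2x2 max pooling, and the input values are all hypothetical and are not taken from the paper.

```python
# Illustrative sketch only: NOT the architecture described in the paper.
# Feature maps are plain nested lists indexed as [channel][row][col].

def max_pool_2x2(channel):
    """Downsample one 2D channel with non-overlapping 2x2 max pooling."""
    rows, cols = len(channel), len(channel[0])
    return [
        [
            max(channel[r][c], channel[r][c + 1],
                channel[r + 1][c], channel[r + 1][c + 1])
            for c in range(0, cols - 1, 2)
        ]
        for r in range(0, rows - 1, 2)
    ]


def fuse_shallow_into_deep(shallow, deep):
    """Pool shallow channels to the deep map's size, then concatenate channels."""
    pooled = [max_pool_2x2(ch) for ch in shallow]
    pooled_size = (len(pooled[0]), len(pooled[0][0]))
    deep_size = (len(deep[0]), len(deep[0][0]))
    if pooled_size != deep_size:
        raise ValueError("spatial sizes must match after pooling")
    return pooled + deep  # channel-wise concatenation (assumed fusion rule)


# Hypothetical toy inputs: one 4x4 shallow channel and one 2x2 deep channel.
shallow = [[[1, 2, 0, 0],
            [3, 4, 0, 5],
            [0, 0, 7, 0],
            [6, 0, 0, 8]]]
deep = [[[9, 9],
         [9, 9]]]

fused = fuse_shallow_into_deep(shallow, deep)
print(len(fused), fused[0])  # prints: 2 [[4, 5], [6, 8]]
```

In this toy case the fused map has two channels: the pooled shallow channel, which keeps the strongest low-level response in each 2x2 window, followed by the unchanged deep channel. A real detector would use learned convolutions and a tensor library rather than nested lists.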
