Data Augmentation for Offline Arabic Handwritten Text Recognition Using Moving Least Squares

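Computer science, Handwriting, Artificial intelligence, Convolutional neural network, Task (project management), Deep learning, Handwriting recognition, Arabic, Generative grammar, Natural language processing, Artificial neural network, Speech recognition, Pattern recognition (psychology), Feature extraction, Linguistics, Philosophy, Economics, Management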
Authors
Mohamed Amine Chadli, Rochdi Bachir Bouiadjra, Abdelkader Fekir, Jesús Martínez-Gómez, José A. Gámez
Source
Journal: Revue d'intelligence artificielle [International Information and Engineering Technology Association]
Volume (issue): 38 (1): 1-9 Cited by: 2
Identifier
DOI: 10.18280/ria.380101
Abstract

This paper addresses the research problem of Offline Arabic Handwritten Text Recognition (HTR). One of the most important approaches to HTR systems is deep learning. A large amount of annotated data is needed to train deep learning-based HTR systems. The Arabic language is spoken by hundreds of millions of people in North Africa and the Middle East. Writing styles and common words differ significantly between those regions. Because of this diversity, building a statistically representative and balanced database of Arabic handwritten texts by gathering and labeling them is an arduous task. One way to enrich training databases is to augment the existing data. We have developed a new data augmentation technique for Arabic handwritten texts that uses Moving Least Squares (MLS) to deform the images. This technique produces realistic images that look like manipulated real-world images, and the deformations are computed with linear functions, so they can be produced in real time. We aim to deform the training images randomly in such a way that the text in the images is still recognizable by a human. This augmentation technique can be applied directly to images, unlike techniques such as Generative Adversarial Networks (GANs), which must be trained beforehand. At the same time, it produces more complex augmented images than simple traditional augmentation techniques such as rotations and translations. In addition to this augmentation technique, we used a deep learning system called a Convolutional Recurrent Neural Network (CRNN) to test the new technique, and we experimented with a CRNN model that accepts small input-size images to reduce the time needed for both training and image augmentation. All experiments are carried out on the Arabic IFN/ENIT database. The results show that the small input-size CRNN model outperforms the large input-size CRNN model by a large margin. The results also show that integrating images augmented by the MLS technique can help the recognition system generalize better on the test data and can therefore slightly improve its performance.
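
To make the MLS idea in the abstract concrete, here is a minimal sketch of a random MLS warp. It is an illustration, not the authors' implementation. The abstract does not say which MLS variant, handle layout, or shift range the paper uses, so the sketch assumes the affine MLS formulation commonly used for image deformation, a regular grid of control handles, and a small uniform random shift per handle. The names `mls_affine_point` and `random_mls_warp` and the parameters `rows`, `cols`, `max_shift`, and `seed` are hypothetical. Plain Python with nearest-neighbour sampling is used for clarity; a real pipeline would likely vectorize this and interpolate.

```python
import random


def mls_affine_point(v, p, q, alpha=1.0, eps=1e-8):
    """Map one point v = (x, y) with the affine MLS deformation that moves handles p toward q.

    Needs at least three non-collinear handles in p, otherwise the 2x2 system is singular.
    """
    # Weights w_i = 1 / |p_i - v|^(2 * alpha); eps avoids division by zero at a handle.
    w = [1.0 / (((px - v[0]) ** 2 + (py - v[1]) ** 2) ** alpha + eps) for px, py in p]
    ws = sum(w)
    # Weighted centroids p* and q*.
    psx = sum(wi * pt[0] for wi, pt in zip(w, p)) / ws
    psy = sum(wi * pt[1] for wi, pt in zip(w, p)) / ws
    qsx = sum(wi * pt[0] for wi, pt in zip(w, q)) / ws
    qsy = sum(wi * pt[1] for wi, pt in zip(w, q)) / ws
    # A = sum_i w_i * p_hat_i^T p_hat_i (symmetric), B = sum_i w_i * p_hat_i^T q_hat_i.
    a00 = a01 = a11 = 0.0
    b00 = b01 = b10 = b11 = 0.0
    for wi, (px, py), (qx, qy) in zip(w, p, q):
        ux, uy = px - psx, py - psy
        tx, ty = qx - qsx, qy - qsy
        a00 += wi * ux * ux
        a01 += wi * ux * uy
        a11 += wi * uy * uy
        b00 += wi * ux * tx
        b01 += wi * ux * ty
        b10 += wi * uy * tx
        b11 += wi * uy * ty
    det = a00 * a11 - a01 * a01
    # M = A^{-1} B, using the closed-form inverse of a symmetric 2x2 matrix.
    m00 = (a11 * b00 - a01 * b10) / det
    m01 = (a11 * b01 - a01 * b11) / det
    m10 = (a00 * b10 - a01 * b00) / det
    m11 = (a00 * b11 - a01 * b01) / det
    # f(v) = (v - p*) M + q*
    dx, dy = v[0] - psx, v[1] - psy
    return (dx * m00 + dy * m10 + qsx, dx * m01 + dy * m11 + qsy)


def random_mls_warp(img, rows=3, cols=5, max_shift=1.0, seed=None):
    """Warp a 2D image (list of pixel rows) by randomly shifting a grid of MLS handles."""
    rng = random.Random(seed)
    h, w = len(img), len(img[0])
    # Assumed handle layout: a regular rows x cols grid spanning the image.
    p = [(c * (w - 1) / (cols - 1), r * (h - 1) / (rows - 1))
         for r in range(rows) for c in range(cols)]
    # Assumed perturbation: independent uniform shifts of at most max_shift pixels.
    q = [(x + rng.uniform(-max_shift, max_shift), y + rng.uniform(-max_shift, max_shift))
         for x, y in p]
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            # Backward mapping: swap q and p to find the source of each output pixel.
            sx, sy = mls_affine_point((x, y), q, p)
            sx = min(max(int(round(sx)), 0), w - 1)
            sy = min(max(int(round(sy)), 0), h - 1)
            row.append(img[sy][sx])  # nearest-neighbour sampling
        out.append(row)
    return out
```

Calling `random_mls_warp(img, seed=0)` on a grayscale word image stored as a list of rows returns a deformed copy of the same size, and different seeds give different deformations. No training step is involved, which matches the abstract's contrast with GAN-based augmentation. The shift range would need tuning so that the text stays legible to a human, as the abstract requires.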
