Data Augmentation for Offline Arabic Handwritten Text Recognition Using Moving Least Squares

计算机科学 笔迹 人工智能 卷积神经网络 任务(项目管理) 深度学习 手写体识别 阿拉伯语 生成语法 自然语言处理 人工神经网络 语音识别 模式识别(心理学) 特征提取 语言学 哲学 经济 管理
作者
Mohamed Amine Chadli,Rochdi Bachir Bouiadjra,Abdelkader Fekir,Jesús Martínez-Gómez,José A. Gámez
出处
期刊:Revue d'intelligence artificielle [International Information and Engineering Technology Association]
卷期号:38 (1): 1-9 被引量:2
标识
DOI:10.18280/ria.380101
摘要

This paper addresses the research problem of Offline Arabic Handwriting Text Recognition (HTR).One of the most important approaches to HTR systems is deep learning.A large amount of annotated data is needed to train deep learning-based HTR systems.The Arabic language is spoken by hundreds of millions of people in North Africa and the Middle East.Writing styles and common words differ significantly between those regions.Due to the great diversity possible, designing a statistically represented and balanced database of Arabic handwritten texts by gathering and labeling the texts is an arduous task to achieve.One of the ways to enrich the training databases is by augmenting the existing data.We have developed a new data augmentation technique for Arabic handwritten texts using Moving Least Squares (MLS) to deform the images.This technique results in realistic images that look like manipulating real-world images, and the deformations are done using linear functions that produce deformations in real time.We aim to deform the training data images randomly in a way that the text present in the images is still recognizable by a human.This augmentation technique can be used directly on images to augment them unlike other techniques such as Generative Adversarial Networks (GAN) where they must be trained beforehand.At the same time, it produces new complex augmented images compared to simple traditional augmentation techniques such as rotations and translations.In addition to this augmentation technique, we used a deep learning system called Convolutional Recurrent Neural Networks (CRNN) to test the new technique, and we have experimented with a CRNN model that accepts small input-size images to boost the time needed for both training and image augmentations.All the experimentations are carried out on the Arabic IFN/ENIT database.The results show that the small input size CRNN model outperforms the large input size CRNN model by a big margin.The results also show that the integration of images augmented by the MLS technique can help the recognition system to generalize better on the test data, therefore, it can slightly improve the performance of the recognition system.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
思源应助爱听歌的胡萝卜采纳,获得10
刚刚
刚刚
刚刚
1秒前
YIQISUDA发布了新的文献求助10
1秒前
舒心的秋荷完成签到,获得积分10
2秒前
充电宝应助秦大帅采纳,获得10
2秒前
Jasper应助结实又晴采纳,获得10
2秒前
田様应助激动的尔烟采纳,获得10
3秒前
3秒前
追寻的珠完成签到,获得积分10
3秒前
hehehe完成签到,获得积分10
3秒前
折耳根拌香菜完成签到,获得积分10
4秒前
墨辰完成签到,获得积分10
4秒前
banqia发布了新的文献求助10
4秒前
小狒狒发布了新的文献求助10
4秒前
喜羊羊发布了新的文献求助10
4秒前
orixero应助柒tt采纳,获得10
4秒前
小鱼关注了科研通微信公众号
4秒前
晶aaaaa完成签到 ,获得积分10
5秒前
5秒前
liuyaohan0726完成签到,获得积分10
5秒前
5秒前
6秒前
爱吃香菜发布了新的文献求助10
7秒前
8秒前
娜娜家的大宝贝完成签到,获得积分10
8秒前
彭于晏应助zwk66637采纳,获得10
8秒前
9秒前
9秒前
Jacky77发布了新的文献求助10
9秒前
无极微光应助xuan采纳,获得20
10秒前
cptbtptp发布了新的文献求助20
10秒前
10秒前
11秒前
三七完成签到,获得积分10
11秒前
gg发布了新的文献求助10
11秒前
11秒前
言辞完成签到,获得积分10
12秒前
大模型应助谷雨采纳,获得10
12秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Lewis’s Child and Adolescent Psychiatry: A Comprehensive Textbook Sixth Edition 2000
Cronologia da história de Macau 1600
Continuing Syntax 1000
Encyclopedia of Quaternary Science Reference Work • Third edition • 2025 800
Signals, Systems, and Signal Processing 510
Pharma R&D Annual Review 2026 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6214463
求助须知:如何正确求助?哪些是违规求助? 8039953
关于积分的说明 16755030
捐赠科研通 5302723
什么是DOI,文献DOI怎么找? 2825123
邀请新用户注册赠送积分活动 1803533
关于科研通互助平台的介绍 1663987