计算机科学
定位
人工智能
定位关键字
词(群论)
笔迹
匹配(统计)
水准点(测量)
模式识别(心理学)
特征(语言学)
查询扩展
手写体识别
分割
语音识别
自然语言处理
特征提取
情报检索
数学
语言学
统计
哲学
大地测量学
地理
几何学
作者
Ekta Vats,Anders Hast,Alícia Fornés
标识
DOI:10.1109/icdar.2019.00209
摘要
Historical handwritten text recognition is an interesting yet challenging problem. In recent times, deep learning based methods have achieved significant performance in handwritten text recognition. However, handwriting recognition using deep learning needs training data, and often, text must be previously segmented into lines (or even words). These limitations constrain the application of HTR techniques in document collections, because training data or segmented words are not always available. Therefore, this paper proposes a training-free and segmentation-free word spotting approach that can be applied in unconstrained scenarios. The proposed word spotting framework is based on document query word expansion and relaxed feature matching algorithm, which can easily be parallelised. Since handwritten words posses distinct shape and characteristics, this work uses a combination of different keypoint detectors and Fourier-based descriptors to obtain a sufficient degree of relaxed matching. The effectiveness of the proposed method is empirically evaluated on well-known benchmark datasets using standard evaluation measures. The use of informative features along with query expansion significantly contributed in efficient performance of the proposed method.
科研通智能强力驱动
Strongly Powered by AbleSci AI