计算机科学
卷积神经网络
人工智能
定位关键字
定位
词(群论)
代表(政治)
模式识别(心理学)
自然语言处理
相似性(几何)
性格(数学)
图像(数学)
情报检索
数学
法学
几何学
政治
政治学
作者
Hongxi Wei,Jing Zhang,Hui Zhang
标识
DOI:10.1109/ictai50040.2020.00071
摘要
Due to degradation of historical Mongolian documents, a task for retrieving them is challenging. In the field of document image retrieval, keyword spotting technology is an alternative when optical character recognition is infeasible. Representation of word images plays a very important role in keyword spotting. In this paper, various of convolutional neural networks have been used for representing word images of historical Mongolian documents. To be specific, activations of the fully-connected layer in convolutional neural network are extracted and taken as representation vectors of word images. And then, similarity can be calculated between their representation vectors of word images. Several classic structures of convolutional neural networks have been compared with each other and the best one has been determined. Furthermore, convolutional neural network has been also compared with several baselines and the state-of-the-art method on a dataset of historical Mongolian documents. Experimental results indicates that the performance of convolutional neural network is superior to these baseline and state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI