计算机科学
钥匙(锁)
相似性(几何)
情报检索
信息抽取
萃取(化学)
人工智能
数据挖掘
图像(数学)
计算机安全
色谱法
化学
作者
Maosheng Zhu,Ruijie Ni
标识
DOI:10.1109/icicml60161.2023.10424746
摘要
Image key information extraction is an important technique in automatic digital image recognizers. To deal with complex image layout, vague image semantics and image defects, we propose a layout similarity-based model to extract image key information under certain image distortion situations. Our model uses perceptual hash algorithm (PHA) to detect visually similar images and uses a template matching based algorithm to solve distortion problems when PHA cannot find a match. The framework is tested on a dataset with more than 17,000 images and we demonstrate that our model can reach 99.5% accuracy on the original dataset, significantly outperforming Chargrid model and a parity-based model. We also compare the performance of different models on images with artificially overlaid boxes, which is commonly seen on screenshots. Results show that our model still performs much better than the parity-based model.
科研通智能强力驱动
Strongly Powered by AbleSci AI