计算机科学
分割
文本检测
过程(计算)
人工智能
图像分割
模式识别(心理学)
目标检测
数据挖掘
计算机视觉
图像(数学)
操作系统
作者
Chen Chen,Zheng Fang,Shaohui Zhang,Wei Li
标识
DOI:10.1109/cvidl58838.2023.10166555
摘要
Compared to regression based text detection methods, segmentation based methods can better handle irregularly shaped text regions and effectively reduce detection errors. They can flexibly cope with different text forms and better handle the problem of multilingual text detection. However, due to the limitations of segmentation algorithms themselves, segmentation based methods may result in significant errors in situations where the boundaries between text and non text regions are blurred. In this article, we propose a novel strategy to address a series of inaccuracies and errors that arise during the detection process. This method also simplifies some tedious operations during the detection process, thereby improving the overall performance of the model. We have conducted multiple experiments on multiple common benchmarks to verify the performance and effectiveness of our proposed model, which has reached the level of state-of-the-art methods in terms of accuracy and speed in detection results.
科研通智能强力驱动
Strongly Powered by AbleSci AI