TY - GEN
T1 - Multi-orientation scene text detection leveraging background suppression
AU - Wang, Xihan
AU - Feng, Xiaoyi
AU - Xia, Zhaoqiang
AU - Peng, Jinye
AU - Granger, Eric
N1 - Publisher Copyright:
© 2017, Springer International Publishing AG.
PY - 2017
Y1 - 2017
N2 - Most state-of-the-art text detection methods are devoted to horizontal texts and these methods cannot work well when encountering blurred, multi-oriented, low-resolution and small-sized texts. In this paper, we propose to localize texts from the perspective of suppressing more non-text backgrounds, in which a coarse-to-fine strategy is presented to remove non-text pixels from images. Firstly, the fully convolutional network (FCN) framework is utilized to make the coarse prediction of text labeling. Secondly, an efficient saliency measure based on background priors is employed to further suppress non-text pixels and generate fine character candidate regions. The remaining candidates of character regions composite text lines, so that the proposed method can handle multi-orientation texts in natural scene images. Two public datasets, MSRA-TD500 and ICDAR2013 are utilized to evaluate the performance of our proposed method. Experimental results show that our method achieves high recall rate and demonstrates the competitive performance.
AB - Most state-of-the-art text detection methods are devoted to horizontal texts and these methods cannot work well when encountering blurred, multi-oriented, low-resolution and small-sized texts. In this paper, we propose to localize texts from the perspective of suppressing more non-text backgrounds, in which a coarse-to-fine strategy is presented to remove non-text pixels from images. Firstly, the fully convolutional network (FCN) framework is utilized to make the coarse prediction of text labeling. Secondly, an efficient saliency measure based on background priors is employed to further suppress non-text pixels and generate fine character candidate regions. The remaining candidates of character regions composite text lines, so that the proposed method can handle multi-orientation texts in natural scene images. Two public datasets, MSRA-TD500 and ICDAR2013 are utilized to evaluate the performance of our proposed method. Experimental results show that our method achieves high recall rate and demonstrates the competitive performance.
KW - Background suppression
KW - Fully Convolutional Network
KW - Multi-orientation texts
KW - Scene text detection
UR - http://www.scopus.com/inward/record.url?scp=85040249159&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-71607-7_49
DO - 10.1007/978-3-319-71607-7_49
M3 - 会议稿件
AN - SCOPUS:85040249159
SN - 9783319716060
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 555
EP - 566
BT - Image and Graphics - 9th International Conference, ICIG 2017, Revised Selected Papers
A2 - Zhao, Yao
A2 - Taubman, David
A2 - Kong, Xiangwei
PB - Springer Verlag
T2 - 9th International Conference on Image and Graphics, ICIG 2017
Y2 - 13 September 2017 through 15 September 2017
ER -