Multi-orientation scene text detection leveraging background suppression

Xihan Wang; Xiaoyi Feng; Zhaoqiang Xia; Jinye Peng; Eric Granger

doi:10.1007/978-3-319-71607-7_49

School of Electronics and Information

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Most state-of-the-art text detection methods are devoted to horizontal texts and these methods cannot work well when encountering blurred, multi-oriented, low-resolution and small-sized texts. In this paper, we propose to localize texts from the perspective of suppressing more non-text backgrounds, in which a coarse-to-fine strategy is presented to remove non-text pixels from images. Firstly, the fully convolutional network (FCN) framework is utilized to make the coarse prediction of text labeling. Secondly, an efficient saliency measure based on background priors is employed to further suppress non-text pixels and generate fine character candidate regions. The remaining candidates of character regions composite text lines, so that the proposed method can handle multi-orientation texts in natural scene images. Two public datasets, MSRA-TD500 and ICDAR2013 are utilized to evaluate the performance of our proposed method. Experimental results show that our method achieves high recall rate and demonstrates the competitive performance.

Original language	English
Title of host publication	Image and Graphics - 9th International Conference, ICIG 2017, Revised Selected Papers
Editors	Yao Zhao, David Taubman, Xiangwei Kong
Publisher	Springer Verlag
Pages	555-566
Number of pages	12
ISBN (Print)	9783319716060
DOIs	https://doi.org/10.1007/978-3-319-71607-7_49
State	Published - 2017
Event	9th International Conference on Image and Graphics, ICIG 2017 - Shanghai, China Duration: 13 Sep 2017 → 15 Sep 2017

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	10666 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	9th International Conference on Image and Graphics, ICIG 2017
Country/Territory	China
City	Shanghai
Period	13/09/17 → 15/09/17

Keywords

Background suppression
Fully Convolutional Network
Multi-orientation texts
Scene text detection

Access to Document

10.1007/978-3-319-71607-7_49

Cite this

Wang, X., Feng, X., Xia, Z., Peng, J., & Granger, E. (2017). Multi-orientation scene text detection leveraging background suppression. In Y. Zhao, D. Taubman, & X. Kong (Eds.), Image and Graphics - 9th International Conference, ICIG 2017, Revised Selected Papers (pp. 555-566). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10666 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-319-71607-7_49

Wang, Xihan ; Feng, Xiaoyi ; Xia, Zhaoqiang et al. / Multi-orientation scene text detection leveraging background suppression. Image and Graphics - 9th International Conference, ICIG 2017, Revised Selected Papers. editor / Yao Zhao ; David Taubman ; Xiangwei Kong. Springer Verlag, 2017. pp. 555-566 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{a5692fe5cf4843819982d1a2c2efaa6f,

title = "Multi-orientation scene text detection leveraging background suppression",

abstract = "Most state-of-the-art text detection methods are devoted to horizontal texts and these methods cannot work well when encountering blurred, multi-oriented, low-resolution and small-sized texts. In this paper, we propose to localize texts from the perspective of suppressing more non-text backgrounds, in which a coarse-to-fine strategy is presented to remove non-text pixels from images. Firstly, the fully convolutional network (FCN) framework is utilized to make the coarse prediction of text labeling. Secondly, an efficient saliency measure based on background priors is employed to further suppress non-text pixels and generate fine character candidate regions. The remaining candidates of character regions composite text lines, so that the proposed method can handle multi-orientation texts in natural scene images. Two public datasets, MSRA-TD500 and ICDAR2013 are utilized to evaluate the performance of our proposed method. Experimental results show that our method achieves high recall rate and demonstrates the competitive performance.",

keywords = "Background suppression, Fully Convolutional Network, Multi-orientation texts, Scene text detection",

author = "Xihan Wang and Xiaoyi Feng and Zhaoqiang Xia and Jinye Peng and Eric Granger",

note = "Publisher Copyright: {\textcopyright} 2017, Springer International Publishing AG.; 9th International Conference on Image and Graphics, ICIG 2017 ; Conference date: 13-09-2017 Through 15-09-2017",

year = "2017",

doi = "10.1007/978-3-319-71607-7_49",

language = "English",

isbn = "9783319716060",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "555--566",

editor = "Yao Zhao and David Taubman and Xiangwei Kong",

booktitle = "Image and Graphics - 9th International Conference, ICIG 2017, Revised Selected Papers",

}

Wang, X, Feng, X, Xia, Z, Peng, J & Granger, E 2017, Multi-orientation scene text detection leveraging background suppression. in Y Zhao, D Taubman & X Kong (eds), Image and Graphics - 9th International Conference, ICIG 2017, Revised Selected Papers. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 10666 LNCS, Springer Verlag, pp. 555-566, 9th International Conference on Image and Graphics, ICIG 2017, Shanghai, China, 13/09/17. https://doi.org/10.1007/978-3-319-71607-7_49

Multi-orientation scene text detection leveraging background suppression. / Wang, Xihan; Feng, Xiaoyi; Xia, Zhaoqiang et al.
Image and Graphics - 9th International Conference, ICIG 2017, Revised Selected Papers. ed. / Yao Zhao; David Taubman; Xiangwei Kong. Springer Verlag, 2017. p. 555-566 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10666 LNCS).

TY - GEN

T1 - Multi-orientation scene text detection leveraging background suppression

AU - Wang, Xihan

AU - Feng, Xiaoyi

AU - Xia, Zhaoqiang

AU - Peng, Jinye

AU - Granger, Eric

PY - 2017

Y1 - 2017

N2 - Most state-of-the-art text detection methods are devoted to horizontal texts and these methods cannot work well when encountering blurred, multi-oriented, low-resolution and small-sized texts. In this paper, we propose to localize texts from the perspective of suppressing more non-text backgrounds, in which a coarse-to-fine strategy is presented to remove non-text pixels from images. Firstly, the fully convolutional network (FCN) framework is utilized to make the coarse prediction of text labeling. Secondly, an efficient saliency measure based on background priors is employed to further suppress non-text pixels and generate fine character candidate regions. The remaining candidates of character regions composite text lines, so that the proposed method can handle multi-orientation texts in natural scene images. Two public datasets, MSRA-TD500 and ICDAR2013 are utilized to evaluate the performance of our proposed method. Experimental results show that our method achieves high recall rate and demonstrates the competitive performance.

AB - Most state-of-the-art text detection methods are devoted to horizontal texts and these methods cannot work well when encountering blurred, multi-oriented, low-resolution and small-sized texts. In this paper, we propose to localize texts from the perspective of suppressing more non-text backgrounds, in which a coarse-to-fine strategy is presented to remove non-text pixels from images. Firstly, the fully convolutional network (FCN) framework is utilized to make the coarse prediction of text labeling. Secondly, an efficient saliency measure based on background priors is employed to further suppress non-text pixels and generate fine character candidate regions. The remaining candidates of character regions composite text lines, so that the proposed method can handle multi-orientation texts in natural scene images. Two public datasets, MSRA-TD500 and ICDAR2013 are utilized to evaluate the performance of our proposed method. Experimental results show that our method achieves high recall rate and demonstrates the competitive performance.

KW - Background suppression

KW - Fully Convolutional Network

KW - Multi-orientation texts

KW - Scene text detection

UR - http://www.scopus.com/inward/record.url?scp=85040249159&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-71607-7_49

DO - 10.1007/978-3-319-71607-7_49

M3 - Conference contribution

AN - SCOPUS:85040249159

SN - 9783319716060

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 555

EP - 566

BT - Image and Graphics - 9th International Conference, ICIG 2017, Revised Selected Papers

A2 - Zhao, Yao

A2 - Taubman, David

A2 - Kong, Xiangwei

PB - Springer Verlag

T2 - 9th International Conference on Image and Graphics, ICIG 2017

Y2 - 13 September 2017 through 15 September 2017

ER -

Wang X, Feng X, Xia Z, Peng J, Granger E. Multi-orientation scene text detection leveraging background suppression. In Zhao Y, Taubman D, Kong X, editors, Image and Graphics - 9th International Conference, ICIG 2017, Revised Selected Papers. Springer Verlag. 2017. p. 555-566. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-319-71607-7_49