Multi-orientation scene text detection leveraging background suppression

Xihan Wang, Xiaoyi Feng, Zhaoqiang Xia, Jinye Peng, Eric Granger

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Most state-of-the-art text detection methods are devoted to horizontal texts and these methods cannot work well when encountering blurred, multi-oriented, low-resolution and small-sized texts. In this paper, we propose to localize texts from the perspective of suppressing more non-text backgrounds, in which a coarse-to-fine strategy is presented to remove non-text pixels from images. Firstly, the fully convolutional network (FCN) framework is utilized to make the coarse prediction of text labeling. Secondly, an efficient saliency measure based on background priors is employed to further suppress non-text pixels and generate fine character candidate regions. The remaining candidates of character regions composite text lines, so that the proposed method can handle multi-orientation texts in natural scene images. Two public datasets, MSRA-TD500 and ICDAR2013 are utilized to evaluate the performance of our proposed method. Experimental results show that our method achieves high recall rate and demonstrates the competitive performance.

Original languageEnglish
Title of host publicationImage and Graphics - 9th International Conference, ICIG 2017, Revised Selected Papers
EditorsYao Zhao, David Taubman, Xiangwei Kong
PublisherSpringer Verlag
Pages555-566
Number of pages12
ISBN (Print)9783319716060
DOIs
StatePublished - 2017
Event9th International Conference on Image and Graphics, ICIG 2017 - Shanghai, China
Duration: 13 Sep 201715 Sep 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10666 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference9th International Conference on Image and Graphics, ICIG 2017
Country/TerritoryChina
CityShanghai
Period13/09/1715/09/17

Keywords

  • Background suppression
  • Fully Convolutional Network
  • Multi-orientation texts
  • Scene text detection

Fingerprint

Dive into the research topics of 'Multi-orientation scene text detection leveraging background suppression'. Together they form a unique fingerprint.

Cite this