跳到主要导航 跳到搜索 跳到主要内容

Pushing the Performance Limit of Scene Text Recognizer without Human Annotation

  • Caiyuan Zheng
  • , Hui Li
  • , Seon Min Rhee
  • , Seungju Han
  • , Jae Joon Han
  • , Peng Wang
  • Northwestern Polytechnical University Xian
  • National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology
  • Samsung R&d Institute China Xi'an (SRCX)
  • Samsung

科研成果: 书/报告/会议事项章节会议稿件同行评审

19 引用 (Scopus)

摘要

Scene text recognition (STR) attracts much attention over the years because of its wide application. Most methods train STR model in a fully supervised manner which requires large amounts of labeled data. Although synthetic data contributes a lot to STR, it suffers from the real-to-synthetic domain gap the restricts model performance. In this work, we aim to boost STR models by leveraging both synthetic data and the numerous real unlabeled images, exempting human annotation cost thoroughly. A robust con-sistency regularization based semi-supervised framework is proposed for STR, which can effectively solve the instability issue due to domain inconsistency between synthetic and real images. A character-level consistency regularization is designed to mitigate the misalignment between characters in sequence recognition. Extensive experiments on standard text recognition benchmarks demonstrate the effectiveness of the proposed method. It can steadily improve existing STR models, and boost an STR model to achieve new state-of-the-art results. To our best knowledge, this is the first consistency regularization based framework that applies successfully to STR.

源语言英语
主期刊名Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
出版商IEEE Computer Society
14096-14105
页数10
ISBN(电子版)9781665469463
DOI
出版状态已出版 - 2022
已对外发布
活动2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 - New Orleans, 美国
期限: 19 6月 202224 6月 2022

出版系列

姓名Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
2022-June
ISSN(印刷版)1063-6919

会议

会议2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
国家/地区美国
New Orleans
时期19/06/2224/06/22

指纹

探究 'Pushing the Performance Limit of Scene Text Recognizer without Human Annotation' 的科研主题。它们共同构成独一无二的指纹。

引用此