Real-Time Lexicon-Free Scene Text Localization and Recognition

  • Lukas Neumann*
  • , Jiri Matas
  • *Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

134 Citations (Scopus)

Abstract

An end-to-end real-time text localization and recognition method is presented. Its real-time performance is achieved by posing the character detection and segmentation problem as an efficient sequential selection from the set of Extremal Regions. The ER detector is robust against blur, low contrast and illumination, color and texture variation. In the first stage, the probability of each ER being a character is estimated using features calculated by a novel algorithm in constant time and only ERs with locally maximal probability are selected for the second stage, where the classification accuracy is improved using computationally more expensive features. A highly efficient clustering algorithm then groups ERs into text lines and an OCR classifier trained on synthetic fonts is exploited to label character regions. The most probable character sequence is selected in the last stage when the context of each character is known. The method was evaluated on three public datasets. On the ICDAR 2013 dataset the method achieves state-of-the-art results in text localization; on the more challenging SVT dataset, the proposed method significantly outperforms the state-of-the-art methods and demonstrates that the proposed pipeline can incorporate additional prior knowledge about the detected text. The proposed method was exploited as the baseline in the ICDAR 2015 Robust Reading competition, where it compares favourably to the state-of-the art.

Original languageEnglish
Pages (from-to)1872-1885
Number of pages14
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Volume38
Issue number9
DOIs
Publication statusPublished - Sept 2016
Externally publishedYes
Publication typeA1 Journal article-refereed

Keywords

  • Text-in-the wild
  • scene text
  • end-to-end text recognition
  • photo OCR
  • ALGORITHMS
  • IMAGES
  • SEGMENTATION

Fingerprint

Dive into the research topics of 'Real-Time Lexicon-Free Scene Text Localization and Recognition'. Together they form a unique fingerprint.

Cite this