Text detection and recognition in real world images

Raid Saabni, Moti Zwilling

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

Detecting and recognizing text in real-world images such as signboards and advertisements is an important part of computer vision applications. The complexity of the problem arises from many factors, such as nonuniform backgrounds, different languages and fonts, and inconsistent text alignment and orientation. In this paper, we present a novel approach to detecting characters and words in real-world images. The presented approach decomposes the gray-level image into a sequence of images, each of which includes the pixels whose gray-level values fall in a different disjoint range. This decomposition enables extracting connected components representing characters or other non-textual objects, separated from their neighboring background. An interpolation of two classes of features, translated to histograms, is used by a support vector machine to classify and collect the textual objects, generating the textual zones. The Shape Context Descriptor [1] is used with the Earth Mover's Distance (EMD) method to recognize the characters within the image. The recognized characters are fed to a heuristic rule-based system to determine words and give the final results. To optimize the speed of the system, we follow the embedding of the EMD metric into a normed space presented in [22], which enables fast approximation of the κ-Nearest Neighbors using Locality-Sensitive Hashing (LSH) functions. Experiments show that our algorithm can detect and recognize text regions from the ICDAR 2005 datasets [17] with high rates.
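
The decomposition step described in the abstract can be illustrated with a minimal sketch. It assumes equal-width gray-level bands and 4-connectivity; the abstract does not state how the ranges are chosen or which connectivity is used, so both are assumptions here. The function names `decompose_gray_levels` and `connected_components` are hypothetical and are not taken from the paper.

```python
from collections import deque


def decompose_gray_levels(image, num_bands=4, max_level=255):
    """Split a grayscale image (list of rows) into binary layers.

    Layer i marks the pixels whose gray level falls in the i-th of
    num_bands equal-width, disjoint ranges (an assumed banding scheme).
    """
    band_width = (max_level + 1) / num_bands
    height, width = len(image), len(image[0])
    layers = [[[0] * width for _ in range(height)] for _ in range(num_bands)]
    for y in range(height):
        for x in range(width):
            band = min(int(image[y][x] / band_width), num_bands - 1)
            layers[band][y][x] = 1
    return layers


def connected_components(mask):
    """Return the 4-connected components of a binary mask.

    Each component is a list of (row, col) pixel coordinates.
    """
    height, width = len(mask), len(mask[0])
    seen = [[False] * width for _ in range(height)]
    components = []
    for y in range(height):
        for x in range(width):
            if not mask[y][x] or seen[y][x]:
                continue
            # Breadth-first flood fill from an unvisited foreground pixel.
            queue = deque([(y, x)])
            seen[y][x] = True
            pixels = []
            while queue:
                cy, cx = queue.popleft()
                pixels.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if 0 <= ny < height and 0 <= nx < width \
                            and mask[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            components.append(pixels)
    return components


# Toy image: two dark vertical strokes on a bright background.
image = [
    [250, 250, 250, 250, 250],
    [250,  10, 250,  20, 250],
    [250,  10, 250,  20, 250],
    [250, 250, 250, 250, 250],
]
layers = decompose_gray_levels(image, num_bands=4)
dark_components = connected_components(layers[0])
print(len(dark_components))
```

In this toy case the darkest layer isolates the two strokes as separate components, while the background falls into the brightest layer. In the approach the abstract describes, components like these would then be passed to the classification stage, which this sketch does not cover.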

Original language: English
Title of host publication: Proceedings - 13th International Conference on Frontiers in Handwriting Recognition, ICFHR 2012
Pages: 443-448
Number of pages: 6
State: Published - 2012
Externally published: Yes
Event: 13th International Conference on Frontiers in Handwriting Recognition, ICFHR 2012 - Bari, Italy
Duration: 18 Sep 2012 to 20 Sep 2012

Publication series

Name: Proceedings - International Workshop on Frontiers in Handwriting Recognition, IWFHR

Conference

Conference: 13th International Conference on Frontiers in Handwriting Recognition, ICFHR 2012
Country/Territory: Italy
City: Bari
Period: 18/09/12 to 20/09/12

Keywords

  • Earth movers distance
  • Embedding
  • Locality-sensitive hashing
  • Text detection
  • Word searching
  • κ-nearest neighbor

All Science Journal Classification (ASJC) codes

  • Software
  • Computer Vision and Pattern Recognition
