Efficient three dimensional rotation estimation for camera-based OCR

Kanta Kuramoto, Wataru Oyama, Tetsushi Wakabayashi, Fumitaka Kimura

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Camera-Based Optical Character Recognition (CBOCR) has attracted interests of many researchers in both computer vision and document analysis research fields. A significant challenge in CBOCR is how we handle characters of those appearances are affected by three-dimensional (3D) rotation due to locational relationship between a printing plane and camera. Proper handling of these 3D rotated characters is expected to improve the performance of both detection and recognition of camera-captured characters. In this paper, we propose an efficient implementation of 3D rotation estimation for camera-captured characters. The proposed implementation requires small memory load and short computational time. We employ Linear Discriminant Function (LDF) instead of Modified Quadratic Discriminant Function (MQDF) for further memory reduction. The results of experimental evaluation using a large-scale alphanumeric character dataset showed that small number of dimensionality of original feature vector is sufficient for keeping accuracy of 3D rotation estimation and total amount of memory required for 3D rotation estimation is reduced from 141.0 MB to 6.6 MB.

Original languageEnglish
Title of host publicationProceedings of the 14th IAPR International Conference on Machine Vision Applications, MVA 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages459-462
Number of pages4
ISBN (Electronic)9784901122153
DOIs
Publication statusPublished - Jul 8 2015
Event14th IAPR International Conference on Machine Vision Applications, MVA 2015 - Tokyo, Japan
Duration: May 18 2015May 22 2015

Publication series

NameProceedings of the 14th IAPR International Conference on Machine Vision Applications, MVA 2015

Other

Other14th IAPR International Conference on Machine Vision Applications, MVA 2015
Country/TerritoryJapan
CityTokyo
Period5/18/155/22/15

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'Efficient three dimensional rotation estimation for camera-based OCR'. Together they form a unique fingerprint.

Cite this