Book Title Recognition on 360VR Images for VR Tour of Library Building

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper addresses optical character recognition (OCR) on 360VR images. The authors previously proposed a development framework for web-based VR tours, namely navigation VR tours built from 360VR videos and walkthrough VR tours built from 360VR images. They also proposed an extended development framework for web-based VR tours of 360VR videos based on OpenVSLAM (Open Visual SLAM: Simultaneous Localization and Mapping). With OpenVSLAM, it is possible to generate a map consisting of several time-sequential locations and to extract the corresponding 360VR frame images from a navigation 360VR video. Furthermore, the authors introduced a keyword search function and a subtitle function into the proposed OpenVSLAM-based development framework. The keyword search function is realized by OCR technology, specifically the Google Cloud Vision AI service, applied to the extracted 360VR frame images; the subtitle function is realized by speech-to-text technology, specifically the Assembly AI service, applied to a narration sound file extracted from the navigation 360VR video. Using the proposed framework, the authors have already developed web-based VR tours of their university's library building. If the locations of books could be obtained by entering their titles, the library building VR tour would become more useful than ever. This paper therefore addresses book title recognition on 360VR images. Although the OCR results of the Google Cloud Vision AI service are very good, the service is not free. Free OCR software is also available, namely EasyOCR, TesseractOCR, and PaddleOCR. The authors performed OCR experiments on recognizing vertical book titles written in Japanese kanji characters on 360VR images with these three OCR tools and found that TesseractOCR produced the best experimental results.
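The title-to-location lookup motivated in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the data layout (a dictionary from location index to recognized strings), the function names, the similarity threshold, and the sample titles are all hypothetical. The sketch uses approximate string matching from Python's standard library so that a title can still be found when the OCR output contains a misread character.

```python
from difflib import SequenceMatcher

# Hypothetical OCR output: location index along the tour -> strings recognized
# in the 360VR frame image extracted at that location. The titles are made up
# for illustration and are not taken from the paper.
ocr_by_location = {
    0: ["図書館案内", "新着図書"],
    1: ["情報科学入門", "画像処理の基礎"],
    2: ["画像処埋の基礎", "人工知能概論"],  # simulated OCR misread (埋 for 理)
}


def similarity(a, b):
    """Return a similarity ratio in [0, 1] between two strings."""
    return SequenceMatcher(None, a, b).ratio()


def find_title(query, ocr_results, threshold=0.8):
    """Return (location, score) pairs whose best-matching recognized string
    reaches the threshold, sorted by descending score, then by location."""
    hits = []
    for location, texts in ocr_results.items():
        best = max((similarity(query, t) for t in texts), default=0.0)
        if best >= threshold:
            hits.append((location, best))
    return sorted(hits, key=lambda h: (-h[1], h[0]))


if __name__ == "__main__":
    for location, score in find_title("画像処理の基礎", ocr_by_location):
        print(f"location {location}: score {score:.2f}")
```

The threshold of 0.8 is an arbitrary choice for this sketch; a real tour would need it tuned against the error rate of whichever OCR engine is used.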

Original language: English
Title of host publication: Proceedings - 2024 16th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2024
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 196-201
Number of pages: 6
ISBN (Electronic): 9798350377903
Publication status: Published - 2024
Event: 16th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2024 - Takamatsu, Japan
Duration: Jul 6 2024 to Jul 12 2024

Publication series

Name: Proceedings - 2024 16th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2024

Conference

Conference: 16th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2024
Country/Territory: Japan
City: Takamatsu
Period: 7/6/24 to 7/12/24

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications
  • Information Systems
  • Information Systems and Management
