Book Title Recognition on 360VR Images for VR Tour of Library Building

研究成果: 書籍/レポート タイプへの寄稿会議への寄与

抄録

This paper treats the optical character recognition (OCR) on 360VR images. The authors have already proposed a development framework for web-based VR tours, i.e., navigation VR tours of 360VR videos and walkthrough VR tours of 360VR images. They also proposed the extended development framework for web-based VR tours of 360VR videos based on OpenVSLAM(Open Visual SLAM: Simultaneous Localization and Mapping). Using OpenVSLAM, it is possible to generate a map consisting of several time-sequential locations and to extract their corresponding 360VR frame images from a navigation 360VR video. Furthermore, the authors introduced a keyword search function and a subtitle function into the proposed OpenVSLAM-Based development framework. The keyword search function is realized by OCR technology, i.e., Google Cloud Vision AI service, applied to extracted 360VR frame images and the subtitle function is realized by speech-to-text technology, i.e., Assembly AI service, applied to a narration sound file extracted from a navigation 360VR video. The authors have already developed web-based VR tours of the library building of the authors' university using the proposed framework. If the locations of books can be obtained by entering their book titles, the availability of the library building VR tour will become higher than ever. So, this paper treats the book title recognition on 360VR images. Although OCR results by Google Cloud Vision AI service are very good, the problem is that the service is not free. On the other hand, there are free OCR software, i.e., EasyOCR, TesseractOCR, and PaddleOCR. The authors performed OCR experiments for recognizing vertical book titles in Japanese kanji characters on 360VR images by the three OCR software and found that TesseractOCR has the best experimental results.

本文言語英語
ホスト出版物のタイトルProceedings - 2024 16th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2024
出版社Institute of Electrical and Electronics Engineers Inc.
ページ196-201
ページ数6
ISBN(電子版)9798350377903
DOI
出版ステータス出版済み - 2024
イベント16th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2024 - Takamatsu, 日本
継続期間: 7月 6 20247月 12 2024

出版物シリーズ

名前Proceedings - 2024 16th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2024

会議

会議16th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2024
国/地域日本
CityTakamatsu
Period7/6/247/12/24

!!!All Science Journal Classification (ASJC) codes

  • 人工知能
  • コンピュータ ビジョンおよびパターン認識
  • コンピュータ ネットワークおよび通信
  • 情報システム
  • 情報システムおよび情報管理

フィンガープリント

「Book Title Recognition on 360VR Images for VR Tour of Library Building」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル