メインナビゲーションにスキップ 検索にスキップ メインコンテンツにスキップ

Fourth-Person Captioning: Describing Daily Events by Uni-supervised and Tri-regularized Training

研究成果: 書籍/レポート タイプへの寄稿会議への寄与

抄録

We aim to develop a supporting system which enhances the ability of human's short-term visual memory in an intelligent space where the human and a service robot coexist. Particularly, this paper focuses on how we can interpret and record diverse and complex life events on behalf of humans, from a multi-perspective viewpoint. We propose a novel method named 'fourth-person captioning', which generates natural language descriptions by summarizing visual contexts complementarily from three types of cameras corresponding the first-, second-, and third-person viewpoint. We first extend the latest image captioning technique and design a new model to generate a sequence of words given the multiple images. Then we provide an effective training strategy that needs only annotations supervising images from a single viewpoint in a general caption dataset and unsupervised triplet instances in the intelligent space. As the three types of cameras, we select a wearable camera on the human, a robot-mounted camera, and an embedded camera, which can be defined as the first-, second-, and third-person viewpoint, respectively. We hope our work will accelerate a cross-modal interaction bridging the human's egocentric cognition and multi-perspective intelligence.

本文言語英語
ホスト出版物のタイトルProceedings - 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018
出版社Institute of Electrical and Electronics Engineers Inc.
ページ2122-2127
ページ数6
ISBN(電子版)9781538666500
DOI
出版ステータス出版済み - 1月 16 2019
イベント2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018 - Miyazaki, 日本
継続期間: 10月 7 201810月 10 2018

出版物シリーズ

名前Proceedings - 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018

会議

会議2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018
国/地域日本
CityMiyazaki
Period10/7/1810/10/18

UN SDG

この成果は、次の持続可能な開発目標に貢献しています

  1. SDG 3 - すべての人に健康と福祉を
    SDG 3 すべての人に健康と福祉を

!!!All Science Journal Classification (ASJC) codes

  • 情報システム
  • 情報システムおよび情報管理
  • 健康情報学
  • 人工知能
  • コンピュータ ネットワークおよび通信
  • 人間とコンピュータの相互作用

フィンガープリント

「Fourth-Person Captioning: Describing Daily Events by Uni-supervised and Tri-regularized Training」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル