Abstract
We aim to develop a support system that enhances human short-term visual memory in an intelligent space where a human and a service robot coexist. In particular, this paper focuses on how to interpret and record diverse and complex life events on behalf of humans from multiple perspectives. We propose a novel method named 'fourth-person captioning', which generates natural language descriptions by complementarily summarizing visual contexts from three types of cameras corresponding to the first-, second-, and third-person viewpoints. We first extend the latest image captioning technique and design a new model that generates a sequence of words given multiple images. We then provide an effective training strategy that requires only annotations supervising images from a single viewpoint in a general caption dataset, together with unsupervised triplet instances from the intelligent space. As the three types of cameras, we select a wearable camera on the human, a robot-mounted camera, and an embedded camera, which are defined as the first-, second-, and third-person viewpoints, respectively. We hope our work will accelerate cross-modal interaction that bridges human egocentric cognition and multi-perspective intelligence.
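The abstract does not spell out the training objective. As a toy illustration only, assume that the supervised part is a next-word cross-entropy loss on captioned single-viewpoint images and that the unsupervised part penalizes disagreement among the word distributions predicted from an unlabeled first-, second-, and third-person triplet. Under those assumptions, one training step's loss might be sketched as below. Every function name and the `weight` parameter are hypothetical and are not taken from the paper.

```python
import math


def softmax(logits):
    """Convert raw scores over the vocabulary into probabilities."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, target_index):
    """Negative log-likelihood of the annotated next word."""
    return -math.log(probs[target_index])


def consistency_penalty(distributions):
    """Mean squared deviation of each view's distribution from the average.

    Assumption: this stands in for a regularizer over the three viewpoints.
    """
    n = len(distributions)
    avg = [sum(col) / n for col in zip(*distributions)]
    return sum(
        (p - a) ** 2 for dist in distributions for p, a in zip(dist, avg)
    ) / n


def combined_loss(supervised_logits, target_index, triplet_logits, weight=0.5):
    """Supervised caption loss plus a weighted cross-view consistency term."""
    supervised = cross_entropy(softmax(supervised_logits), target_index)
    views = [softmax(logits) for logits in triplet_logits]
    return supervised + weight * consistency_penalty(views)


# Hypothetical two-word vocabulary: one annotated image, one unlabeled triplet.
loss = combined_loss(
    supervised_logits=[2.0, 0.5],
    target_index=0,
    triplet_logits=[[1.0, 0.2], [0.8, 0.4], [0.1, 1.5]],
)
```

When the three views already agree, the penalty is zero and the loss reduces to the supervised term, which is the intuition behind treating the unlabeled triplets as a regularizer rather than as a second source of labels.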
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 2122-2127 |
| Number of pages | 6 |
| ISBN (Electronic) | 9781538666500 |
| DOI | |
| Publication status | Published - Jan 16 2019 |
| Event | 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018 - Miyazaki, Japan. Duration: Oct 7 2018 → Oct 10 2018 |
Publication series
| Name | Proceedings - 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018 |
|---|---|
Conference
| Conference | 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018 |
|---|---|
| Country/Territory | Japan |
| City | Miyazaki |
| Period | 10/7/18 → 10/10/18 |
UN SDG
This output contributes to the following Sustainable Development Goals:
- SDG 3 Good Health and Well-being
All Science Journal Classification (ASJC) codes
- Information Systems
- Information Systems and Management
- Health Informatics
- Artificial Intelligence
- Computer Networks and Communications
- Human-Computer Interaction
Fingerprint
Dive into the research topics of 'Fourth-Person Captioning: Describing Daily Events by Uni-supervised and Tri-regularized Training'. Together they form a unique fingerprint.