TY - GEN
T1 - An End-To-End Hybrid Overhead Facial Orientation Estimation System for Offline Group Discussions
AU - Chen, Chenhao
AU - Nakamura, Yugo
AU - Arakawa, Yutaka
N1 - Publisher Copyright:
© 2023 IPSJ.
PY - 2023
Y1 - 2023
N2 - Aiming to evaluate the concentration and engagement level in offline group discussion scenarios, we seek to capture the interaction information among meeting participants. In this paper, we focus on the facial orientation and present an end-To-end hybrid system capable of 2D overhead facial orientation estimation. Compared to common perspectives from which personal identity information is exposed, we explore approaches to perform overhead facial orientation estimation due to the privacy problem. The proposed system is a neural network-based architecture and composed of three modules. First we leverage and train from scratch the YOLOv8 model on the HollywoodHeads dataset, which allows full-perspective head detection in visual scenes. With the obtained head location, we then generate the corresponding head masks from the visual information within the bounding box. Finally, we fit them to ellipse so that 2D overhead facial orientation of all the meeting participants in the visual scenes can be estimated with the ellipse properties. Experiments demonstrate that our system achieves an MAE of 12.47 on the test dataset.
AB - Aiming to evaluate the concentration and engagement level in offline group discussion scenarios, we seek to capture the interaction information among meeting participants. In this paper, we focus on the facial orientation and present an end-To-end hybrid system capable of 2D overhead facial orientation estimation. Compared to common perspectives from which personal identity information is exposed, we explore approaches to perform overhead facial orientation estimation due to the privacy problem. The proposed system is a neural network-based architecture and composed of three modules. First we leverage and train from scratch the YOLOv8 model on the HollywoodHeads dataset, which allows full-perspective head detection in visual scenes. With the obtained head location, we then generate the corresponding head masks from the visual information within the bounding box. Finally, we fit them to ellipse so that 2D overhead facial orientation of all the meeting participants in the visual scenes can be estimated with the ellipse properties. Experiments demonstrate that our system achieves an MAE of 12.47 on the test dataset.
UR - http://www.scopus.com/inward/record.url?scp=85185559237&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85185559237&partnerID=8YFLogxK
U2 - 10.23919/ICMU58504.2023.10412258
DO - 10.23919/ICMU58504.2023.10412258
M3 - Conference contribution
AN - SCOPUS:85185559237
T3 - 2023 14th International Conference on Mobile Computing and Ubiquitous Network, ICMU 2023
BT - 2023 14th International Conference on Mobile Computing and Ubiquitous Network, ICMU 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 14th International Conference on Mobile Computing and Ubiquitous Network, ICMU 2023
Y2 - 29 November 2023 through 1 December 2023
ER -