Samenvatting
Head gestures and facial expressions - like, e.g., nodding or smiling - are important indicators of the quality of human interactions in physical meetings as well as in computer-mediated settings. Computer systems able to recognize such behavioral cues can support and improve human interactions. Several researchers have thus tackled the problem of automatically recognizing head gestures and facial expressions, mainly leveraging video data. In this paper, we instead consider inertial signals collected from unobtrusive, ear-mounted devices. We focus on typical activities performed during social interactions - head shaking, nodding, smiling, talking and yawning - and propose a hierarchical classification approach to discriminate them from each other. Further, we investigate whether the transfer of knowledge learned from publicly available datasets leads to further performance improvements. Our results show that the combined use of our hierarchical approach and transfer learning allows the classifier to discriminate head and mouth activities with an F1 score of 84.79, smile, talk and yawn with an F1 score of 45.42, and nodding and head shaking with an F1 score of 88.24, outperforming shallow classifiers by 2-9 percentage points.
Originele taal-2 | Engels |
---|---|
Titel | ICMI 2021 - Proceedings of the 2021 International Conference on Multimodal Interaction |
Uitgeverij | Association for Computing Machinery, Inc |
Pagina's | 168-176 |
Aantal pagina's | 9 |
ISBN van elektronische versie | 9781450384810 |
DOI's | |
Status | Gepubliceerd - 18 okt. 2021 |
Evenement | 23rd ACM International Conference on Multimodal Interaction, ICMI 2021 - Virtual, Online, Canada Duur: 18 okt. 2021 → 22 okt. 2021 |
Congres
Congres | 23rd ACM International Conference on Multimodal Interaction, ICMI 2021 |
---|---|
Land/Regio | Canada |
Stad | Virtual, Online |
Periode | 18/10/21 → 22/10/21 |
Bibliografische nota
Publisher Copyright:© 2021 ACM.