Hierarchical Classification and Transfer Learning to Recognize Head Gestures and Facial Expressions Using Earbuds

Shkurta Gashi, Aaqib Saeed, Alessandra Vicini, Elena Di Lascio, Silvia Santini

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

15 Citations (Scopus)

Abstract

Head gestures and facial expressions - like, e.g., nodding or smiling - are important indicators of the quality of human interactions in physical meetings as well as in computer-mediated settings. Computer systems able to recognize such behavioral cues can support and improve human interactions. Several researchers have thus tackled the problem of automatically recognizing head gestures and facial expressions, mainly leveraging video data. In this paper, we instead consider inertial signals collected from unobtrusive, ear-mounted devices. We focus on typical activities performed during social interactions - head shaking, nodding, smiling, talking and yawning - and propose a hierarchical classification approach to discriminate them from each other. Further, we investigate whether the transfer of knowledge learned from publicly available datasets leads to further performance improvements. Our results show that the combined use of our hierarchical approach and transfer learning allows the classifier to discriminate head and mouth activities with an F1 score of 84.79, smile, talk and yawn with an F1 score of 45.42, and nodding and head shaking with an F1 score of 88.24, outperforming shallow classifiers by 2-9 percentage points.

Original languageEnglish
Title of host publicationICMI 2021 - Proceedings of the 2021 International Conference on Multimodal Interaction
PublisherAssociation for Computing Machinery, Inc
Pages168-176
Number of pages9
ISBN (Electronic)9781450384810
DOIs
Publication statusPublished - 18 Oct 2021
Event23rd ACM International Conference on Multimodal Interaction, ICMI 2021 - Virtual, Online, Canada
Duration: 18 Oct 202122 Oct 2021

Conference

Conference23rd ACM International Conference on Multimodal Interaction, ICMI 2021
Country/TerritoryCanada
CityVirtual, Online
Period18/10/2122/10/21

Bibliographical note

Publisher Copyright:
© 2021 ACM.

Keywords

  • Earable Computing
  • Facial Expressions Recognition
  • Head Gestures Detection
  • Hierarchical Classification
  • Transfer learning

Fingerprint

Dive into the research topics of 'Hierarchical Classification and Transfer Learning to Recognize Head Gestures and Facial Expressions Using Earbuds'. Together they form a unique fingerprint.

Cite this