Abstract
Emotion recognition is an increasingly important sub-field in artificial intelligence (AI). Advances in this field could drastically change the way people interact with computers and allow for automation of tasks that currently require a lot of manual work. For example, registering the emotion a subject expresses for a potential advert. Previous work has shown that using multiple modalities, although challenging, is very beneficial. Affective cues in audio and video may not occur simultaneously, and the modalities do not always contribute equally to emotion. This work seeks to apply attention mechanisms to aid in the fusion of audio and video, for the purpose of emotion recognition using state-of-the-art techniques from artificial intelligence and, more specifically, deep neural networks. To achieve this, two forms of attention are used. Embedding attention applies attention on the input of a modality-specific model, allowing recurrent networks to consider multiple input time steps. Bimodal attention fusion applies attention to fuse the output of modality-specific networks. Combining both these attention mechanisms yielded CCCs of 0.62 and 0.72 for arousal and valence respectively on the RECOLA dataset used in AVEC 2016. These results are competitive with the state-of-the-art, underlying the potential of attention mechanisms in multimodal fusion for behavioral signals.
Original language | English |
---|---|
Title of host publication | The annual Benelux Conference on Artificial Intelligence and Machine Learning (BNAIC 2020) |
Subtitle of host publication | BNAIC/BeneLearn 2020 |
Pages | 130-139 |
Number of pages | 10 |
Publication status | Published - 2020 |
Event | Benelux Conference on Artificial Intelligence and Machine Learning - Online, Leiden University, Leiden, Netherlands Duration: 19 Nov 2020 → 20 Nov 2020 https://bnaic.liacs.leidenuniv.nl/ |
Conference
Conference | Benelux Conference on Artificial Intelligence and Machine Learning |
---|---|
Abbreviated title | BNAIC/BeneLearn 2020 |
Country/Territory | Netherlands |
City | Leiden |
Period | 19/11/20 → 20/11/20 |
Internet address |