Abstract
Affective computing is a subset of the larger field of human-computer interaction, with important connections to cognitive processes, influencing learning, decision-making and perception. Among the many channels of communication, facial expressions are one of the most widely accepted for conveying emotion, and they have received increased attention in recent years. An important aspect contributing to their recognition success is the modeling of the temporal dimension. This paper therefore investigates the applicability of current state-of-the-art action recognition techniques to the human emotion recognition task. In particular, two architectures were investigated: a CNN-based model, the Temporal Shift Module (TSM), which can learn spatio-temporal features in 3D data at the computational cost of a 2D CNN, and a video-based vision transformer employing spatio-temporal self-attention. The models were trained and tested on the CREMA-D dataset, demonstrating state-of-the-art performance with mean class accuracies of 82% and 77%, respectively, and outperforming the best previous approaches by at least 3.5%.
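The key idea behind TSM is that shifting a small fraction of feature channels along the temporal axis exchanges information between neighboring frames at no FLOP cost, so an ordinary 2D CNN applied afterwards can capture temporal structure. A minimal NumPy sketch of that shift operation (function name and shift fraction are illustrative, not taken from the paper's implementation):

```python
import numpy as np

def temporal_shift(x, shift_fraction=8):
    """Shift a fraction of channels along the temporal axis (zero-padded).

    x: array of shape (batch, time, channels, height, width).
    One 1/shift_fraction slice of channels moves forward in time,
    another slice moves backward, and the rest stay in place, mixing
    information across frames before a 2D convolution is applied.
    """
    b, t, c, h, w = x.shape
    fold = c // shift_fraction
    out = np.zeros_like(x)
    out[:, 1:, :fold] = x[:, :-1, :fold]                  # frame t sees frame t-1
    out[:, :-1, fold:2 * fold] = x[:, 1:, fold:2 * fold]  # frame t sees frame t+1
    out[:, :, 2 * fold:] = x[:, :, 2 * fold:]             # remaining channels unshifted
    return out
```

In the actual architecture this shift is inserted inside residual blocks, so the shifted features are fused with the unshifted path by the following 2D convolution.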
Original language | English |
---|---|
Title of host publication | 2021 9th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW) |
Publisher | IEEE |
Pages | 01-08 |
Number of pages | 8 |
Publication status | Published - 1 Sept 2021 |
Event | 2021 9th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos - Nara, Japan |
Duration | 28 Sept 2021 → 1 Oct 2021 |
Conference number | 29 |
Conference
Conference | 2021 9th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos |
---|---|
Abbreviated title | ACIIW 2021 |
Country/Territory | Japan |
City | Nara |
Period | 28/09/21 → 1/10/21 |
Keywords
- Temporal shift module (TSM)
- Vision transformers
- Emotion recognition
- Action recognition