On the relationship between speech-based breathing signal prediction evaluation measures and breathing parameters estimation

Zohreh Mostaani*, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik, Mathew Magimai-Doss

*Corresponding author for this work

Research output: Contribution to journalConference article in journalAcademicpeer-review

Abstract

The respiratory system is one of the major components of the speech production system. Any alteration in breathing can result in changes in speech. Specific breathing characteristics, such as breathing rate and tidal volume, can indicate a person's pathological condition. More recently, neural network-based methods have started emerging for predicting the breathing signal from the speech signal. The neural networks are trained and evaluated with different objective measures, such as mean squared error (MSE) and Pearson's correlation. This paper investigates whether there is a systematic relationship between the different objective measures used for training and evaluating the neural network models and the end-goal, i.e. estimation of breathing parameters such as, breathing rate and tidal volume. Our investigations on two different data sets with two different neural network-based approaches show that there is no clear systematic relationship. In other words, obtaining a high Pearson's correlation on the evaluation set does not necessarily mean better breathing parameter estimation. Thus, indicating the need for developing other objective evaluation measures.

Original languageEnglish
Pages (from-to)1345-1349
Number of pages5
JournalICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume2021-June
DOIs
Publication statusPublished - 2021
Externally publishedYes
Event2021 IEEE International Conference on Acoustics, Speech, and Signal Processing - Online, Toronto, Canada
Duration: 6 Jun 202111 Jun 2021

Keywords

  • Neural networks
  • Respiratory parameters
  • Speech breathing

Fingerprint

Dive into the research topics of 'On the relationship between speech-based breathing signal prediction evaluation measures and breathing parameters estimation'. Together they form a unique fingerprint.

Cite this