Speech Breathing Estimation Using Deep Learning Methods

Venkata Srikanth Nallanthighal*, Aki Harma, Helmer Strik

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingAcademicpeer-review

Abstract

Breathing is the primary mechanism for maintaining the subglottal pressure for speech production. Speech can be seen as a systematic outflow of air during exhalation characterized by linguistic content and prosodic factors. Thus, sensing respiratory dynamics from the speech is plausible. In this paper, we explore techniques for sensing breathing from speech using deep learning architectures including multi-task learning approaches. Estimating the breathing pattern from the speech would give us information about the respiration rate, breathing capacity and thus enable us to understand the pathological condition of a person using one's speech. Training and evaluation of our model on our database of breathing signal and speech for 40 subjects yielded a sensitivity of 0.88 for breath event detection and 5.6 % error for breathing rate estimation.

Original languageEnglish
Title of host publication2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020 - Proceedings
PublisherIEEE
Pages1140-1144
Number of pages5
ISBN (Electronic)9781509066315
DOIs
Publication statusPublished - May 2020
Externally publishedYes
Event45th International Conference on Acoustics, Speech, and Signal Processing - Online, Barcelona, Spain
Duration: 4 May 20208 May 2020
Conference number: 45
https://2020.ieeeicassp.org/

Publication series

SeriesICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume2020-May
ISSN1520-6149

Conference

Conference45th International Conference on Acoustics, Speech, and Signal Processing
Abbreviated titleICASSP 2020
Country/TerritorySpain
CityBarcelona
Period4/05/208/05/20
Internet address

Keywords

  • deep neural networks
  • Multi task learning
  • signal processing
  • Speech breathing
  • speech technology

Fingerprint

Dive into the research topics of 'Speech Breathing Estimation Using Deep Learning Methods'. Together they form a unique fingerprint.

Cite this