Deep sensing of breathing signal during conversational speech

Venkata Srikanth Nallanthighal*, Aki Härmä, Helmer Strik

*Corresponding author for this work

Research output: Contribution to journal › Conference article in journal › Academic › peer-review

Abstract

In this paper, we show the first results on the estimation of the breathing signal from conversational speech using deep learning algorithms. Respiratory diseases such as COPD, asthma, and respiratory infections are common in the elderly population and, more generally, among patients in health care monitoring and medical alert services. In this work, we compare algorithms for the estimation of a known respiratory target signal, measured by respiratory belt transducers positioned across the rib cage and abdomen, from conversational speech. We demonstrate the estimation of the respiratory signal from speech using convolutional and recurrent neural networks. The estimated breathing pattern gives the respiratory rate and breathing capacity, and thus might provide indications of the pathological condition of the speaker. Evaluation of our model on our database of breathing signal and speech yielded a sensitivity of 91.2% for breath event detection and a mean absolute error of 1.01 breaths per minute for breathing rate estimation.
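The two evaluation figures quoted in the abstract, sensitivity for breath event detection and mean absolute error for breathing rate, can be illustrated with a minimal sketch. The abstract does not say how detected breath events are matched to reference events, so the matching rule here (nearest unused detection within a tolerance window, `tolerance_s`) is an assumption, as are the function names and the toy numbers in the usage lines. It is not the authors' evaluation code.

```python
from typing import Sequence


def breath_event_sensitivity(reference: Sequence[float],
                             detected: Sequence[float],
                             tolerance_s: float = 0.5) -> float:
    """Sensitivity = matched reference events / all reference events.

    Assumption: a reference breath event (time in seconds) counts as
    detected if an unused detection lies within +/- tolerance_s of it.
    """
    if not reference:
        raise ValueError("reference must contain at least one event")
    unused = sorted(detected)
    true_positives = 0
    for ref_time in sorted(reference):
        # Candidate detections close enough to this reference event.
        candidates = [d for d in unused if abs(d - ref_time) <= tolerance_s]
        if candidates:
            # Consume the closest one so it cannot match twice.
            closest = min(candidates, key=lambda d: abs(d - ref_time))
            unused.remove(closest)
            true_positives += 1
    return true_positives / len(reference)


def mean_absolute_error(reference_rates: Sequence[float],
                        estimated_rates: Sequence[float]) -> float:
    """Mean absolute error between paired breathing rates (breaths/min)."""
    if len(reference_rates) != len(estimated_rates) or not reference_rates:
        raise ValueError("rate sequences must be non-empty and equally long")
    total = sum(abs(r - e) for r, e in zip(reference_rates, estimated_rates))
    return total / len(reference_rates)


# Toy data, invented for illustration only.
ref_events = [1.0, 4.2, 7.9, 11.5]   # reference breath onsets in seconds
det_events = [1.2, 4.0, 11.4, 14.0]  # detected breath onsets in seconds
sensitivity = breath_event_sensitivity(ref_events, det_events)
mae = mean_absolute_error([15.0, 18.0, 12.0], [16.0, 17.0, 12.5])
```

In this toy case three of the four reference events have a detection within 0.5 s, and the extra detection at 14.0 s does not affect sensitivity, since sensitivity ignores false positives.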

Original language: English
Pages (from-to): 4110-4114
Number of pages: 5
Journal: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume: 2019-September
Publication status: Published - 2019
Externally published: Yes
Event: 20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language - Graz, Austria
Duration: 15 Sept 2019 to 19 Sept 2019
Conference number: 20

Keywords

  • Breathing detection
  • Deep neural networks
  • Pathological speech
  • Respiratory diseases
  • Speech technology
