Conversation detection in ambient telephony

  • Aki Härmä*
  • Kien Pham
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding; Conference article in proceeding; Academic; peer-review

Abstract

In some speech communication applications, such as distributed hands-free telephony, it is important that the system can detect the conversational state of a call. Speech activity detection alone is not sufficient, because the captured signal may also contain conversation between two local people, or additional speech noise sources such as speech sounds from a radio or television. In this paper we compare known algorithms and introduce a new algorithm for the real-time detection of active conversation between an incoming caller and a local user. The method is based on the mutual information in speech activity, detection of back-channel speech activity, and statistics of overlapping speech. The proposed method gives over 90% accuracy within a one-minute observation period, which is a clear improvement over the performance of earlier techniques.
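
As a rough illustration of the first cue named in the abstract, the sketch below estimates the mutual information between two per-frame speech activity sequences and the fraction of overlapping speech. It is not the algorithm from the paper: the function names, the binary frame representation, and the example sequences are all assumptions made for illustration only.

```python
import math
from collections import Counter


def mutual_information(a, b):
    """Mutual information (in bits) between two equal-length binary sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    n = len(a)
    joint = Counter(zip(a, b))  # counts of each (a, b) frame pair
    pa = Counter(a)             # marginal counts for the first sequence
    pb = Counter(b)             # marginal counts for the second sequence
    mi = 0.0
    for (x, y), count in joint.items():
        pxy = count / n
        mi += pxy * math.log2(pxy / ((pa[x] / n) * (pb[y] / n)))
    return mi


def overlap_ratio(a, b):
    """Fraction of frames in which both sequences show speech activity."""
    return sum(1 for x, y in zip(a, b) if x and y) / len(a)


# Hypothetical per-frame activity flags (1 = speech detected, 0 = silence).
remote = [1, 1, 1, 0, 0, 0, 1, 1, 0, 0]        # incoming caller
local_talker = [0, 0, 0, 1, 1, 1, 0, 0, 1, 1]  # takes turns with the caller
local_radio = [1, 0, 1, 0, 1, 0, 1, 0, 1, 0]   # unrelated to the caller

print(mutual_information(remote, local_talker))
print(mutual_information(remote, local_radio))
print(overlap_ratio(remote, local_talker))
```

In this toy data the turn-taking talker shares one full bit of information with the caller and never overlaps, while the radio-like pattern shares almost none. A practical detector would need to estimate these statistics over a sliding window and combine them with other cues, such as the back-channel activity the abstract mentions.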

Original language: English
Title of host publication: 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings, ICASSP 2009
Pages: 4641-4644
Number of pages: 4
Publication status: Published - 2009
Externally published: Yes
Event: 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing - Taipei, Taiwan
Duration: 19 Apr 2009 to 24 Apr 2009

Publication series

Series: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN: 1520-6149

Conference

Conference: 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing
Abbreviated title: ICASSP 2009
Country/Territory: Taiwan
City: Taipei
Period: 19/04/09 to 24/04/09

Keywords

  • Ambient telephony
  • Conversation detection
  • Speakerphone
