Extraction of voice from the center of the stereo image

Aki Härmä*, Munhum Park

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingAcademicpeer-review

Abstract

Detection and extraction of the center vocal source is important for many audio format conversion and manipulation applications. First, we study some generic properties of stereo signals containing sources panned exactly to the center of the stereo image and propose an algorithm for the separation of a stereo audio signal into a center and side channels. Recently, Park et al. [Proc. 129th AES convention, London 2010, Preprint Paper 8071] presented the results of listening tests where the perceived widths of the stereo images were evaluated for synthetic signals. Given the center separation algorithm proposed in this paper, a similar experiment was carried out with realistic stereo audio contents. The results show that there are clear differences between the stimuli used in the two experiments, which are discussed in this paper based on the analysis of the test signals and their binaural characteristics in the listening test configuration.

Original languageEnglish
Title of host publication130th Audio Engineering Society Convention 2011
Pages1130-1139
Number of pages10
Publication statusPublished - 2011
Externally publishedYes
Event130th Audio Engineering Society Convention 2011 - London, United Kingdom
Duration: 13 May 201116 May 2011
Conference number: 130

Conference

Conference130th Audio Engineering Society Convention 2011
Country/TerritoryUnited Kingdom
CityLondon
Period13/05/1116/05/11

Fingerprint

Dive into the research topics of 'Extraction of voice from the center of the stereo image'. Together they form a unique fingerprint.

Cite this