Data-driven modeling of the spatial sound experience

Aki Härmä*, Munhum Park, Armin Kohlrausch

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingAcademicpeer-review

Abstract

Since the evaluation of audio systems or processing schemes is time-consuming and resource-expensive, alternative objective evaluation methods attracted considerable research interests. However, current perceptual models are not yet capable of replacing a human listener especially when the test stimulus is complex, for example, a sound scene consisting of time-varying multiple acoustic images. This paper describes a data-driven approach to develop a model to predict the subjective evaluation of complex acoustic scenes, where the extensive set of listening test results collected in the latest MPEG-H 3D audio initiative was used as training data. The results showed that a few selected outputs of various auditory models may be a useful set of features, where linear regression and multilayer perceptron models reasonably predicted the overall distribution of listening test scores, estimating both mean and variance.

Original languageEnglish
Title of host publication136th Audio Engineering Society Convention 2014
PublisherAudio Engineering Society
Pages123-130
Number of pages8
ISBN (Print)9781632665065
Publication statusPublished - 2014
Externally publishedYes
Event136th Audio Engineering Society Convention 2014 - Berlin, Germany
Duration: 26 Apr 201429 Apr 2014
Conference number: 136

Conference

Conference136th Audio Engineering Society Convention 2014
Country/TerritoryGermany
CityBerlin
Period26/04/1429/04/14

Fingerprint

Dive into the research topics of 'Data-driven modeling of the spatial sound experience'. Together they form a unique fingerprint.

Cite this