Abstract
Since the evaluation of audio systems or processing schemes is time-consuming and resource-expensive, alternative objective evaluation methods attracted considerable research interests. However, current perceptual models are not yet capable of replacing a human listener especially when the test stimulus is complex, for example, a sound scene consisting of time-varying multiple acoustic images. This paper describes a data-driven approach to develop a model to predict the subjective evaluation of complex acoustic scenes, where the extensive set of listening test results collected in the latest MPEG-H 3D audio initiative was used as training data. The results showed that a few selected outputs of various auditory models may be a useful set of features, where linear regression and multilayer perceptron models reasonably predicted the overall distribution of listening test scores, estimating both mean and variance.
Original language | English |
---|---|
Title of host publication | 136th Audio Engineering Society Convention 2014 |
Publisher | Audio Engineering Society |
Pages | 123-130 |
Number of pages | 8 |
ISBN (Print) | 9781632665065 |
Publication status | Published - 2014 |
Externally published | Yes |
Event | 136th Audio Engineering Society Convention 2014 - Berlin, Germany Duration: 26 Apr 2014 → 29 Apr 2014 Conference number: 136 |
Conference
Conference | 136th Audio Engineering Society Convention 2014 |
---|---|
Country/Territory | Germany |
City | Berlin |
Period | 26/04/14 → 29/04/14 |