Task-dependent decoding of speaker and vowel identity from auditory cortical response patterns

Research output: Contribution to journalArticleAcademicpeer-review

23 Citations (Scopus)

Abstract

Selective attention to relevant sound properties is essential for everyday listening situations. It enables the formation of different perceptual representations of the same acoustic input and is at the basis of flexible and goal-dependent behavior. Here, we investigated the role of the human auditory cortex in forming behavior-dependent representations of sounds. We used single-trial fMRI and analyzed cortical responses collected while subjects listened to the same speech sounds (vowels /a/, /i/, and /u/) spoken by different speakers (boy, girl, male) and performed a delayed-match-to-sample task on either speech sound or speaker identity. Univariate analyses showed a task-specific activation increase in the right superior temporal gyrus/sulcus (STG/STS) during speaker categorization and in the right posterior temporal cortex during vowel categorization. Beyond regional differences in activation levels, multivariate classification of single trial responses demonstrated that the success with which single speakers and vowels can be decoded from auditory cortical activation patterns depends on task demands and subject's behavioral performance. Speaker/vowel classification relied on distinct but overlapping regions across the (right) mid-anterior STG/STS (speakers) and bilateral mid-posterior STG/STS (vowels), as well as the superior temporal plane including Heschl's gyrus/sulcus. The task dependency of speaker/vowel classification demonstrates that the informative fMRI response patterns reflect the top-down enhancement of behaviorally relevant sound representations. Furthermore, our findings suggest that successful selection, processing, and retention of task-relevant sound properties relies on the joint encoding of information across early and higher-order regions of the auditory cortex.
Original languageEnglish
Pages (from-to)4548-4557
Number of pages10
JournalJournal of Neuroscience
Volume34
Issue number13
DOIs
Publication statusPublished - 26 Mar 2014

Keywords

  • auditory cortex
  • fMRI decoding
  • speech
  • voice
  • vowels
  • TEMPORAL-LOBE
  • LANGUAGE COMPREHENSION
  • RECEPTIVE-FIELDS
  • CHILDRENS VOICES
  • HUMAN BRAIN
  • SPEECH
  • CORTEX
  • PERCEPTION
  • ACTIVATION
  • SOUNDS

Cite this

@article{68fb5c702e3e468cb1a7d92667245f17,
title = "Task-dependent decoding of speaker and vowel identity from auditory cortical response patterns",
abstract = "Selective attention to relevant sound properties is essential for everyday listening situations. It enables the formation of different perceptual representations of the same acoustic input and is at the basis of flexible and goal-dependent behavior. Here, we investigated the role of the human auditory cortex in forming behavior-dependent representations of sounds. We used single-trial fMRI and analyzed cortical responses collected while subjects listened to the same speech sounds (vowels /a/, /i/, and /u/) spoken by different speakers (boy, girl, male) and performed a delayed-match-to-sample task on either speech sound or speaker identity. Univariate analyses showed a task-specific activation increase in the right superior temporal gyrus/sulcus (STG/STS) during speaker categorization and in the right posterior temporal cortex during vowel categorization. Beyond regional differences in activation levels, multivariate classification of single trial responses demonstrated that the success with which single speakers and vowels can be decoded from auditory cortical activation patterns depends on task demands and subject's behavioral performance. Speaker/vowel classification relied on distinct but overlapping regions across the (right) mid-anterior STG/STS (speakers) and bilateral mid-posterior STG/STS (vowels), as well as the superior temporal plane including Heschl's gyrus/sulcus. The task dependency of speaker/vowel classification demonstrates that the informative fMRI response patterns reflect the top-down enhancement of behaviorally relevant sound representations. Furthermore, our findings suggest that successful selection, processing, and retention of task-relevant sound properties relies on the joint encoding of information across early and higher-order regions of the auditory cortex.",
keywords = "auditory cortex, fMRI decoding, speech, voice, vowels, TEMPORAL-LOBE, LANGUAGE COMPREHENSION, RECEPTIVE-FIELDS, CHILDRENS VOICES, HUMAN BRAIN, SPEECH, CORTEX, PERCEPTION, ACTIVATION, SOUNDS",
author = "M. Bonte and L. Hausfeld and W. Scharke and G. Valente and E. Formisano",
year = "2014",
month = "3",
day = "26",
doi = "10.1523/JNEUROSCI.4339-13.2014",
language = "English",
volume = "34",
pages = "4548--4557",
journal = "Journal of Neuroscience",
issn = "0270-6474",
publisher = "Society for Neuroscience",
number = "13",

}

Task-dependent decoding of speaker and vowel identity from auditory cortical response patterns. / Bonte, M.; Hausfeld, L.; Scharke, W.; Valente, G.; Formisano, E.

In: Journal of Neuroscience, Vol. 34, No. 13, 26.03.2014, p. 4548-4557.

Research output: Contribution to journalArticleAcademicpeer-review

TY - JOUR

T1 - Task-dependent decoding of speaker and vowel identity from auditory cortical response patterns

AU - Bonte, M.

AU - Hausfeld, L.

AU - Scharke, W.

AU - Valente, G.

AU - Formisano, E.

PY - 2014/3/26

Y1 - 2014/3/26

N2 - Selective attention to relevant sound properties is essential for everyday listening situations. It enables the formation of different perceptual representations of the same acoustic input and is at the basis of flexible and goal-dependent behavior. Here, we investigated the role of the human auditory cortex in forming behavior-dependent representations of sounds. We used single-trial fMRI and analyzed cortical responses collected while subjects listened to the same speech sounds (vowels /a/, /i/, and /u/) spoken by different speakers (boy, girl, male) and performed a delayed-match-to-sample task on either speech sound or speaker identity. Univariate analyses showed a task-specific activation increase in the right superior temporal gyrus/sulcus (STG/STS) during speaker categorization and in the right posterior temporal cortex during vowel categorization. Beyond regional differences in activation levels, multivariate classification of single trial responses demonstrated that the success with which single speakers and vowels can be decoded from auditory cortical activation patterns depends on task demands and subject's behavioral performance. Speaker/vowel classification relied on distinct but overlapping regions across the (right) mid-anterior STG/STS (speakers) and bilateral mid-posterior STG/STS (vowels), as well as the superior temporal plane including Heschl's gyrus/sulcus. The task dependency of speaker/vowel classification demonstrates that the informative fMRI response patterns reflect the top-down enhancement of behaviorally relevant sound representations. Furthermore, our findings suggest that successful selection, processing, and retention of task-relevant sound properties relies on the joint encoding of information across early and higher-order regions of the auditory cortex.

AB - Selective attention to relevant sound properties is essential for everyday listening situations. It enables the formation of different perceptual representations of the same acoustic input and is at the basis of flexible and goal-dependent behavior. Here, we investigated the role of the human auditory cortex in forming behavior-dependent representations of sounds. We used single-trial fMRI and analyzed cortical responses collected while subjects listened to the same speech sounds (vowels /a/, /i/, and /u/) spoken by different speakers (boy, girl, male) and performed a delayed-match-to-sample task on either speech sound or speaker identity. Univariate analyses showed a task-specific activation increase in the right superior temporal gyrus/sulcus (STG/STS) during speaker categorization and in the right posterior temporal cortex during vowel categorization. Beyond regional differences in activation levels, multivariate classification of single trial responses demonstrated that the success with which single speakers and vowels can be decoded from auditory cortical activation patterns depends on task demands and subject's behavioral performance. Speaker/vowel classification relied on distinct but overlapping regions across the (right) mid-anterior STG/STS (speakers) and bilateral mid-posterior STG/STS (vowels), as well as the superior temporal plane including Heschl's gyrus/sulcus. The task dependency of speaker/vowel classification demonstrates that the informative fMRI response patterns reflect the top-down enhancement of behaviorally relevant sound representations. Furthermore, our findings suggest that successful selection, processing, and retention of task-relevant sound properties relies on the joint encoding of information across early and higher-order regions of the auditory cortex.

KW - auditory cortex

KW - fMRI decoding

KW - speech

KW - voice

KW - vowels

KW - TEMPORAL-LOBE

KW - LANGUAGE COMPREHENSION

KW - RECEPTIVE-FIELDS

KW - CHILDRENS VOICES

KW - HUMAN BRAIN

KW - SPEECH

KW - CORTEX

KW - PERCEPTION

KW - ACTIVATION

KW - SOUNDS

U2 - 10.1523/JNEUROSCI.4339-13.2014

DO - 10.1523/JNEUROSCI.4339-13.2014

M3 - Article

VL - 34

SP - 4548

EP - 4557

JO - Journal of Neuroscience

JF - Journal of Neuroscience

SN - 0270-6474

IS - 13

ER -