Towards Closed-Loop Speech Synthesis From Stereotactic EEG: A Unit Selection Approach

M. Angrick*, M. Ottenhoff, L. Diener, D. Ivucic, G. Ivucic, S. Goulis, A.J. Colon, L. Wagner, D.J. Krusienski, P.L. Kubben, T. Schultz, C. Herff

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding; Conference article in proceeding; Academic; peer-review

Abstract

Neurological disorders can severely impact speech communication. Recently, neural speech prostheses have been proposed that reconstruct intelligible speech from neural signals recorded superficially on the cortex. Thus far, it has been unclear whether similar reconstruction is feasible from deeper brain structures, and whether audible speech can be synthesized directly from these reconstructions with the low latency required for a practical speech neuroprosthetic. The present study aims to address both challenges. First, we implement a low-latency, unit-selection-based synthesizer that converts neural signals into audible speech. Second, we evaluate our approach on open-loop recordings from 5 patients implanted with stereotactic depth electrodes who performed a read-aloud task of Dutch utterances. We achieve correlation coefficients of up to 0.6, significantly higher than chance level, and an average computational cost of 6.6 ms for each 10 ms frame. While the current reconstructed utterances are not intelligible, our results indicate promising decoding and run-time capabilities that are suitable for investigations of speech processes in closed-loop experiments.
Original language: English
Title of host publication: 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Publisher: IEEE
Pages: 1296-1300
Number of pages: 5
ISBN (Print): 9781665405409
Publication status: Published - 2022
Event: 47th IEEE International Conference on Acoustics, Speech and Signal Processing - Online, Singapore, Singapore
Duration: 22 May 2022 - 27 May 2022
Conference number: 47
https://2022.ieeeicassp.org/

Publication series

Series: International Conference on Acoustics Speech and Signal Processing Proceedings
ISSN: 1520-6149

Conference

Conference: 47th IEEE International Conference on Acoustics, Speech and Signal Processing
Abbreviated title: ICASSP 2022
Country/Territory: Singapore
City: Singapore
Period: 22/05/22 - 27/05/22

Keywords

  • neuroprosthesis
  • speech synthesis
  • stereotactic EEG
  • low-latency processing of neural signals
  • SPOKEN
  • COMMUNICATION
