Ted Talk Teaser Generation With Pre-Trained Models

G. Vico; J. Niehues

doi:10.1109/icassp43922.2022.9746700

Ted Talk Teaser Generation With Pre-Trained Models

G. Vico^*, J. Niehues

^*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review

Abstract

While we have seen significant advances in automatic summarization for text, research on speech summarization is still limited. In this work, we address the challenge of automatically generating teasers for TED talks. In the first step, we create a corpus for automatic summarization of TED and TEDx talks consisting of the talks' recording, their transcripts and their descriptions. The corpus is used to build a speech summarization system for the task. We adapt and combine pre-trained models for automatic speech recognition (ASR) and text summarization using the collected data. This initial work shows that is more important to adapt the summarization model to the ASR transcripts than to adapt the ASR model to the talks.

Original language	English
Title of host publication	ICASSP 2022 - 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)
Publisher	IEEE
Pages	8067-8071
Number of pages	5
ISBN (Print)	9781665405409
DOIs	https://doi.org/10.1109/icassp43922.2022.9746700
Publication status	Published - 2022
Event	47th IEEE International Conference on Acoustics, Speech and Signal Processing - Online, Singapore, Singapore Duration: 22 May 2022 → 27 May 2022 Conference number: 47 https://2022.ieeeicassp.org/

Publication series

Series	International Conference on Acoustics Speech and Signal Processing Proceedings
ISSN	1520-6149

Conference

Conference	47th IEEE International Conference on Acoustics, Speech and Signal Processing
Abbreviated title	ICASSP 2022
Country/Territory	Singapore
City	Singapore
Period	22/05/22 → 27/05/22
Internet address	https://2022.ieeeicassp.org/

Keywords

speech summarization
automatic speech recognition
abstractive summarization

Access to Document

10.1109/icassp43922.2022.9746700

Cite this

@inproceedings{070735d1bef64848830621bda9e7ae44,

title = "Ted Talk Teaser Generation With Pre-Trained Models",

abstract = "While we have seen significant advances in automatic summarization for text, research on speech summarization is still limited. In this work, we address the challenge of automatically generating teasers for TED talks. In the first step, we create a corpus for automatic summarization of TED and TEDx talks consisting of the talks' recording, their transcripts and their descriptions. The corpus is used to build a speech summarization system for the task. We adapt and combine pre-trained models for automatic speech recognition (ASR) and text summarization using the collected data. This initial work shows that is more important to adapt the summarization model to the ASR transcripts than to adapt the ASR model to the talks.",

keywords = "speech summarization, automatic speech recognition, abstractive summarization",

author = "G. Vico and J. Niehues",

year = "2022",

doi = "10.1109/icassp43922.2022.9746700",

language = "English",

isbn = "9781665405409",

series = "International Conference on Acoustics Speech and Signal Processing Proceedings",

publisher = "IEEE",

pages = "8067--8071",

booktitle = "ICASSP 2022 - 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)",

address = "United States",

note = "47th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022 ; Conference date: 22-05-2022 Through 27-05-2022",

url = "https://2022.ieeeicassp.org/",

}

Vico, G & Niehues, J 2022, Ted Talk Teaser Generation With Pre-Trained Models. in ICASSP 2022 - 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP). IEEE, International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 8067-8071, 47th IEEE International Conference on Acoustics, Speech and Signal Processing, Singapore, Singapore, 22/05/22. https://doi.org/10.1109/icassp43922.2022.9746700

Ted Talk Teaser Generation With Pre-Trained Models. / Vico, G.; Niehues, J.
ICASSP 2022 - 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP). IEEE, 2022. p. 8067-8071 (International Conference on Acoustics Speech and Signal Processing Proceedings).

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review

TY - GEN

T1 - Ted Talk Teaser Generation With Pre-Trained Models

AU - Vico, G.

AU - Niehues, J.

N1 - Conference code: 47

PY - 2022

Y1 - 2022

N2 - While we have seen significant advances in automatic summarization for text, research on speech summarization is still limited. In this work, we address the challenge of automatically generating teasers for TED talks. In the first step, we create a corpus for automatic summarization of TED and TEDx talks consisting of the talks' recording, their transcripts and their descriptions. The corpus is used to build a speech summarization system for the task. We adapt and combine pre-trained models for automatic speech recognition (ASR) and text summarization using the collected data. This initial work shows that is more important to adapt the summarization model to the ASR transcripts than to adapt the ASR model to the talks.

AB - While we have seen significant advances in automatic summarization for text, research on speech summarization is still limited. In this work, we address the challenge of automatically generating teasers for TED talks. In the first step, we create a corpus for automatic summarization of TED and TEDx talks consisting of the talks' recording, their transcripts and their descriptions. The corpus is used to build a speech summarization system for the task. We adapt and combine pre-trained models for automatic speech recognition (ASR) and text summarization using the collected data. This initial work shows that is more important to adapt the summarization model to the ASR transcripts than to adapt the ASR model to the talks.

KW - speech summarization

KW - automatic speech recognition

KW - abstractive summarization

U2 - 10.1109/icassp43922.2022.9746700

DO - 10.1109/icassp43922.2022.9746700

M3 - Conference article in proceeding

SN - 9781665405409

T3 - International Conference on Acoustics Speech and Signal Processing Proceedings

SP - 8067

EP - 8071

BT - ICASSP 2022 - 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)

PB - IEEE

T2 - 47th IEEE International Conference on Acoustics, Speech and Signal Processing

Y2 - 22 May 2022 through 27 May 2022

ER -