Toward Real Time Word Based Prosody Recognition

Alex Tilson, Frank Foerster

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Downloads (Pure)

Abstract

Prosodic salience is a heuristic based on word-level prosody in child-directed speech that is thought to serve as a cue for attentional focus. It has been used in the context of robotic language acquisition to extract the contextually most relevant words from a human tutor’s speech to ground them in a robot’s sensorimotor data. However, the pipeline for performing word-based prosody-recognition operated in a semi-automatic manner and required substantial manual effort. We describe our efforts to automate the existing pipeline by including real time prosody recognition, and a modern speech recognition and forced alignment model. The intention is to enable its use in real time for human-in-the-loop robotic language acquisition and other socially driven forms of online learning.
Original languageEnglish
Title of host publicationProceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning
EditorsAmy Qiu, Bill Noble, David Pagmar, Vladislav Maraev, Nikolai Ilinykh
Place of PublicationGothenburg, Sweden
PublisherAssociation for Computational Linguistics
Chapter9
Pages62-67
Number of pages6
Volume3
ISBN (Print)979-8-89176-163-6
Publication statusPublished - 30 Dec 2024
Event2024 CLASP Conference on Multimodality and Interaction in Language Learning - Gothenburg, Sweden
Duration: 14 Oct 202415 Oct 2024

Publication series

NameCLASP Conference Proceedings
PublisherAssociation for Computational Linguistics (ACL)
Volume3
ISSN (Print)2002-9764

Conference

Conference2024 CLASP Conference on Multimodality and Interaction in Language Learning
Abbreviated titleMILLing 2024
Country/TerritorySweden
CityGothenburg
Period14/10/2415/10/24

Fingerprint

Dive into the research topics of 'Toward Real Time Word Based Prosody Recognition'. Together they form a unique fingerprint.

Cite this