An effective unsupervised scheme for multiple-speaker-change detection

P. Sivakumaran, A. M. Ariyaeeinia, J. Fortuna

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

This paper presents an enhanced Bayesian information criterion (BIC)-based algorithm for multiple-speaker-change detection (MSCD) without prior acoustic information on speakers. The enhancement offered by the proposed approach is in terms of effectiveness. This is achieved through the introduction of robustness into the standard BIC procedure, against certain important causes of misclassification. The paper also introduces a new measure, termed effective error rate (EFER), for evaluating the relative performance of MSCD algorithms. It is shown that the proposed measure allows a more meaningful evaluation of MSCD than the conventional ones. The experimental results obtained using this new evaluation measure clearly confirm the effectiveness of the proposed algorithm. The experimental investigation is based on 3 hours of broadcast news material with 445 speaker changes.

Original languageEnglish
Title of host publicationProc. 7th International Conference on Spoken Language Processing (ICSLP 2002)
Pages569-572
Number of pages4
Publication statusPublished - 2002
Event7th International Conference on Spoken Language Processing, ICSLP 2002 - Denver, United States
Duration: 16 Sept 200220 Sept 2002

Conference

Conference7th International Conference on Spoken Language Processing, ICSLP 2002
Country/TerritoryUnited States
CityDenver
Period16/09/0220/09/02

Fingerprint

Dive into the research topics of 'An effective unsupervised scheme for multiple-speaker-change detection'. Together they form a unique fingerprint.

Cite this