This paper presents an enhanced Bayesian information criterion (BIC)-based algorithm for multiple-speaker-change detection (MSCD) that requires no prior acoustic information about the speakers. The proposed approach improves detection effectiveness by making the standard BIC procedure robust against certain important causes of misclassification. The paper also introduces a new measure, termed the effective error rate (EFER), for evaluating the relative performance of MSCD algorithms, and shows that it allows a more meaningful evaluation of MSCD than conventional measures. Experimental results obtained with this measure confirm the effectiveness of the proposed algorithm. The experimental investigation is based on 3 hours of broadcast news material containing 445 speaker changes.
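The abstract does not describe the enhanced procedure itself, so the following is only a minimal sketch of the standard BIC change test that such algorithms typically build on. It assumes each segment is modelled by a single full-covariance Gaussian over feature frames. The names `delta_bic` and `best_change_point`, and the parameters `penalty_weight` and `margin`, are illustrative assumptions and do not come from the paper.

```python
import numpy as np


def _logdet_cov(frames):
    """Log-determinant of the maximum-likelihood covariance of the frames."""
    cov = np.atleast_2d(np.cov(frames, rowvar=False, bias=True))
    _, logdet = np.linalg.slogdet(cov)
    return logdet


def delta_bic(frames, t, penalty_weight=1.0):
    """Standard delta-BIC for splitting `frames` (N x d) at index t.

    A positive value favours two Gaussians (a speaker change at t)
    over one Gaussian for the whole window. Assumed formulation, not
    the enhanced algorithm of the paper.
    """
    n, d = frames.shape
    n1, n2 = t, n - t
    # Gain in log-likelihood from modelling the two halves separately.
    gain = 0.5 * (
        n * _logdet_cov(frames)
        - n1 * _logdet_cov(frames[:t])
        - n2 * _logdet_cov(frames[t:])
    )
    # Extra free parameters of the second Gaussian: mean plus covariance.
    n_params = d + d * (d + 1) / 2
    penalty = 0.5 * penalty_weight * n_params * np.log(n)
    return gain - penalty


def best_change_point(frames, margin=20, penalty_weight=1.0):
    """Scan candidate split points and return (index, delta-BIC) of the best.

    `margin` keeps enough frames on each side for a stable covariance
    estimate; it should exceed the feature dimension.
    """
    n = frames.shape[0]
    candidates = range(margin, n - margin)
    scores = [delta_bic(frames, t, penalty_weight) for t in candidates]
    best = int(np.argmax(scores))
    return candidates[best], scores[best]
```

In use, a window would be declared to contain a speaker change only when the best score is positive; detecting multiple changes then requires sliding or growing the window over the recording, a step that this sketch leaves out.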
|Title of host publication||Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002)|
|Number of pages||4|
|Publication status||Published - 2002|
|Event||7th International Conference on Spoken Language Processing, ICSLP 2002 - Denver, United States; Duration: 16 Sept 2002 → 20 Sept 2002|