Discrimination Effectiveness of Speech Cepstral Features

Amit S Malegaonkar, Aladdin Ariyaeeinia, Perasiriyan Sivakumaran, Surosh Pillay

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

In this work, the discrimination capabilities of speech cepstra for text and speaker related information are investigated. For this purpose, Bhattacharya distance metric is used as the measure of discrimination. The scope of the study covers static and dynamic cepstra derived using the linear prediction analysis (LPCC) as well as mel-frequency analysis (MFCC). The investigations also include the assessment of the linear prediction-based mel-frequency cepstral coefficients (LP-MFCC) as an alternative speech feature type. It is shown experimentally that whilst contaminations in speech unfavourably affect the performance of all types of cepstra, the effects are more severe in the case of MFCC. Furthermore, it is shown that with a combination of static and dynamic features, LP-based mel-frequency cepstra (LP-MFCC) exhibit the best discrimination capabilities in almost all experimental cases.
Original languageEnglish
Title of host publicationBiometrics and Identity Management, BIOID 2008
PublisherSpringer Nature Link
Pages91-99
Number of pages9
Volume5372
ISBN (Electronic)978-3-540-89991-4
ISBN (Print)978-3-540-89990-7
Publication statusPublished - May 2008

Publication series

NameLecture Notes in Computer Science
Volume5372

Fingerprint

Dive into the research topics of 'Discrimination Effectiveness of Speech Cepstral Features'. Together they form a unique fingerprint.

Cite this