Abstract
Speaker recognition in applications of our daily lives is not yet in widespread use. In order for biometric technology to make sense for real-world authentication applications and be accepted by end users, convenience of use, robustness and accuracy of such a system are equally important. This paper defines these requirements for pass phrase based voice authentication embedded within a multi modal biometric system and describes methods and algorithms developed and optimized for the demands of such an application. Classification is based on dynamic time warping which can cope with limited training data. MFCC features which have been optimized for speaker specific properties are used. Robustness of the system is increased with speech enhancement and cepstral mean subtraction. Furthermore, vector quantization with speaker specific codebooks is applied in order to decrease storage requirements for the biometric template. On an appropriate data base, a verification EER of 2.7% is achieved with limited training and test material.
Original language | English |
---|---|
Title of host publication | Procs 1st Conf on Biometrics and Electronic Signatures of the GI Working Group |
Editors | Arslan Bromme, Christoph Busch |
Publisher | GI - Gesellschaft fur Informatik |
Pages | 71-80 |
ISBN (Print) | 3-88579-360-1 |
Publication status | Published - 2003 |
Event | BIOSIG - Darmstadt, Germany Duration: 24 Jul 2003 → … |
Conference
Conference | BIOSIG |
---|---|
Country/Territory | Germany |
City | Darmstadt |
Period | 24/07/03 → … |
Keywords
- Biometrics
- speaker verification technique
- dynamic time warping
- speech enhancement
- vector quantization