Najim Dehak

The role of speaker factors in the NIST extended data task

By Patrick Kenny, Najim Dehak, Réda Dehak, Vishwa Gupta, Pierre Dumouchel

2007-09-25

In Proceedings of the speaker and language recognition workshop (IEEE-odyssey 2008)

Abstract

We tested factor analysis models having various numbers of speaker factors on the core condition and the extended data condition of the 2006 NIST speaker recognition evaluation. In order to ensure strict disjointness between training and test sets, the factor analysis models were trained without using any of the data made available for the 2005 evaluation. The factor analysis training set consisted primarily of Switchboard data and so was to some degree mismatched with the 2006 test data (drawn from the Mixer collection). Consequently, our initial results were not as good as those submitted for the 2006 evaluation. However we found that we could compensate for this by a simple modification to our score normalization strategy, namely by using 1000 z-norm utterances in zt-norm. Our purpose in varying the number of speaker factors was to evaluate the eigenvoiceMAP and classicalMAP components of the inter-speaker variability model in factor analysis. We found that on the core condition (i.e. 2–3 minutes of enrollment data), only the eigenvoice MAP component plays a useful role. On the other hand, on the extended data condition (i.e. 15–20 minutes of enrollment data) both the classical MAP component and the eigenvoice component proved to be useful provided that the number of speaker factors was limited. Our best result on the extended data condition (all trials) was an equal error rate of 2.2% and a detection cost of 0.011.

Continue reading

Linear and non linear kernel GMM SuperVector machines for speaker verification

By Réda Dehak, Najim Dehak, Patrick Kenny, Pierre Dumouchel

2007-08-27

In Proceedings of the european conference on speech communication and technologies (interspeech’07)

Abstract

This paper presents a comparison between Support Vector Machines (SVM) speaker verification systems based on linear and non linear kernels defined in GMM supervector space. We describe how these kernel functions are related and we show how the nuisance attribute projection (NAP) technique can be used with both of these kernels to deal with the session variability problem. We demonstrate the importance of GMM model normalization (M-Norm) especially for the non linear kernel. All our experiments were performed on the core condition of NIST 2006 speaker recognition evaluation (all trials). Our best results (an equal error rate of 6.3%) were obtained using NAP and GMM model normalization with the non linear kernel.

Continue reading

LRDE system description

By Réda Dehak, Charles-Alban Deledalle, Najim Dehak

2006-06-01

In NIST SRE’06 workshop: Speaker recognition evaluation campaign

Abstract

Continue reading