Structural Gaussian mixture models for efficient text-independent speaker verification

Published 2002 in Interspeech

ABSTRACT

Structural Gaussian mixture models (SGMMs) are proposed for efﬁcient text-independent speaker veriﬁcation. A structural background model (SBM) is constructed ﬁrst by hierarchically clustering all Gaussian mixture components in a universal background model (UBM). In this way the acoustic space is partitioned into multiple regions in different levels of resolution. For each target speaker, a SGMM can be generated through multi-level maximum a posteriori (MAP) adaptation from the SBM. During test, only a small subset of Gaussian mixture components is scored for each feature vector in order to reduce the computational cost signiﬁ-cantly. Furthermore, the scores obtained in different layers of the tree-structured models are combined via a neural network for ﬁ-nal decision. Different conﬁgurations are compared in the experiments conducted on the telephony speech data used in the NIST speaker veriﬁcation evaluation. The experimental results show that computational reduction by a factor of 17 can be achieved with equal error rate (EER) reduced by (cid:0)(cid:1) compared with the baseline. The SGMM-SBM also shows some advantages over the recently proposed hash GMM.

PUBLICATION RECORD

Publication year
2002
Venue
Interspeech
Publication date
2002-09-16
Fields of study
Computer Science
Identifiers
DOI 10.21437/ICSLP.2002-402
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Short-time Gaussianization for robust speaker verification
2002cited by this paper
Speaker verification using target and background dependent linear transforms and multi-system fusion
2001cited by this paper
A structural Bayes approach to speaker adaptation
2001influential reference
Feature warping for robust speaker verification
2001cited by this paper
Gaussian selection applied to text-independent speaker verification
2001cited by this paper
Transformation enhanced multi-grained modeling for text-independent speaker recognition
2000cited by this paper
Speaker Verification Using Adapted Gaussian Mixture Models
2000cited by this paper
A study of computation speed-UPS of the GMM-UBM speaker recognition system
1999cited by this paper
Vector quantization for the efficient computation of continuous density likelihoods
1993cited by this paper
Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper
1977cited by this paper

CITED BY

Decisión threshold estimation and model quality evaluation techniques for speaker verification.
2005cites this paper
New MAP estimators for speaker recognition
2003cites this paper