Phonetic speaker recognition

Published 2001 in Interspeech

ABSTRACT

This paper introduces a novel language-independent speaker-recognition system based on differences in dynamic realization of phonetic features (i.e., pronunciation) between speakers rather than spectral differences in voice quality. The system exploits phonetic information from six languages to perform text independent speaker recognition. All experiments were performed on the NIST 2001 Speaker Recognition Evaluation Extended Data Task. Recognition results are provided for unigram, bigram, and trigram models. Performance for each of the three models is examined for phones from each individual language and the final multilanguage fused system. Additional fusion experiments demonstrate that speaker recognition capability is maintained even without phonetic information in the language of the speaker.

PUBLICATION RECORD

Publication year
2001
Venue
Interspeech
Publication date
2001-09-03
Fields of study
Linguistics, Computer Science
Identifiers
DOI 10.21437/Eurospeech.2001-416
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Gender-dependent phonetic refraction for speaker recognition
2002cited by this paper
Phonetic, idiolectal and acoustic speaker recognition
2001cited by this paper
Some Experiments on Idiolectal Differences among Speakers 1 Motivation
2000cited by this paper
Speaker Verification Using Adapted Gaussian Mixture Models
2000cited by this paper
Speaker Recognition on Single- and Multispeaker Data
2000cited by this paper

CITED BY

Building Automatic Speech Recognition Systems for Moroccan Dialect: A Phoneme-Based Approach
2024cites this paper
Computing with Hypervectors for Efficient Speaker Identification
2022cites this paper
Comparative Study of different types of RNN in Speech Classification
2021cites this paper
Intra-Speaker Variability Assessment for Speaker Recognition in Degraded Conditions: A Case of African Tone Languages
2018cites this paper
A Long Short-term Memory Neural Network for Improved Twins' Voice Differentiation
2018cites this paper
Distant Speaker Recognition: An Overview
2016cites this paper
Die Rolle phonetischer Information in der Sprechererkennung
2016influential citation
Improving Speaker Recognition by Biometric Voice Deconstruction
2015cites this paper
Application of Proposed Phoneme Segmentation Technique for Speaker Identification
2014cites this paper
Speaker Detection Using Phoneme Specific Hidden Markov Models
2014cites this paper
Physiologically-Motivated Feature Extraction Methods for Speaker Recognition
2013cites this paper
Automatic Speaker Recognition System
2013cites this paper
Implementation of DJ Rule Based Algorithm for Dhuni- Vishleshan of Compound Punjabi Words
2013cites this paper
Keyword-conditioned phone N-gram modeling with contextual information for speaker verification
2012cites this paper
Short Utterance Speaker Recognition
2012cites this paper
Short Utterance Speaker Recognition A research Agenda
2012cites this paper
Syllable category based short utterance speaker recognition
2012cites this paper
Speakers in Automatic Speaker Recognition
2011cites this paper
Finding Difficult Speakers in Automatic Speaker Recognition
2011cites this paper
A Universal Phoneme-Set Based Language Independent Short Utterance Speaker Recognition
2011cites this paper
An overview of text-independent speaker recognition: From features to supervectors
2010cites this paper
Prosody in Automatic Speaker Recognition: Applications in Biometrics and Voice Imitation
2010cites this paper
Structured Approaches to Data Selection for Speaker Recognition
2010cites this paper
The case for automatic higher-level features in forensic speaker recognition
2008cites this paper
Speaker Vector-Based Speaker Recognition with Phonetic Modeling
2008cites this paper
Fusing prosodic and acoustic information for speaker recognition
2008cites this paper
Comparisons of recent speaker recognition approaches based on word-conditioning
2008cites this paper
Speaker Recognition Via Nonlinear Phonetic- and Speaker-Discriminative Features
2007cites this paper
Speech Recognition as Feature Extraction for Speaker Recognition
2007cites this paper
A Syllable Lattice Approach to Speaker Verification
2007cites this paper
Language Normalization for Bilingual Speaker Recognition Systems
2007cites this paper
Applications of Keyword-Constraining in Speaker Recognition
2007cites this paper
Far-Field Speaker Recognition
2007cites this paper
Speaker cluster based GMM tokenization for speaker recognition
2006cites this paper
Syllable Lattice Based Re-Scoring For Speaker Verification
2006cites this paper
Far-Field Speaker Recognition
2006cites this paper
Text-Independent Speaker Verification: State of the Art and Challenges
2005cites this paper
The NIST speaker recognition evaluation program
2005cites this paper
Improving Speaker Verification Using ALISP-Based Specific GMMs
2005cites this paper
Improved phonetic speaker recognition using lattice decoding
2005influential citation
Decisión threshold estimation and model quality evaluation techniques for speaker verification.
2005cites this paper
Exploiting High-Level Information Provided by ALISP in Speaker Recognition
2005cites this paper
Non-acoustic speaker recognition
2004cites this paper
Multi-stream language identification using data-driven dependency selection
2003cites this paper
Speaker verification using Gaussian component strings in dynamic trajectory space
2002cites this paper
ASR dependent techniques for speaker recognition
2002cites this paper
Gender-dependent phonetic refraction for speaker recognition
2002cites this paper
ASR dependent techniques for speaker identification
2002cites this paper
Phonetic, idiolectal and acoustic speaker recognition
2001cites this paper
Phonetic Refraction for Speaker Recognition
2001cites this paper
Automatic Speaker Recognition: Current Approaches and Future Trends •1
2001cites this paper
Phonetic speaker recognition
2001cites this paper