Fuzzy restricted Boltzmann machine based probabilistic linear discriminant analysis for noise-robust text-dependent speaker verification on short utterances

Sung Hyun Yoon, Min Sung Koh, Ha Jin Yu

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

In the i-vector-based speaker verification system, it is important to compensate for session variability on the i-vector to improve speaker verification performance. Linear discriminant analysis (LDA) is widely used to compensate for session variability by reducing the dimensionality of the i-vector. Restricted Boltzmann machine (RBM)-based probabilistic linear discriminant analysis (PLDA) has been proposed to improve the session variability compensation ability of LDA. It can be viewed as a probabilistic approach of LDA using RBM. However, since the RBM does not consider uncertainties in obtaining the parameters, the representation capability of RBM-based PLDA is limited. For instance, many real-world speaker verifications must consider noisy environments, which make the compensated session variability uncertain. The fuzzy restricted Boltzmann machine (FRBM) was proposed to improve the capability of the RBM. It showed a more robust performance than that of the RBM. Hence, in this paper, we propose FRBM-based PLDA to improve the representation capability of RBM-PLDA by replacing all the parameters of RBM-PLDA with fuzzy numbers. An evaluation with Part 1 of Robust Speaker Recognition (RSR) 2015 was conducted. In the experimental results, the proposed algorithm shows a better compensation for phonetic variability that exists in short utterances, and a robust speaker verification performance in diverse noisy environments where phonetic and noise variabilities are challenging issues in real-world applications.

Original languageEnglish
Pages (from-to)468-480
Number of pages13
JournalIAENG International Journal of Computer Science
Volume47
Issue number3
StatePublished - 2020

Keywords

  • Discriminant analysis
  • Fuzzy restricted boltzmann machine
  • I-vector
  • Restricted boltzmann machine
  • Speaker verification

Fingerprint

Dive into the research topics of 'Fuzzy restricted Boltzmann machine based probabilistic linear discriminant analysis for noise-robust text-dependent speaker verification on short utterances'. Together they form a unique fingerprint.

Cite this