Utilization of age information for speaker verification using multi-task learning deep neural networks

Ju ho Kim, Hee Soo Heo, Jee weon Jung, Hye jin Shim, Seung Bin Kim, Ha Jin Yu

Research output: Contribution to journal › Article › peer-review

Abstract

Similarity in tone between speakers can degrade the performance of speaker verification. To improve speaker verification systems, we propose a multi-task learning technique in which a deep neural network learns both speaker information and age information. Multi-task learning can improve generalization because it helps keep the hidden layers of a deep neural network from overfitting to a single task. However, our experiments showed that the age information was not learned well while the deep neural network was being trained. To improve this, we propose a method that dynamically changes the objective function weights of speaker identification and age estimation during training. On the RSR2015 evaluation data set, the equal error rate was 6.91 % for the speaker verification system without age information, 6.77 % with age information alone, and 4.73 % with age information when the weight change technique was applied.
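The idea of changing the objective weights during training can be illustrated with a minimal sketch. The abstract does not state the schedule, the loss functions, or any weight values, so everything below is an assumption made for illustration: age estimation is treated as regression with a squared error, the age weight follows a hypothetical linear schedule (`age_weight`, with made-up `start` and `end` values), and the speaker weight is taken as its complement.

```python
import math


def cross_entropy(logits, target_index):
    """Softmax cross-entropy for one example (speaker identification)."""
    peak = max(logits)  # subtract the maximum for numerical stability
    log_sum_exp = peak + math.log(sum(math.exp(v - peak) for v in logits))
    return log_sum_exp - logits[target_index]


def squared_error(predicted_age, true_age):
    """Squared error for one example (age as regression: an assumption)."""
    return (predicted_age - true_age) ** 2


def age_weight(epoch, total_epochs, start=0.5, end=0.1):
    """Hypothetical linear schedule for the age-task weight (not from the paper)."""
    if total_epochs <= 1:
        return end
    fraction = min(max(epoch / (total_epochs - 1), 0.0), 1.0)
    return start + (end - start) * fraction


def multitask_loss(speaker_logits, speaker_id, predicted_age, true_age,
                   epoch, total_epochs):
    """Weighted sum of both objectives; the weights depend on the epoch."""
    w_age = age_weight(epoch, total_epochs)
    w_speaker = 1.0 - w_age  # assumed: the two weights sum to one
    speaker_loss = cross_entropy(speaker_logits, speaker_id)
    age_loss = squared_error(predicted_age, true_age)
    return w_speaker * speaker_loss + w_age * age_loss
```

In a real training loop the two losses would come from two output heads that share hidden layers, and `multitask_loss` would be evaluated once per mini-batch with the current epoch index.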

Original language: English
Pages (from-to): 593-600
Number of pages: 8
Journal: Journal of the Acoustical Society of Korea
Volume: 38
Issue number: 5
State: Published - 2019

Keywords

  • Age estimation
  • Deep neural network
  • Multi-task learning
  • Speaker verification
