Multiple Points Input for Convolutional Neural Networks in Replay Attack Detection

Sung Hyun Yoon, Ha Jin Yu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

13 Scopus citations

Abstract

Models based on convolutional neural networks (CNNs) have shown remarkable performance in spoofing detection for automatic speaker verification. To feed data into CNN-based models in mini-batches, all inputs in a mini-batch must have the same shape. Because speech utterances vary in length, they must first be brought to a common length. Segmentation is one way to do this: it divides each utterance into multiple segments using a sliding window, and the model takes one segment as input. However, this limits the amount of information the model can consider at one time. We propose the multiple points input method to increase the amount of information that can be considered at one time. The CNNs take input from multiple points in an utterance that are separated far enough to have different characteristics. Experimental results on the ASVspoof 2019 physical access scenarios show that the proposed method reduces the equal error rate by about 44% relative to the baseline.
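
The contrast between single-segment input and multiple points input can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the points are chosen or how they are combined inside the network, so the function names, the evenly spaced selection rule, and the `seg_len`, `hop`, `num_points`, and `min_gap` parameters below are all assumptions made for illustration.

```python
from typing import List, Sequence

Frame = Sequence[float]  # one feature vector per time frame (assumed layout)


def sliding_segments(frames: Sequence[Frame], seg_len: int, hop: int) -> List[List[Frame]]:
    """Split an utterance into fixed-length segments with a sliding window."""
    if seg_len <= 0 or hop <= 0:
        raise ValueError("seg_len and hop must be positive")
    last_start = len(frames) - seg_len
    # An utterance shorter than seg_len yields no segments.
    return [list(frames[s:s + seg_len]) for s in range(0, last_start + 1, hop)]


def multiple_points_input(frames: Sequence[Frame], seg_len: int,
                          num_points: int, min_gap: int) -> List[List[Frame]]:
    """Take num_points segments whose start frames are at least min_gap apart.

    Assumption: points are spread evenly over the utterance. The abstract only
    states that they are "separated far enough".
    """
    if num_points < 1:
        raise ValueError("num_points must be at least 1")
    last_start = len(frames) - seg_len
    if last_start < 0:
        raise ValueError("utterance is shorter than one segment")
    if num_points == 1:
        starts = [0]
    else:
        step = last_start // (num_points - 1)
        if step < min_gap:
            raise ValueError("utterance too short for the requested separation")
        starts = [i * step for i in range(num_points)]
    return [list(frames[s:s + seg_len]) for s in starts]


# Toy utterance: 100 frames, each a 1-dimensional feature equal to its index.
utterance = [[float(t)] for t in range(100)]

single_inputs = sliding_segments(utterance, seg_len=20, hop=10)
multi_input = multiple_points_input(utterance, seg_len=20, num_points=3, min_gap=30)

print(len(single_inputs))                    # number of single-segment inputs
print([seg[0][0] for seg in multi_input])    # start frame of each selected point
```

With segmentation alone, the model sees one 20-frame segment per forward pass. In the sketch's multiple points variant, one input bundles three 20-frame segments drawn from distant parts of the same utterance, which a model could consume as separate channels or branches.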

Original language: English
Title of host publication: 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 6444-6448
Number of pages: 5
ISBN (Electronic): 9781509066315
DOIs
State: Published - May 2020
Event: 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020 - Barcelona, Spain
Duration: 4 May 2020 → 8 May 2020

Publication series

Name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume: 2020-May
ISSN (Print): 1520-6149

Conference

Conference: 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020
Country/Territory: Spain
City: Barcelona
Period: 4/05/20 → 8/05/20

Keywords

  • ASVspoof
  • convolutional neural network
  • multiple points input
  • replay attack detection
  • sliding window
