Neural network for 500 vocabulary word spotting using acoustic sub-word units

Ha Jin Yu, Yung Hwan Oh

Research output: Contribution to journalConference articlepeer-review

3 Scopus citations

Abstract

A neural network model based on a non-uniform unit for speaker-independent continuous speech recognition is proposed. The functions of the neural network model include segmenting the input speech into sub-word units, classifying the units and detecting words, and each of them is implemented by a module. The recognition unit we propose can includes arbitrary number of phonemes in a unit, so that it can absorb co-articulation effects which spread for several phonemes. The unit classifier module separates the speech into stationary and transition parts and use different parameters for them. The word detector module can learn all the pronunciation variations in the training data. The system is evaluated on a subset of TIMIT speech data.

Original languageEnglish
Pages (from-to)3277-3280
Number of pages4
JournalProceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
Volume4
StatePublished - 1997
EventProceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP. Part 1 (of 5) - Munich, Ger
Duration: 21 Apr 199724 Apr 1997

Fingerprint

Dive into the research topics of 'Neural network for 500 vocabulary word spotting using acoustic sub-word units'. Together they form a unique fingerprint.

Cite this