Abstract
A new network model, U-net, is proposed to recognize continuous speech, based on the non-uniform unit which is a kind of acoustic sub-word unit. In this model, input speech can be segmented into units by using a part of the network before classification. The unit has steady states at the boundaries and a transient state in the middle. The network structure is designed according to the structure of the unit. The steady states and transient state are recognized by separate networks and different feature parameters are used. For the transient part a delta parameter is used. The segmentation net is trained to reduce the number of unit classes.
Original language | English |
---|---|
Pages | 1677-1680 |
Number of pages | 4 |
State | Published - 1995 |
Event | 4th European Conference on Speech Communication and Technology, EUROSPEECH 1995 - Madrid, Spain Duration: 18 Sep 1995 → 21 Sep 1995 |
Conference
Conference | 4th European Conference on Speech Communication and Technology, EUROSPEECH 1995 |
---|---|
Country/Territory | Spain |
City | Madrid |
Period | 18/09/95 → 21/09/95 |