Research Project

DETECTION OF ACOUSTIC-PHONETIC EVENTS FOR AUTOMATIC SPEECH RECOGNITION

Funder

Authors

Publications

A hierarchical broad-class classification to enhance phoneme recognition
Publication . Lopes, Carla, Alexandra Calado Lopes; Perdigão, Fernando
In this paper a hierarchical classification of different levels of phonetic information is proposed in order to improve phone recognition. In this paradigm several intermediate classifiers give posterior probability predictions for broad phonetic classes, achieving phone detail in the last layer. Class membership probabilities are weighted and combined in order to get a more robust phoneme prediction. A method for finding the best set of weights is also proposed based on discriminative training in a hybrid MLP/HMM system. Experiments show that the use of broad-class information enhances phone recognition. Relative improvements of 8% in Correctness and 5% in Accuracy were achieved in phoneme recognition on the TIMIT database compared to a baseline system.
Broad phonetic class definition driven by phone confusions
Publication . Lopes, Carla; Perdigão, Fernando
Intermediate representations between the speech signal and phones may be used to improve discrimination among phones that are often confused. These representations are usually found according to broad phonetic classes, which are defined by a phonetician. This article proposes an alternative data-driven method to generate these classes. Phone confusion information from the analysis of the output of a phone recognition system is used to find clusters at high risk of mutual confusion. A metric is defined to compute the distance between phones. The results, using TIMIT data, show that the proposed confusion-driven phone clustering method is an attractive alternative to the approaches based on human knowledge. A hierarchical classification structure to improve phone recognition is also proposed using a discriminative weight training method. Experiments show improvements in phone recognition on the TIMIT database compared to a baseline system.
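The confusion-driven clustering idea in the second abstract can be illustrated with a minimal sketch. The abstract does not state the distance metric or the clustering algorithm, so everything below is an assumption: the distance is taken as one minus the symmetrised confusion rate, clusters are merged with average linkage, and the function names and toy confusion counts are hypothetical (they are not TIMIT results).

```python
def confusion_distance(counts):
    """Build a symmetric phone distance from a confusion-count matrix.

    counts[i][j] is the number of times phone i was recognised as phone j.
    Assumed metric: 1 - mean of the two directed confusion rates.
    """
    n = len(counts)
    # Row-normalise counts into P(recognised j | spoken i).
    probs = []
    for row in counts:
        total = sum(row)
        probs.append([c / total if total else 0.0 for c in row])
    dist = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                dist[i][j] = 1.0 - 0.5 * (probs[i][j] + probs[j][i])
    return dist


def cluster_phones(phones, dist, n_clusters):
    """Agglomerative clustering (average linkage) down to n_clusters groups."""
    clusters = [[i] for i in range(len(phones))]
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Average pairwise distance between the two clusters.
                total = sum(dist[i][j] for i in clusters[a] for j in clusters[b])
                d = total / (len(clusters[a]) * len(clusters[b]))
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(phones[i] for i in c) for c in clusters]


# Toy example with invented counts: stops confuse with stops,
# fricatives with fricatives.
phones = ["p", "b", "s", "z"]
counts = [
    [80, 15, 3, 2],
    [12, 82, 2, 4],
    [2, 1, 85, 12],
    [1, 3, 14, 82],
]
broad_classes = cluster_phones(phones, confusion_distance(counts), 2)
print(broad_classes)
```

Phones that are frequently mistaken for each other end up close together, so the resulting groups play the role of data-driven broad classes; the number of clusters is a free parameter in this sketch.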

Organizational Units

Description

Keywords

Engineering and technology; Engineering and technology/Electrical engineering, electronic engineering, information engineering

Contributors

Funders

Funding agency

Fundação para a Ciência e a Tecnologia, I.P.

Funding programme

FARH

Funding Award Number

SFRH/BD/27966/2006

ID