Repository logo
 
Publication

Broad phonetic class definition driven by phone confusions

dc.contributor.authorLopes, Carla
dc.contributor.authorPerdigão, Fernando
dc.date.accessioned2025-10-27T13:50:18Z
dc.date.available2025-10-27T13:50:18Z
dc.date.issued2012-07-23
dc.description.abstractIntermediate representations between the speech signal and phones may be used to improve discrimination among phones that are often confused. These representations are usually found according to broad phonetic classes, which are defined by a phonetician. This article proposes an alternative data-driven method to generate these classes. Phone confusion information from the analysis of the output of a phone recognition system is used to find clusters at high risk of mutual confusion. A metric is defined to compute the distance between phones. The results, using TIMIT data, show that the proposed confusion-driven phone clustering method is an attractive alternative to the approaches based on human knowledge. A hierarchical classification structure to improve phone recognition is also proposed using a discriminative weight training method. Experiments show improvements in phone recognition on the TIMIT database compared to a baseline system.eng
dc.description.sponsorshipCarla Lopes would like to thank the Portuguese foundation: Fundação para a Ciência e a Tecnologia for the PhD Grant (SFRH/BD/27966/2006).
dc.identifier.citationLopes and Perdigão EURASIP Journal on Advances in Signal Processing 2012, 2012:158
dc.identifier.doi10.1186/1687-6180-2012-158
dc.identifier.issn1687-6180
dc.identifier.urihttp://hdl.handle.net/10400.8/14379
dc.language.isoeng
dc.peerreviewedyes
dc.publisherSpringer Science and Business Media LLC
dc.relationDETECÇÃO DE EVENTOS ACÚSTICO-FONÉTICOS PARA RECONHECIMENTO AUTOMÁTICO DE FALA
dc.relation.ispartofEURASIP Journal on Advances in Signal Processing
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectDigital storage
dc.subjectLinguistics
dc.subjectSpeech recognition
dc.titleBroad phonetic class definition driven by phone confusionseng
dc.typejournal article
dspace.entity.typePublication
oaire.awardTitleDETECÇÃO DE EVENTOS ACÚSTICO-FONÉTICOS PARA RECONHECIMENTO AUTOMÁTICO DE FALA
oaire.awardURIhttp://hdl.handle.net/10400.8/13033
oaire.citation.issue1
oaire.citation.titleEURASIP Journal on Advances in Signal Processing
oaire.citation.volume2012
oaire.fundingStreamFARH
oaire.versionhttp://purl.org/coar/version/c_970fb48d4fbd8a85
relation.isProjectOfPublication4e8fdd81-def3-4535-bb9f-3b37e6bfc953
relation.isProjectOfPublication.latestForDiscovery4e8fdd81-def3-4535-bb9f-3b37e6bfc953

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Broad_phonetic_class_definition_driven_by_phone_co.pdf
Size:
655.2 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.32 KB
Format:
Item-specific license agreed upon to submission
Description: