Publication
Broad phonetic class definition driven by phone confusions
| dc.contributor.author | Lopes, Carla | |
| dc.contributor.author | Perdigão, Fernando | |
| dc.date.accessioned | 2025-10-27T13:50:18Z | |
| dc.date.available | 2025-10-27T13:50:18Z | |
| dc.date.issued | 2012-07-23 | |
| dc.description.abstract | Intermediate representations between the speech signal and phones may be used to improve discrimination among phones that are often confused. These representations are usually found according to broad phonetic classes, which are defined by a phonetician. This article proposes an alternative data-driven method to generate these classes. Phone confusion information from the analysis of the output of a phone recognition system is used to find clusters at high risk of mutual confusion. A metric is defined to compute the distance between phones. The results, using TIMIT data, show that the proposed confusion-driven phone clustering method is an attractive alternative to the approaches based on human knowledge. A hierarchical classification structure to improve phone recognition is also proposed using a discriminative weight training method. Experiments show improvements in phone recognition on the TIMIT database compared to a baseline system. | eng |
| dc.description.sponsorship | Carla Lopes would like to thank the Portuguese foundation: Fundação para a Ciência e a Tecnologia for the PhD Grant (SFRH/BD/27966/2006). | |
| dc.identifier.citation | Lopes and Perdigão EURASIP Journal on Advances in Signal Processing 2012, 2012:158 | |
| dc.identifier.doi | 10.1186/1687-6180-2012-158 | |
| dc.identifier.issn | 1687-6180 | |
| dc.identifier.uri | http://hdl.handle.net/10400.8/14379 | |
| dc.language.iso | eng | |
| dc.peerreviewed | yes | |
| dc.publisher | Springer Science and Business Media LLC | |
| dc.relation | DETECÇÃO DE EVENTOS ACÚSTICO-FONÉTICOS PARA RECONHECIMENTO AUTOMÁTICO DE FALA | |
| dc.relation.ispartof | EURASIP Journal on Advances in Signal Processing | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | Digital storage | |
| dc.subject | Linguistics | |
| dc.subject | Speech recognition | |
| dc.title | Broad phonetic class definition driven by phone confusions | eng |
| dc.type | journal article | |
| dspace.entity.type | Publication | |
| oaire.awardTitle | DETECÇÃO DE EVENTOS ACÚSTICO-FONÉTICOS PARA RECONHECIMENTO AUTOMÁTICO DE FALA | |
| oaire.awardURI | http://hdl.handle.net/10400.8/13033 | |
| oaire.citation.issue | 1 | |
| oaire.citation.title | EURASIP Journal on Advances in Signal Processing | |
| oaire.citation.volume | 2012 | |
| oaire.fundingStream | FARH | |
| oaire.version | http://purl.org/coar/version/c_970fb48d4fbd8a85 | |
| relation.isProjectOfPublication | 4e8fdd81-def3-4535-bb9f-3b37e6bfc953 | |
| relation.isProjectOfPublication.latestForDiscovery | 4e8fdd81-def3-4535-bb9f-3b37e6bfc953 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Broad_phonetic_class_definition_driven_by_phone_co.pdf
- Size:
- 655.2 KB
- Format:
- Adobe Portable Document Format
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.32 KB
- Format:
- Item-specific license agreed upon to submission
- Description:
