Publication

Detection of Mispronunciations and Disfluencies in Children Reading Aloud

dc.contributor.author: Proença, Jorge
dc.contributor.author: Lopes, Carla
dc.contributor.author: Tjalve, Michael
dc.contributor.author: Stolcke, Andreas
dc.contributor.author: Candeias, Sara
dc.contributor.author: Perdigão, Fernando
dc.date.accessioned: 2025-10-14T10:57:38Z
dc.date.available: 2025-10-14T10:57:38Z
dc.date.issued: 2017-08-20
dc.description.abstract: To automatically evaluate the performance of children reading aloud or to follow a child’s reading in reading tutor applications, different types of reading disfluencies and mispronunciations must be accounted for. In this work, we aim to detect most of these disfluencies in sentence and pseudoword reading. Detecting incorrectly pronounced words, and quantifying the quality of word pronunciations, is arguably the hardest task. We approach the challenge as a two-step process. First, a segmentation using task-specific lattices is performed, while detecting repetitions and false starts and providing candidate segments for words. Then, candidates are classified as mispronounced or not, using multiple features derived from likelihood ratios based on phone decoding and forced alignment, as well as additional meta-information about the word. Several classifiers were explored (linear fit, neural networks, support vector machines) and trained after a feature selection stage to avoid overfitting. Improved results are obtained using feature combination compared to using only the log likelihood ratio of the reference word (22% versus 27% miss rate at constant 5% false alarm rate). [eng]
dc.description.sponsorship: This work was supported in part by Fundação para a Ciência e Tecnologia under the project UID/EEA/50008/2013 (pluriannual funding in the scope of the LETSREAD project at Instituto de Telecomunicações). The authors acknowledge the support given by Microsoft to this project. Jorge Proença is supported by the SFRH/BD/97204/2013 FCT Grant.
dc.identifier.citation: Proença, Jorge & Lopes, Carla & Tjalve, Michael & Stolcke, Andreas & Candeias, Sara & Perdigão, Fernando. (2017). Detection of Mispronunciations and Disfluencies in Children Reading Aloud. 1437-1441. 10.21437/Interspeech.2017-1522
dc.identifier.doi: 10.21437/interspeech.2017-1522
dc.identifier.uri: http://hdl.handle.net/10400.8/14255
dc.language.iso: eng
dc.peerreviewed: yes
dc.publisher: ISCA
dc.relation: Instituto de Telecomunicações
dc.relation: SFRH/BD/97204/2013
dc.relation.ispartof: Interspeech 2017
dc.rights.uri: N/A
dc.subject: Children’s speech
dc.subject: Reading disfluencies
dc.subject: Mispronunciation detection
dc.title: Detection of Mispronunciations and Disfluencies in Children Reading Aloud [eng]
dc.type: conference paper
dspace.entity.type: Publication
oaire.awardTitle: Instituto de Telecomunicações
oaire.awardURI: http://hdl.handle.net/10400.8/14168
oaire.citation.conferenceDate: 2017-08-20
oaire.citation.conferencePlace: Stockholm, Sweden
oaire.citation.endPage: 1441
oaire.citation.startPage: 1437
oaire.citation.title: INTERSPEECH 2017
oaire.fundingStream: Financiamento do Plano Estratégico de Unidades de I&D - 2013/2015 - OE
oaire.version: http://purl.org/coar/version/c_970fb48d4fbd8a85
person.familyName: Lopes
person.givenName: Carla
person.identifier.ciencia-id: AF14-3048-F510
person.identifier.orcid: 0000-0002-5366-0016
relation.isAuthorOfPublication: 4dfbaf0a-8c0b-4eaf-b1ad-0e590a5f3524
relation.isAuthorOfPublication.latestForDiscovery: 4dfbaf0a-8c0b-4eaf-b1ad-0e590a5f3524
relation.isProjectOfPublication: 090e77d3-8476-4972-9e33-6ad71214fa5c
relation.isProjectOfPublication.latestForDiscovery: 090e77d3-8476-4972-9e33-6ad71214fa5c
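
The abstract reports results as a miss rate at a constant 5% false alarm rate. As a rough illustration of how such an operating point can be read off a set of detector scores, here is a minimal sketch. It is not the authors' evaluation code: the function name `miss_rate_at_false_alarm`, the convention that a higher score means "more likely mispronounced", and the toy data are all assumptions made for this example.

```python
def miss_rate_at_false_alarm(scores, labels, target_far=0.05):
    """Lowest miss rate among thresholds whose false alarm rate <= target_far.

    scores: one number per candidate word; higher is assumed to mean
            "more likely mispronounced" (a convention chosen for this sketch).
    labels: 1 = mispronounced, 0 = correctly pronounced.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one example of each class")

    # Flagging nothing gives a false alarm rate of 0 and a miss rate of 1.
    best_miss = 1.0
    # Try every observed score as a threshold; a word is flagged
    # when its score is greater than or equal to the threshold.
    for threshold in sorted(set(scores)):
        false_alarms = sum(s >= threshold for s in negatives)
        if false_alarms / len(negatives) <= target_far:
            misses = sum(s < threshold for s in positives)
            best_miss = min(best_miss, misses / len(positives))
    return best_miss


# Hypothetical toy data: 20 correctly read words and 4 mispronounced ones.
toy_scores = list(range(20)) + [5, 18.5, 30, 40]
toy_labels = [0] * 20 + [1] * 4
print(miss_rate_at_false_alarm(toy_scores, toy_labels, target_far=0.05))
```

Fixing the false alarm rate makes it possible to compare detectors with a single number, which is how the abstract contrasts the combined features with the log likelihood ratio alone.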

Files

Original bundle
Name: Detection_of_Mispronunciations_and_Disfluencies_in.pdf
Size: 442.78 KB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.32 KB
Format: Item-specific license agreed upon to submission