Publication

The LetsRead Corpus of Portuguese children reading aloud for performance evaluation

dc.contributor.author: Proença, Jorge
dc.contributor.author: Celorico, Dirce
dc.contributor.author: Candeias, Sara
dc.contributor.author: Lopes, Carla
dc.contributor.author: Perdigão, Fernando
dc.date.accessioned: 2025-06-17T14:28:01Z
dc.date.available: 2025-06-17T14:28:01Z
dc.date.issued: 2016
dc.description.abstract: This paper introduces the LetsRead Corpus of European Portuguese read speech from children aged 6 to 10. The motivation for creating this corpus stems from the lack of databases with recordings of reading tasks by Portuguese children with different performance levels, including all the common reading-aloud disfluencies. Such a corpus is also essential for developing techniques that fulfill the main objective of the LetsRead project: to automatically evaluate the reading performance of children through the analysis of reading tasks. The collected data amounts to 20 hours of speech from 284 children from private and public Portuguese schools, with each child carrying out two tasks: reading sentences and reading a list of pseudowords, both with varying levels of difficulty throughout the school grades. In this paper, the design of the reading tasks presented to children is described, as well as the collection procedure. Manually annotated data is analyzed according to disfluencies and reading performance. The considered word difficulty parameter is also confirmed to be suitable for the pseudoword reading tasks. [eng]
dc.description.sponsorship: This work was supported in part by Fundação para a Ciência e Tecnologia under the projects UID/EEA/50008/2013 (pluriannual funding in the scope of the LETSREAD project), and Marie Curie Action IRIS (ref. 610986, FP7-PEOPLE-2013-IAPP). Jorge Proença is supported by the SFRH/BD/97204/2013 FCT Grant. We would like to thank the João de Deus, Bissaya Barreto and EBI de Pereira school associations and the CASPAE parents' association for collaborating in the database collection.
dc.identifier.isbn: 978-295174089-1
dc.identifier.uri: http://hdl.handle.net/10400.8/13300
dc.language.iso: eng
dc.peerreviewed: n/a
dc.publisher: European Language Resources Association (ELRA)
dc.relation.hasversion: https://www.scopus.com/record/display.uri?eid=2-s2.0-84997286084&origin=inward&txGid=5de4267bb17dee72547085ed848fb080
dc.rights.uri: N/A
dc.subject: Children’s speech
dc.subject: Reading disfluencies
dc.subject: European Portuguese
dc.title: The LetsRead Corpus of Portuguese children reading aloud for performance evaluation [eng]
dc.type: conference paper
dspace.entity.type: Publication
oaire.citation.conferenceDate: 2016
oaire.citation.endPage: 785
oaire.citation.startPage: 781
oaire.citation.title: 10th International Conference on Language Resources and Evaluation, LREC 2016
oaire.version: http://purl.org/coar/version/c_970fb48d4fbd8a85
person.familyName: Lopes
person.givenName: Carla
person.identifier.ciencia-id: AF14-3048-F510
person.identifier.orcid: 0000-0002-5366-0016
relation.isAuthorOfPublication: 4dfbaf0a-8c0b-4eaf-b1ad-0e590a5f3524
relation.isAuthorOfPublication.latestForDiscovery: 4dfbaf0a-8c0b-4eaf-b1ad-0e590a5f3524

Files

Original bundle
Name: L16-1125.pdf
Size: 538.41 KB
Format: Adobe Portable Document Format

License bundle
Name: license.txt
Size: 1.32 KB
Format: Item-specific license agreed upon to submission