TY - JOUR
T1 - The biovisualspeech corpus of words with sibilants for speech therapy games development
AU - Cavaco, Sofia
AU - Guimarães, Isabel
AU - Ascensão, Mariana
AU - Abad, Alberto
AU - Anjos, Ivo
AU - Oliveira, Francisco
AU - Martins, Sofia
AU - Marques, Nuno
AU - Eskenazi, Maxine
AU - Magalhães, João
AU - Grilo, Margarida
N1 - This work was supported by the Portuguese Foundation for Science and Technology under projects BioVisualSpeech (CMUP-ERI/TIC/0033/2014), NOVA-LINCS (UIDB/04516/2020) and INESC-ID (UIDB/50021/2020).
PY - 2020/10/2
Y1 - 2020/10/2
N2 - In order to develop computer tools for speech therapy that reliably classify speech productions, there is a need for speech production corpora that characterize the target population in terms of age, gender, and native language. Apart from including correct speech productions, in order to characterize the target population, the corpora should also include samples from people with speech sound disorders. In addition, the annotation of the data should include information on the correctness of the speech productions. Following these criteria, we collected a corpus that can be used to develop computer tools for speech and language therapy of Portuguese children with sigmatism. The proposed corpus contains European Portuguese children’s word productions in which the words have sibilant consonants. The corpus has productions from 356 children from 5 to 9 years of age. Some important characteristics of this corpus, that are relevant to speech and language therapy and computer science research, are that (1) the corpus includes data from children with speech sound disorders; and (2) the productions were annotated according to the criteria of speech and language pathologists, and have information about the speech production errors. These are relevant features for the development and assessment of speech processing tools for speech therapy of Portuguese children. In addition, as an illustration on how to use the corpus, we present three speech therapy games that use a convolutional neural network sibilants classifier trained with data from this corpus and a word recognition module trained on additional children data and calibrated and evaluated with the collected corpus.
AB - In order to develop computer tools for speech therapy that reliably classify speech productions, there is a need for speech production corpora that characterize the target population in terms of age, gender, and native language. Apart from including correct speech productions, in order to characterize the target population, the corpora should also include samples from people with speech sound disorders. In addition, the annotation of the data should include information on the correctness of the speech productions. Following these criteria, we collected a corpus that can be used to develop computer tools for speech and language therapy of Portuguese children with sigmatism. The proposed corpus contains European Portuguese children’s word productions in which the words have sibilant consonants. The corpus has productions from 356 children from 5 to 9 years of age. Some important characteristics of this corpus, that are relevant to speech and language therapy and computer science research, are that (1) the corpus includes data from children with speech sound disorders; and (2) the productions were annotated according to the criteria of speech and language pathologists, and have information about the speech production errors. These are relevant features for the development and assessment of speech processing tools for speech therapy of Portuguese children. In addition, as an illustration on how to use the corpus, we present three speech therapy games that use a convolutional neural network sibilants classifier trained with data from this corpus and a word recognition module trained on additional children data and calibrated and evaluated with the collected corpus.
KW - Children’s speech corpus
KW - Serious games for speech and language therapy
KW - Sibilant consonants
KW - Speech sound disorders
UR - http://www.scopus.com/inward/record.url?scp=85093071742&partnerID=8YFLogxK
U2 - 10.3390/info11100470
DO - 10.3390/info11100470
M3 - Conference article
AN - SCOPUS:85093071742
SN - 2078-2489
VL - 11
SP - 1
EP - 18
JO - Information (Switzerland)
JF - Information (Switzerland)
IS - 10(SI)
M1 - 470
T2 - 14th International Conference on the Computational Processing of Portuguese, PROPOR 2020
Y2 - 2 March 2020 through 4 March 2020
ER -