Sibilant Consonants Classification with Deep Neural Networks

Ivo Anjos, Nuno Marques, Margarida Grilo, Isabel Guimarães, João Magalhães, Sofia Cavaco

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

Many children suffering from speech sound disorders cannot pronounce the sibilant consonants correctly. We have developed a serious game that is controlled by the children’s voices in real time and that allows children to practice the European Portuguese sibilant consonants. For this, the game uses a sibilant consonant classifier. Since the game does not require any type of adult supervision, children can practice the production of these sounds more often, which may lead to faster improvements of their speech. Recently, the use of deep neural networks has given considerable improvements in classification for a variety of use cases, from image classification to speech and language processing. Here we propose to use deep convolutional neural networks to classify sibilant phonemes of European Portuguese in our serious game for speech and language therapy. We compared the performance of several different artificial neural networks that used Mel frequency cepstral coefficients or log Mel filterbanks. Our best deep learning model achieves classification scores of $$95.48\%$$ using a 2D convolutional model with log Mel filterbanks as input features.

Original languageEnglish
Title of host publicationProgress in Artificial Intelligence - 19th EPIA Conference on Artificial Intelligence, EPIA 2019, Proceedings
EditorsPaulo Moura Oliveira, Paulo Novais, Luís Paulo Reis
Place of PublicationCham
PublisherSpringer Verlag
Pages435-447
Number of pages13
ISBN (Electronic)978-3-030-30244-3
ISBN (Print)978-3-030-30243-6
DOIs
Publication statusPublished - 2019
Event19th EPIA Conference on Artificial Intelligence, EPIA 2019 - Vila Real, Portugal
Duration: 3 Sept 20196 Sept 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer Verlag
Volume11805 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th EPIA Conference on Artificial Intelligence, EPIA 2019
Country/TerritoryPortugal
CityVila Real
Period3/09/196/09/19

Keywords

  • Deep learning
  • Sibilant consonants
  • Speech and language therapy

Fingerprint

Dive into the research topics of 'Sibilant Consonants Classification with Deep Neural Networks'. Together they form a unique fingerprint.

Cite this