A Novel Speech Analysis and Correction Tool for Arabic-Speaking Children

Lamia Berriche,Maha Driss,Areej Ahmed Almuntashri,Asma Mufreh Lghabi,Heba Saleh Almudhi,Munerah Abdul-Aziz Almansour

Published 2024 in arXiv.org

ABSTRACT

This paper introduces a new application named ArPA for Arabic kids who have trouble with pronunciation. Our application comprises two key components: the diagnostic module and the therapeutic module. The diagnostic process involves capturing the child's speech signal, preprocessing, and analyzing it using different machine learning classifiers like K-Nearest Neighbors (KNN), Support Vector Machine (SVM), and Decision Trees as well as deep neural network classifiers like ResNet18. The therapeutic module offers eye-catching gamified interfaces in which each correctly spoken letter earns a higher avatar level, providing positive reinforcement for the child's pronunciation improvement. Two datasets were used for experimental evaluation: one from a childcare centre and the other including Arabic alphabet pronunciation recordings. Our work uses a novel technique for speech recognition using Melspectrogram and MFCC images. The results show that the ResNet18 classifier on speech-to-image converted data effectively identifies mispronunciations in Arabic speech with an accuracy of 99.015\% with Mel-Spectrogram images outperforming ResNet18 with MFCC images.

PUBLICATION RECORD

Publication year
2024
Venue
arXiv.org
Publication date
2024-11-18
Fields of study
Linguistics, Computer Science, Education
Identifiers
DOI 10.48550/arXiv.2411.13592 arXiv 2411.13592
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Anomaly detection with a variational autoencoder for Arabic mispronunciation detection
2024cited by this paper
Mispronunciation detection and diagnosis using deep neural networks: a systematic review
2024cited by this paper
Arabic Mispronunciation Recognition System Using LSTM Network
2023cited by this paper
Mispronunciation Detection and Diagnosis with Articulatory-Level Feedback Generation for Non-Native Arabic Speech
2022cited by this paper
Correct Pronunciation Detection of the Arabic Alphabet Using Deep Learning
2021cited by this paper
An Approach for Pronunciation Classification of Classical Arabic Phonemes Using Deep Learning
2021cited by this paper
Improving Mispronunciation Detection of Arabic Words for Non-Native Learners Using Deep Convolutional Neural Network Features
2020cited by this paper
Language Disorders, Special Needs and Mathematics Learning
2020cited by this paper
Fundamentals, present and future perspectives of speech enhancement
2020cited by this paper
What Are the Peer Interaction Strengths and Difficulties in Children with Developmental Language Disorder? A Systematic Review
2020cited by this paper
Mispronunciation Detection Using Deep Convolutional Neural Network Features and Transfer Learning-Based Model for Arabic Phonemes
2019cited by this paper
Decision tree SVM model with Fisher feature selection for speech emotion recognition
2019cited by this paper
Speech Recognition using SVMs
2001cited by this paper

CITED BY

Listening to stars: audio-inspired multimodal learning for star classification
2025cites this paper
Machine Learning Methods for Recognizing Infant Cries: An Extensive Examination of Audio Signal Processing
2025cites this paper