Arabic Dialect Speech-Text Recognition Using Deep Learning

Recently, the dominant utilization of media networking has emphasised the importance of precisely identifying users’ feelings, covering a spectrum from contentment to dissatisfaction, in the domain of online communications. The dissertation addresses the challenges of accurately transcribing Arabic...

Full description

Saved in:
Bibliographic Details
Main Author: RAEIALBOOM, OMAR SALEH DARWISH (author)
Published: 2024
Subjects:
Online Access:https://bspace.buid.ac.ae/handle/1234/2753
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1862980611686268928
author RAEIALBOOM, OMAR SALEH DARWISH
author_facet RAEIALBOOM, OMAR SALEH DARWISH
author_role author
dc.contributor.none.fl_str_mv Dr Manar Al Khatib
dc.creator.none.fl_str_mv RAEIALBOOM, OMAR SALEH DARWISH
dc.date.none.fl_str_mv 2024-09
2025-01-23T08:47:09Z
2025-01-23T08:47:09Z
dc.format.none.fl_str_mv application/pdf
dc.identifier.none.fl_str_mv 23000715
https://bspace.buid.ac.ae/handle/1234/2753
dc.language.none.fl_str_mv en
dc.publisher.none.fl_str_mv The British University in Dubai (BUiD)
dc.subject.none.fl_str_mv Arabic speech recognition, deep learning, TestRCNN, hybrid TestRCNN-CNN
dc.title.none.fl_str_mv Arabic Dialect Speech-Text Recognition Using Deep Learning
dc.type.none.fl_str_mv Dissertation
description Recently, the dominant utilization of media networking has emphasised the importance of precisely identifying users’ feelings, covering a spectrum from contentment to dissatisfaction, in the domain of online communications. The dissertation addresses the challenges of accurately transcribing Arabic speech due to the language’s complexity, limited audio resources, and diverse regional variations. Traditional speech recognition models struggle in this domain, prompting an exploration of deep learning approaches, specifically the TestRCNN and Hybrid TestRCNN-CNN Models. The research begins with a comprehensive preprocessing process, which involves loading audio data, extracting features using Mel-Frequency Cepstral Coefficients (MFCCs), and encoding labels. Both models are trained and evaluated on a curated dataset of Arabic speech samples, capturing the spatial and temporal features. The TestRCNN Model combines convolutional layers for local feature extraction and recurrent layers to capture temporal dependencies. It achieves an accuracy of 93% and a word error rate (WER) of 0.0986, but faces difficulties in distinguishing closely related phonetic sounds. To address these limitations, a hybrid approach is proposed, combining the TestRCNN and CNN architectures. This hybrid model leverages the CNN’s ability to extract detailed spatial features and the TestRCNN’s proficiency in capturing long-term dependencies. The Hybrid TestRCNN-CNN Model outperforms the TestRCNN, achieving an accuracy of 94% and significantly reducing the (WER) to 0.0460. The dissertation provides a detailed comparison of these models’ operational features, hyperparameters, and outcomes. Through extensive experimentation, the study highlights the hybrid approach’s advantages in accurately transcribing Arabic speech and contributes valuable insights to the field of Arabic speech recognition research.
id budr_1b6feeaf74a850ae5d82e7fbe8d66afa
identifier_str_mv 23000715
language_invalid_str_mv en
network_acronym_str budr
network_name_str The British University in Dubai repository
oai_identifier_str oai:bspace.buid.ac.ae:1234/2753
publishDate 2024
publisher.none.fl_str_mv The British University in Dubai (BUiD)
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
spelling Arabic Dialect Speech-Text Recognition Using Deep LearningRAEIALBOOM, OMAR SALEH DARWISHArabic speech recognition, deep learning, TestRCNN, hybrid TestRCNN-CNNRecently, the dominant utilization of media networking has emphasised the importance of precisely identifying users’ feelings, covering a spectrum from contentment to dissatisfaction, in the domain of online communications. The dissertation addresses the challenges of accurately transcribing Arabic speech due to the language’s complexity, limited audio resources, and diverse regional variations. Traditional speech recognition models struggle in this domain, prompting an exploration of deep learning approaches, specifically the TestRCNN and Hybrid TestRCNN-CNN Models. The research begins with a comprehensive preprocessing process, which involves loading audio data, extracting features using Mel-Frequency Cepstral Coefficients (MFCCs), and encoding labels. Both models are trained and evaluated on a curated dataset of Arabic speech samples, capturing the spatial and temporal features. The TestRCNN Model combines convolutional layers for local feature extraction and recurrent layers to capture temporal dependencies. It achieves an accuracy of 93% and a word error rate (WER) of 0.0986, but faces difficulties in distinguishing closely related phonetic sounds. To address these limitations, a hybrid approach is proposed, combining the TestRCNN and CNN architectures. This hybrid model leverages the CNN’s ability to extract detailed spatial features and the TestRCNN’s proficiency in capturing long-term dependencies. The Hybrid TestRCNN-CNN Model outperforms the TestRCNN, achieving an accuracy of 94% and significantly reducing the (WER) to 0.0460. The dissertation provides a detailed comparison of these models’ operational features, hyperparameters, and outcomes. Through extensive experimentation, the study highlights the hybrid approach’s advantages in accurately transcribing Arabic speech and contributes valuable insights to the field of Arabic speech recognition research.The British University in Dubai (BUiD)Dr Manar Al Khatib2025-01-23T08:47:09Z2025-01-23T08:47:09Z2024-09Dissertationapplication/pdf23000715https://bspace.buid.ac.ae/handle/1234/2753enoai:bspace.buid.ac.ae:1234/27532025-01-23T23:00:20Z
spellingShingle Arabic Dialect Speech-Text Recognition Using Deep Learning
RAEIALBOOM, OMAR SALEH DARWISH
Arabic speech recognition, deep learning, TestRCNN, hybrid TestRCNN-CNN
title Arabic Dialect Speech-Text Recognition Using Deep Learning
title_full Arabic Dialect Speech-Text Recognition Using Deep Learning
title_fullStr Arabic Dialect Speech-Text Recognition Using Deep Learning
title_full_unstemmed Arabic Dialect Speech-Text Recognition Using Deep Learning
title_short Arabic Dialect Speech-Text Recognition Using Deep Learning
title_sort Arabic Dialect Speech-Text Recognition Using Deep Learning
topic Arabic speech recognition, deep learning, TestRCNN, hybrid TestRCNN-CNN
url https://bspace.buid.ac.ae/handle/1234/2753