Arabic Dialect Speech-Text Recognition Using Deep Learning

Recently, the dominant utilization of media networking has emphasised the importance of precisely identifying users’ feelings, covering a spectrum from contentment to dissatisfaction, in the domain of online communications. The dissertation addresses the challenges of accurately transcribing Arabic...

Full description

Saved in:

Bibliographic Details
Main Author:	RAEIALBOOM, OMAR SALEH DARWISH (author)
Published:	2024
Subjects:	Arabic speech recognition, deep learning, TestRCNN, hybrid TestRCNN-CNN
Online Access:	https://bspace.buid.ac.ae/handle/1234/2753
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1862980611686268928
author	RAEIALBOOM, OMAR SALEH DARWISH
author_facet	RAEIALBOOM, OMAR SALEH DARWISH
author_role	author
dc.contributor.none.fl_str_mv	Dr Manar Al Khatib
dc.creator.none.fl_str_mv	RAEIALBOOM, OMAR SALEH DARWISH
dc.date.none.fl_str_mv	2024-09 2025-01-23T08:47:09Z 2025-01-23T08:47:09Z
dc.format.none.fl_str_mv	application/pdf
dc.identifier.none.fl_str_mv	23000715 https://bspace.buid.ac.ae/handle/1234/2753
dc.language.none.fl_str_mv	en
dc.publisher.none.fl_str_mv	The British University in Dubai (BUiD)
dc.subject.none.fl_str_mv	Arabic speech recognition, deep learning, TestRCNN, hybrid TestRCNN-CNN
dc.title.none.fl_str_mv	Arabic Dialect Speech-Text Recognition Using Deep Learning
dc.type.none.fl_str_mv	Dissertation
description	Recently, the dominant utilization of media networking has emphasised the importance of precisely identifying users’ feelings, covering a spectrum from contentment to dissatisfaction, in the domain of online communications. The dissertation addresses the challenges of accurately transcribing Arabic speech due to the language’s complexity, limited audio resources, and diverse regional variations. Traditional speech recognition models struggle in this domain, prompting an exploration of deep learning approaches, specifically the TestRCNN and Hybrid TestRCNN-CNN Models. The research begins with a comprehensive preprocessing process, which involves loading audio data, extracting features using Mel-Frequency Cepstral Coefficients (MFCCs), and encoding labels. Both models are trained and evaluated on a curated dataset of Arabic speech samples, capturing the spatial and temporal features. The TestRCNN Model combines convolutional layers for local feature extraction and recurrent layers to capture temporal dependencies. It achieves an accuracy of 93% and a word error rate (WER) of 0.0986, but faces difficulties in distinguishing closely related phonetic sounds. To address these limitations, a hybrid approach is proposed, combining the TestRCNN and CNN architectures. This hybrid model leverages the CNN’s ability to extract detailed spatial features and the TestRCNN’s proficiency in capturing long-term dependencies. The Hybrid TestRCNN-CNN Model outperforms the TestRCNN, achieving an accuracy of 94% and significantly reducing the (WER) to 0.0460. The dissertation provides a detailed comparison of these models’ operational features, hyperparameters, and outcomes. Through extensive experimentation, the study highlights the hybrid approach’s advantages in accurately transcribing Arabic speech and contributes valuable insights to the field of Arabic speech recognition research.
id	budr_1b6feeaf74a850ae5d82e7fbe8d66afa
identifier_str_mv	23000715
language_invalid_str_mv	en
network_acronym_str	budr
network_name_str	The British University in Dubai repository
oai_identifier_str	oai:bspace.buid.ac.ae:1234/2753
publishDate	2024
publisher.none.fl_str_mv	The British University in Dubai (BUiD)
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
spelling	Arabic Dialect Speech-Text Recognition Using Deep LearningRAEIALBOOM, OMAR SALEH DARWISHArabic speech recognition, deep learning, TestRCNN, hybrid TestRCNN-CNNRecently, the dominant utilization of media networking has emphasised the importance of precisely identifying users’ feelings, covering a spectrum from contentment to dissatisfaction, in the domain of online communications. The dissertation addresses the challenges of accurately transcribing Arabic speech due to the language’s complexity, limited audio resources, and diverse regional variations. Traditional speech recognition models struggle in this domain, prompting an exploration of deep learning approaches, specifically the TestRCNN and Hybrid TestRCNN-CNN Models. The research begins with a comprehensive preprocessing process, which involves loading audio data, extracting features using Mel-Frequency Cepstral Coefficients (MFCCs), and encoding labels. Both models are trained and evaluated on a curated dataset of Arabic speech samples, capturing the spatial and temporal features. The TestRCNN Model combines convolutional layers for local feature extraction and recurrent layers to capture temporal dependencies. It achieves an accuracy of 93% and a word error rate (WER) of 0.0986, but faces difficulties in distinguishing closely related phonetic sounds. To address these limitations, a hybrid approach is proposed, combining the TestRCNN and CNN architectures. This hybrid model leverages the CNN’s ability to extract detailed spatial features and the TestRCNN’s proficiency in capturing long-term dependencies. The Hybrid TestRCNN-CNN Model outperforms the TestRCNN, achieving an accuracy of 94% and significantly reducing the (WER) to 0.0460. The dissertation provides a detailed comparison of these models’ operational features, hyperparameters, and outcomes. Through extensive experimentation, the study highlights the hybrid approach’s advantages in accurately transcribing Arabic speech and contributes valuable insights to the field of Arabic speech recognition research.The British University in Dubai (BUiD)Dr Manar Al Khatib2025-01-23T08:47:09Z2025-01-23T08:47:09Z2024-09Dissertationapplication/pdf23000715https://bspace.buid.ac.ae/handle/1234/2753enoai:bspace.buid.ac.ae:1234/27532025-01-23T23:00:20Z
spellingShingle	Arabic Dialect Speech-Text Recognition Using Deep Learning RAEIALBOOM, OMAR SALEH DARWISH Arabic speech recognition, deep learning, TestRCNN, hybrid TestRCNN-CNN
title	Arabic Dialect Speech-Text Recognition Using Deep Learning
title_full	Arabic Dialect Speech-Text Recognition Using Deep Learning
title_fullStr	Arabic Dialect Speech-Text Recognition Using Deep Learning
title_full_unstemmed	Arabic Dialect Speech-Text Recognition Using Deep Learning
title_short	Arabic Dialect Speech-Text Recognition Using Deep Learning
title_sort	Arabic Dialect Speech-Text Recognition Using Deep Learning
topic	Arabic speech recognition, deep learning, TestRCNN, hybrid TestRCNN-CNN
url	https://bspace.buid.ac.ae/handle/1234/2753

Arabic Dialect Speech-Text Recognition Using Deep Learning

Similar Items