Automatic Recognition of Poets for Arabic Poetry using Deep Learning Techniques (LSTM and Bi-LSTM)

Arabic poetry with its beauty, deep cultural importance and linguistic features, has always been a subject of attraction for scholars and readers. It attracted numerous researchers and writers to analyze and extract deep poetic features from various poems. As the literature review shows, there are n...

Full description

Saved in:
Bibliographic Details
Main Author: AL SHOUBAKI, HAMZA YOUNIS (author)
Published: 2024
Subjects:
Online Access:https://bspace.buid.ac.ae/handle/1234/2592
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1862980615297564672
author AL SHOUBAKI, HAMZA YOUNIS
author_facet AL SHOUBAKI, HAMZA YOUNIS
author_role author
dc.contributor.none.fl_str_mv Professor Sherief Abdallah
dc.creator.none.fl_str_mv AL SHOUBAKI, HAMZA YOUNIS
dc.date.none.fl_str_mv 2024-04-25T07:20:25Z
2024-04-25T07:20:25Z
2024-02
dc.format.none.fl_str_mv application/pdf
dc.identifier.none.fl_str_mv 20171673
https://bspace.buid.ac.ae/handle/1234/2592
dc.language.none.fl_str_mv en
dc.publisher.none.fl_str_mv The British University in Dubai (BUiD)
dc.subject.none.fl_str_mv Arabic poetry, text classification, RNN, LSTM, Bi-LSTM
dc.title.none.fl_str_mv Automatic Recognition of Poets for Arabic Poetry using Deep Learning Techniques (LSTM and Bi-LSTM)
التعرف التلقائي على شعراء الشعر العربي باستخدام تقنيات التعلم العميق (LSTM و Bi-LSTM)
dc.type.none.fl_str_mv Dissertation
description Arabic poetry with its beauty, deep cultural importance and linguistic features, has always been a subject of attraction for scholars and readers. It attracted numerous researchers and writers to analyze and extract deep poetic features from various poems. As the literature review shows, there are numerous successful attempts to identify these traits and characteristics such as categorizing the used poetry metric and identifying the poets behind these poems. In our research, we introduce a comprehensive approach to Arabic poetry text classification using deep learning techniques. We have used an almost one-million record dataset of Arabic poetry verses extracted from a poetry encyclopedia. These verses are labeled with different nine poets and cover both classical and modern poetic styles. Due to the complexity of Arabic poetry such as the excessive use of metaphors, figurative language, unlimited imagination, and the diversity of styles from one poet to another and from one poem to another, we tackle these challenges by careful employment of preprocessing steps, feature engineering and selection. We also explore a range of algorithms, including traditional classifiers and deep learning models, to determine and select the most suitable and accurate models of identifying poets' names from the verses. We have decided to employ LSTSM and Bi-LSTM as our main baseline models. The reason behind selecting such models is observing a concentration on RNN (Recurrent Neural Network) and its variants when it comes to text classification. LSTM has proven its capability for sequential data analysis in many different languages. Our reported results have shown promising classification accuracy with an average of 92.35%. This sheds some light on the feasibility of automating the classification of a morphologically complex language text (Arabic). Bi-LSTM has slightly outperformed the classic LSTM in normal situation with average accuracy of 92.15% and 92.56% for LSTM and Bi-LSTM respectively. We discuss what would be the impact of our research findings on Arabic literature in particularly Arabic poetry. We also address the challenges associated with this study.
id budr_de94f20195d0df0889bfbba12def4f50
identifier_str_mv 20171673
language_invalid_str_mv en
network_acronym_str budr
network_name_str The British University in Dubai repository
oai_identifier_str oai:bspace.buid.ac.ae:1234/2592
publishDate 2024
publisher.none.fl_str_mv The British University in Dubai (BUiD)
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
spelling Automatic Recognition of Poets for Arabic Poetry using Deep Learning Techniques (LSTM and Bi-LSTM)التعرف التلقائي على شعراء الشعر العربي باستخدام تقنيات التعلم العميق (LSTM و Bi-LSTM)AL SHOUBAKI, HAMZA YOUNISArabic poetry, text classification, RNN, LSTM, Bi-LSTMArabic poetry with its beauty, deep cultural importance and linguistic features, has always been a subject of attraction for scholars and readers. It attracted numerous researchers and writers to analyze and extract deep poetic features from various poems. As the literature review shows, there are numerous successful attempts to identify these traits and characteristics such as categorizing the used poetry metric and identifying the poets behind these poems. In our research, we introduce a comprehensive approach to Arabic poetry text classification using deep learning techniques. We have used an almost one-million record dataset of Arabic poetry verses extracted from a poetry encyclopedia. These verses are labeled with different nine poets and cover both classical and modern poetic styles. Due to the complexity of Arabic poetry such as the excessive use of metaphors, figurative language, unlimited imagination, and the diversity of styles from one poet to another and from one poem to another, we tackle these challenges by careful employment of preprocessing steps, feature engineering and selection. We also explore a range of algorithms, including traditional classifiers and deep learning models, to determine and select the most suitable and accurate models of identifying poets' names from the verses. We have decided to employ LSTSM and Bi-LSTM as our main baseline models. The reason behind selecting such models is observing a concentration on RNN (Recurrent Neural Network) and its variants when it comes to text classification. LSTM has proven its capability for sequential data analysis in many different languages. Our reported results have shown promising classification accuracy with an average of 92.35%. This sheds some light on the feasibility of automating the classification of a morphologically complex language text (Arabic). Bi-LSTM has slightly outperformed the classic LSTM in normal situation with average accuracy of 92.15% and 92.56% for LSTM and Bi-LSTM respectively. We discuss what would be the impact of our research findings on Arabic literature in particularly Arabic poetry. We also address the challenges associated with this study.The British University in Dubai (BUiD)Professor Sherief Abdallah2024-04-25T07:20:25Z2024-04-25T07:20:25Z2024-02Dissertationapplication/pdf20171673https://bspace.buid.ac.ae/handle/1234/2592enoai:bspace.buid.ac.ae:1234/25922024-04-25T23:00:40Z
spellingShingle Automatic Recognition of Poets for Arabic Poetry using Deep Learning Techniques (LSTM and Bi-LSTM)
AL SHOUBAKI, HAMZA YOUNIS
Arabic poetry, text classification, RNN, LSTM, Bi-LSTM
title Automatic Recognition of Poets for Arabic Poetry using Deep Learning Techniques (LSTM and Bi-LSTM)
title_full Automatic Recognition of Poets for Arabic Poetry using Deep Learning Techniques (LSTM and Bi-LSTM)
title_fullStr Automatic Recognition of Poets for Arabic Poetry using Deep Learning Techniques (LSTM and Bi-LSTM)
title_full_unstemmed Automatic Recognition of Poets for Arabic Poetry using Deep Learning Techniques (LSTM and Bi-LSTM)
title_short Automatic Recognition of Poets for Arabic Poetry using Deep Learning Techniques (LSTM and Bi-LSTM)
title_sort Automatic Recognition of Poets for Arabic Poetry using Deep Learning Techniques (LSTM and Bi-LSTM)
topic Arabic poetry, text classification, RNN, LSTM, Bi-LSTM
url https://bspace.buid.ac.ae/handle/1234/2592