Sentiment Analysis of Emirati Dialect

: Recently, extensive studies and research in the Arabic Natural Language Processing (ANLP) field have been conducted for text classification and sentiment analysis. Moreover, the number of studies that target Arabic dialects has also increased. In this research paper, we constructed the first manua...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: A. Al Shamsi, Arwa (author)
مؤلفون آخرون: Abdallah, Sherief (author)
منشور في: 2022
الموضوعات:
الوصول للمادة أونلاين:https://bspace.buid.ac.ae/handle/1234/3116
https://doi.org/10.3390/bdcc6020057.
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
_version_ 1862980611245867008
author A. Al Shamsi, Arwa
author2 Abdallah, Sherief
author2_role author
author_facet A. Al Shamsi, Arwa
Abdallah, Sherief
author_role author
dc.creator.none.fl_str_mv A. Al Shamsi, Arwa
Abdallah, Sherief
dc.date.none.fl_str_mv 2022
2025-05-24T12:45:13Z
2025-05-24T12:45:13Z
dc.identifier.none.fl_str_mv Arwa A. Al Shamsi and Sherief Abdallah (2022) “Sentiment Analysis of Emirati Dialect,” Big Data and Cognitive Computing, 6(2), p. 57.
2504-2289
https://bspace.buid.ac.ae/handle/1234/3116
https://doi.org/10.3390/bdcc6020057.
dc.language.none.fl_str_mv en
dc.publisher.none.fl_str_mv MDPI
dc.relation.none.fl_str_mv Big Data and Cognitive Computingv6 n2 (20220501): 57
dc.subject.none.fl_str_mv corpus; Emirati dataset; Arabic dialects; sentiment analysis; classification; classifiers
dc.title.none.fl_str_mv Sentiment Analysis of Emirati Dialect
dc.type.none.fl_str_mv Article
description : Recently, extensive studies and research in the Arabic Natural Language Processing (ANLP) field have been conducted for text classification and sentiment analysis. Moreover, the number of studies that target Arabic dialects has also increased. In this research paper, we constructed the first manually annotated dataset of the Emirati dialect for the Instagram platform. The constructed dataset consisted of more than 70,000 comments, mostly written in the Emirati dialect. We annotated the comments in the dataset based on text polarity, dividing them into positive, negative, and neutral categories, and the number of annotated comments was 70,000. Moreover, the dataset was also annotated for the dialect type, categorized into the Emirati dialect, Arabic dialects, and MSA. Preprocessing and TF-IDF features extraction approaches were applied to the constructed Emirati dataset to prepare the dataset for the sentiment analysis experiment and improve its classification performance. The sentiment analysis experiment was carried out on both balanced and unbalanced datasets using several machine learning classifiers. The evaluation metrics of the sentiment analysis experiments were accuracy, recall, precision, and f-measure. The results reported that the best accuracy result was 80.80%, and it was achieved when the ensemble model was applied for the sentiment classification of the unbalanced dataset.
id budr_143db8363419a0df8358da05bf09f2a2
identifier_str_mv Arwa A. Al Shamsi and Sherief Abdallah (2022) “Sentiment Analysis of Emirati Dialect,” Big Data and Cognitive Computing, 6(2), p. 57.
2504-2289
language_invalid_str_mv en
network_acronym_str budr
network_name_str The British University in Dubai repository
oai_identifier_str oai:bspace.buid.ac.ae:1234/3116
publishDate 2022
publisher.none.fl_str_mv MDPI
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
spelling Sentiment Analysis of Emirati DialectA. Al Shamsi, ArwaAbdallah, Sheriefcorpus; Emirati dataset; Arabic dialects; sentiment analysis; classification; classifiers: Recently, extensive studies and research in the Arabic Natural Language Processing (ANLP) field have been conducted for text classification and sentiment analysis. Moreover, the number of studies that target Arabic dialects has also increased. In this research paper, we constructed the first manually annotated dataset of the Emirati dialect for the Instagram platform. The constructed dataset consisted of more than 70,000 comments, mostly written in the Emirati dialect. We annotated the comments in the dataset based on text polarity, dividing them into positive, negative, and neutral categories, and the number of annotated comments was 70,000. Moreover, the dataset was also annotated for the dialect type, categorized into the Emirati dialect, Arabic dialects, and MSA. Preprocessing and TF-IDF features extraction approaches were applied to the constructed Emirati dataset to prepare the dataset for the sentiment analysis experiment and improve its classification performance. The sentiment analysis experiment was carried out on both balanced and unbalanced datasets using several machine learning classifiers. The evaluation metrics of the sentiment analysis experiments were accuracy, recall, precision, and f-measure. The results reported that the best accuracy result was 80.80%, and it was achieved when the ensemble model was applied for the sentiment classification of the unbalanced dataset.MDPI2025-05-24T12:45:13Z2025-05-24T12:45:13Z2022ArticleArwa A. Al Shamsi and Sherief Abdallah (2022) “Sentiment Analysis of Emirati Dialect,” Big Data and Cognitive Computing, 6(2), p. 57.2504-2289https://bspace.buid.ac.ae/handle/1234/3116https://doi.org/10.3390/bdcc6020057.enBig Data and Cognitive Computingv6 n2 (20220501): 57oai:bspace.buid.ac.ae:1234/31162025-05-24T12:48:12Z
spellingShingle Sentiment Analysis of Emirati Dialect
A. Al Shamsi, Arwa
corpus; Emirati dataset; Arabic dialects; sentiment analysis; classification; classifiers
title Sentiment Analysis of Emirati Dialect
title_full Sentiment Analysis of Emirati Dialect
title_fullStr Sentiment Analysis of Emirati Dialect
title_full_unstemmed Sentiment Analysis of Emirati Dialect
title_short Sentiment Analysis of Emirati Dialect
title_sort Sentiment Analysis of Emirati Dialect
topic corpus; Emirati dataset; Arabic dialects; sentiment analysis; classification; classifiers
url https://bspace.buid.ac.ae/handle/1234/3116
https://doi.org/10.3390/bdcc6020057.