A Deep Learning Model for Identifying and Analyzing Sarcasm and Emotions in Lebanese Arabizi from Instagram and Twitter Data

People use informal language on microblog platforms to share their opinions on products, events, sports, or politics. Moreover, microblog platforms often harbor instances of hate speech and cyberbullying, resulting in a massive amount of data available for natural language processing applications. M...

Full description

Saved in:

Bibliographic Details
Main Author:	Rachid, Jinan (author)
Format:	masterThesis
Published:	2023
Subjects:	Lebanese American University > Dissertations Dissertations, Academic Arabic language > Lexicology > Data processing Arabic language > Transliteration > Data processing Sentiment analysis > Data processing Social media > Data processing
Online Access:	http://hdl.handle.net/10725/15805 https://doi.org/10.26756/th.2023.664 http://libraries.lau.edu.lb/research/laur/terms-of-use/thesis.php
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1864513471751651328
author	Rachid, Jinan
author_facet	Rachid, Jinan
author_role	author
dc.creator.none.fl_str_mv	Rachid, Jinan
dc.date.none.fl_str_mv	2023 2023-12-17 2024-06-27T06:38:38Z 2024-06-27T06:38:38Z
dc.identifier.none.fl_str_mv	http://hdl.handle.net/10725/15805 https://doi.org/10.26756/th.2023.664 http://libraries.lau.edu.lb/research/laur/terms-of-use/thesis.php
dc.language.none.fl_str_mv	en
dc.publisher.none.fl_str_mv	Lebanese American University
dc.rights.*.fl_str_mv	info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv	Lebanese American University--Dissertations Dissertations, Academic Arabic language--Lexicology--Data processing Arabic language--Transliteration--Data processing Sentiment analysis--Data processing Social media--Data processing
dc.title.none.fl_str_mv	A Deep Learning Model for Identifying and Analyzing Sarcasm and Emotions in Lebanese Arabizi from Instagram and Twitter Data
dc.type.none.fl_str_mv	Thesis info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/masterThesis
description	People use informal language on microblog platforms to share their opinions on products, events, sports, or politics. Moreover, microblog platforms often harbor instances of hate speech and cyberbullying, resulting in a massive amount of data available for natural language processing applications. Most studies have predominantly focused on common languages like English for tasks such as hate speech detection, sentiment analysis, and emotion analysis. Dialectal Arabic presents additional challenges due to its morphological richness and complexity, making NLP applications more intricate. While recent research has explored Arabic and Arabizi dialects, there has been limited attention given to Lebanese Arabizi. To address this gap, our objective was to construct a substantial Lebanese Arabizi dataset and make it accessible for NLP research. Additionally, we sought to develop a new approach to Arabizi detection and explored the identification of sarcasm and emotion recognition. The dataset comprised 11,000 rows, a combination of comments collected from Instagram and tweets. We utilized a pre-trained DziriBERT model for Arabizi identification and sarcasm detection, comparing the performances of contextual embedding and semantic embedding models. The word embeddings were then input into a Bidirectional Long Short-Term Memory (BiLSTM) model for emotion recognition. The Arabizi identification model achieved an impressive macro F1 score of 98%, while the sarcasm detection model achieved an average macro F1 score of 63%. This Arabizi detection model not only contributes to expanding the Arabizi dataset but also holds potential for broader applications. Sarcasm detection is crucial for microblog platforms to filter content, particularly since it heavily relies on the manual reporting of offensive material. Additionally, emotion recognition assists companies in understanding customers’ opinions about their products and services.
eu_rights_str_mv	openAccess
format	masterThesis
id	LAURepo_6cc014c2e2acc381f75d913e5ffd08bf
language_invalid_str_mv	en
network_acronym_str	LAURepo
network_name_str	Lebanese American University repository
oai_identifier_str	oai:laur.lau.edu.lb:10725/15805
publishDate	2023
publisher.none.fl_str_mv	Lebanese American University
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
spelling	A Deep Learning Model for Identifying and Analyzing Sarcasm and Emotions in Lebanese Arabizi from Instagram and Twitter DataRachid, JinanLebanese American University--DissertationsDissertations, AcademicArabic language--Lexicology--Data processingArabic language--Transliteration--Data processingSentiment analysis--Data processingSocial media--Data processingPeople use informal language on microblog platforms to share their opinions on products, events, sports, or politics. Moreover, microblog platforms often harbor instances of hate speech and cyberbullying, resulting in a massive amount of data available for natural language processing applications. Most studies have predominantly focused on common languages like English for tasks such as hate speech detection, sentiment analysis, and emotion analysis. Dialectal Arabic presents additional challenges due to its morphological richness and complexity, making NLP applications more intricate. While recent research has explored Arabic and Arabizi dialects, there has been limited attention given to Lebanese Arabizi. To address this gap, our objective was to construct a substantial Lebanese Arabizi dataset and make it accessible for NLP research. Additionally, we sought to develop a new approach to Arabizi detection and explored the identification of sarcasm and emotion recognition. The dataset comprised 11,000 rows, a combination of comments collected from Instagram and tweets. We utilized a pre-trained DziriBERT model for Arabizi identification and sarcasm detection, comparing the performances of contextual embedding and semantic embedding models. The word embeddings were then input into a Bidirectional Long Short-Term Memory (BiLSTM) model for emotion recognition. The Arabizi identification model achieved an impressive macro F1 score of 98%, while the sarcasm detection model achieved an average macro F1 score of 63%. This Arabizi detection model not only contributes to expanding the Arabizi dataset but also holds potential for broader applications. Sarcasm detection is crucial for microblog platforms to filter content, particularly since it heavily relies on the manual reporting of offensive material. Additionally, emotion recognition assists companies in understanding customers’ opinions about their products and services.1 online resource (xi, 58 leaves) : col. ill.Bibliography: leaves 49-58.Lebanese American University2024-06-27T06:38:38Z2024-06-27T06:38:38Z20232023-12-17Thesisinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesishttp://hdl.handle.net/10725/15805https://doi.org/10.26756/th.2023.664http://libraries.lau.edu.lb/research/laur/terms-of-use/thesis.phpeninfo:eu-repo/semantics/openAccessoai:laur.lau.edu.lb:10725/158052024-06-27T06:38:38Z
spellingShingle	A Deep Learning Model for Identifying and Analyzing Sarcasm and Emotions in Lebanese Arabizi from Instagram and Twitter Data Rachid, Jinan Lebanese American University--Dissertations Dissertations, Academic Arabic language--Lexicology--Data processing Arabic language--Transliteration--Data processing Sentiment analysis--Data processing Social media--Data processing
status_str	publishedVersion
title	A Deep Learning Model for Identifying and Analyzing Sarcasm and Emotions in Lebanese Arabizi from Instagram and Twitter Data
title_full	A Deep Learning Model for Identifying and Analyzing Sarcasm and Emotions in Lebanese Arabizi from Instagram and Twitter Data
title_fullStr	A Deep Learning Model for Identifying and Analyzing Sarcasm and Emotions in Lebanese Arabizi from Instagram and Twitter Data
title_full_unstemmed	A Deep Learning Model for Identifying and Analyzing Sarcasm and Emotions in Lebanese Arabizi from Instagram and Twitter Data
title_short	A Deep Learning Model for Identifying and Analyzing Sarcasm and Emotions in Lebanese Arabizi from Instagram and Twitter Data
title_sort	A Deep Learning Model for Identifying and Analyzing Sarcasm and Emotions in Lebanese Arabizi from Instagram and Twitter Data
topic	Lebanese American University--Dissertations Dissertations, Academic Arabic language--Lexicology--Data processing Arabic language--Transliteration--Data processing Sentiment analysis--Data processing Social media--Data processing
url	http://hdl.handle.net/10725/15805 https://doi.org/10.26756/th.2023.664 http://libraries.lau.edu.lb/research/laur/terms-of-use/thesis.php

A Deep Learning Model for Identifying and Analyzing Sarcasm and Emotions in Lebanese Arabizi from Instagram and Twitter Data

Similar Items