Investigating Cross-Lingual Hate Speech Detection on Social Media

Social media platforms are becoming an integral part of our life. Massive amounts of content are being uploaded to social media platforms every second by online users. Social media sites are creating an exciting platform for online users to freely express their views, and share news or even thoughts...

Full description

Saved in:
Bibliographic Details
Main Author: KHWILEH, HASSAN YOUSEF (author)
Published: 2020
Subjects:
Online Access:https://bspace.buid.ac.ae/handle/1234/1671
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1862980620780568576
author KHWILEH, HASSAN YOUSEF
author_facet KHWILEH, HASSAN YOUSEF
author_role author
dc.creator.none.fl_str_mv KHWILEH, HASSAN YOUSEF
dc.date.none.fl_str_mv 2020-10-05T13:06:36Z
2020-10-05T13:06:36Z
2020-01
dc.format.none.fl_str_mv application/pdf
dc.identifier.none.fl_str_mv 2015228016
https://bspace.buid.ac.ae/handle/1234/1671
dc.language.none.fl_str_mv en
dc.publisher.none.fl_str_mv The British University in Dubai (BUiD)
dc.subject.none.fl_str_mv hate speech detection
cross-lingual
social media
dc.title.none.fl_str_mv Investigating Cross-Lingual Hate Speech Detection on Social Media
dc.type.none.fl_str_mv Dissertation
description Social media platforms are becoming an integral part of our life. Massive amounts of content are being uploaded to social media platforms every second by online users. Social media sites are creating an exciting platform for online users to freely express their views, and share news or even thoughts and insights about any topic of their interest. Contrarily, social media platforms are becoming the ground for allowing toxic behaviour, online harassment, personal attacking and hate-speech content. This has resulted in many social media users closing their account to maintain their psychological and physical safety. Major social media platforms such as Facebook, Twitter, YouTube are taking this problem very seriously, and making huge efforts and investment to maintain the trust, safety, integrity of the users in their platforms. However, recent research studies conducted in the United States on sample of online users, indicated that over 40% have personally experienced online harassment, and almost every online user is asking major online tech companies to act against it (Pew Research, USA, 2019). With the availability of social media platforms in many languages and across different regions, Hate-speech and online harassment issues are becoming large-scale global problem that is affecting online users around the world. Therefore, there is an increasing demand to advance the current research and development in detecting online hate-speech not only for English but also for other languages. Previous research efforts have mainly focused on tackling hate-speech content for primary languages English, French and others, while very limited work has been done in other emerging languages such as Arabic where Internet penetration is exploding. In this research, we investigate the task of building techniques for detecting online hate speech in Arabic language. Our contribution in this work can be summarised into two parts, the first part is to study the challenges of detecting hate speech for noisy, user-generated informal comments and tweets in Arabic, and the second part is to investigate novel approaches to build effective techniques for tackling this problem.
id budr_09f4d8967af2d8651d26d19fb212474b
identifier_str_mv 2015228016
language_invalid_str_mv en
network_acronym_str budr
network_name_str The British University in Dubai repository
oai_identifier_str oai:bspace.buid.ac.ae:1234/1671
publishDate 2020
publisher.none.fl_str_mv The British University in Dubai (BUiD)
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
spelling Investigating Cross-Lingual Hate Speech Detection on Social MediaKHWILEH, HASSAN YOUSEFhate speech detectioncross-lingualsocial mediaSocial media platforms are becoming an integral part of our life. Massive amounts of content are being uploaded to social media platforms every second by online users. Social media sites are creating an exciting platform for online users to freely express their views, and share news or even thoughts and insights about any topic of their interest. Contrarily, social media platforms are becoming the ground for allowing toxic behaviour, online harassment, personal attacking and hate-speech content. This has resulted in many social media users closing their account to maintain their psychological and physical safety. Major social media platforms such as Facebook, Twitter, YouTube are taking this problem very seriously, and making huge efforts and investment to maintain the trust, safety, integrity of the users in their platforms. However, recent research studies conducted in the United States on sample of online users, indicated that over 40% have personally experienced online harassment, and almost every online user is asking major online tech companies to act against it (Pew Research, USA, 2019). With the availability of social media platforms in many languages and across different regions, Hate-speech and online harassment issues are becoming large-scale global problem that is affecting online users around the world. Therefore, there is an increasing demand to advance the current research and development in detecting online hate-speech not only for English but also for other languages. Previous research efforts have mainly focused on tackling hate-speech content for primary languages English, French and others, while very limited work has been done in other emerging languages such as Arabic where Internet penetration is exploding. In this research, we investigate the task of building techniques for detecting online hate speech in Arabic language. Our contribution in this work can be summarised into two parts, the first part is to study the challenges of detecting hate speech for noisy, user-generated informal comments and tweets in Arabic, and the second part is to investigate novel approaches to build effective techniques for tackling this problem.The British University in Dubai (BUiD)2020-10-05T13:06:36Z2020-10-05T13:06:36Z2020-01Dissertationapplication/pdf2015228016https://bspace.buid.ac.ae/handle/1234/1671enoai:bspace.buid.ac.ae:1234/16712021-09-22T13:09:44Z
spellingShingle Investigating Cross-Lingual Hate Speech Detection on Social Media
KHWILEH, HASSAN YOUSEF
hate speech detection
cross-lingual
social media
title Investigating Cross-Lingual Hate Speech Detection on Social Media
title_full Investigating Cross-Lingual Hate Speech Detection on Social Media
title_fullStr Investigating Cross-Lingual Hate Speech Detection on Social Media
title_full_unstemmed Investigating Cross-Lingual Hate Speech Detection on Social Media
title_short Investigating Cross-Lingual Hate Speech Detection on Social Media
title_sort Investigating Cross-Lingual Hate Speech Detection on Social Media
topic hate speech detection
cross-lingual
social media
url https://bspace.buid.ac.ae/handle/1234/1671