Towards Gulf Emirati Dialect Corpus from Social Media

This open access book presents contributions on a wide range of scientific areas originating from the BUiD Doctoral Research Conference (BDRC 2023)

Bibliographic Details
Main Author: AlAzzam, Bayan A. (author)
Other Authors: Alkhatib, Manar (author), Shaalan, Khaled (author)
Published: 2024
Subjects: Emirati dialect, natural language processing, corpus
Online Access: https://bspace.buid.ac.ae/handle/1234/2565
https://link.springer.com/chapter/10.1007/978-3-031-56121-4_27
_version_ 1862980610145910784
author AlAzzam, Bayan A.
author2 Alkhatib, Manar
Shaalan, Khaled
author2_role author
author
author_facet AlAzzam, Bayan A.
Alkhatib, Manar
Shaalan, Khaled
author_role author
dc.creator.none.fl_str_mv AlAzzam, Bayan A.
Alkhatib, Manar
Shaalan, Khaled
dc.date.none.fl_str_mv 2024-04-05T12:27:14Z
2024-04-05T12:27:14Z
2024
dc.identifier.none.fl_str_mv AlAzzam, B.A., Alkhatib, M., Shaalan, K. (2024). Towards Gulf Emirati Dialect Corpus from Social Media. In: Al Marri, K., Mir, F.A., David, S.A., Al-Emran, M. (eds) BUiD Doctoral Research Conference 2023. Lecture Notes in Civil Engineering, vol 473. Springer, Cham. https://doi.org/10.1007/978-3-031-56121-4_27
Print: 978-3031561207 Online: 978-3031561214
https://bspace.buid.ac.ae/handle/1234/2565
https://link.springer.com/chapter/10.1007/978-3-031-56121-4_27
dc.language.none.fl_str_mv en
dc.publisher.none.fl_str_mv SpringerLink
dc.relation.none.fl_str_mv https://link.springer.com/chapter/10.1007/978-3-031-56121-4_27
dc.subject.none.fl_str_mv Emirati dialect, natural language processing, corpus
dc.title.none.fl_str_mv Towards Gulf Emirati Dialect Corpus from Social Media
dc.type.none.fl_str_mv Conference paper
description This open access book presents contributions on a wide range of scientific areas originating from the BUiD Doctoral Research Conference (BDRC 2023)
id budr_9b26e958d509f3d78f0cad47cbd2dfbf
identifier_str_mv AlAzzam, B.A., Alkhatib, M., Shaalan, K. (2024). Towards Gulf Emirati Dialect Corpus from Social Media. In: Al Marri, K., Mir, F.A., David, S.A., Al-Emran, M. (eds) BUiD Doctoral Research Conference 2023. Lecture Notes in Civil Engineering, vol 473. Springer, Cham. https://doi.org/10.1007/978-3-031-56121-4_27
Print: 978-3031561207 Online: 978-3031561214
language_invalid_str_mv en
network_acronym_str budr
network_name_str The British University in Dubai repository
oai_identifier_str oai:bspace.buid.ac.ae:1234/2565
publishDate 2024
publisher.none.fl_str_mv SpringerLink
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
spelling Towards Gulf Emirati Dialect Corpus from Social Media
AlAzzam, Bayan A.
Alkhatib, Manar
Shaalan, Khaled
Emirati dialect, natural language processing, corpus
This open access book presents contributions on a wide range of scientific areas originating from the BUiD Doctoral Research Conference (BDRC 2023)
Purpose: This paper discusses the need for a corpus of traditional Emirati phrases and idioms in natural language processing (NLP) for the Gulf Emirati dialect and its potential applications in fields like voice recognition, machine translation, and sentiment analysis.
Methodology: The researchers collected a corpus of more than 3000 traditional Emirati words and idioms by gathering data from several social media platforms, such as forums, YouTube, and Emirati radio stations. In addition, the researchers used web scraping tools to collect suitable resources, then cleaned and organised the gathered material to ensure accuracy and consistency. A pilot investigation involving a native speaker of the Emirati dialect was undertaken to verify the accuracy of the dataset.
Findings: The researchers successfully compiled a substantial dataset of traditional Emirati phrases and idioms, thereby enabling future investigations of Arabic dialects, specifically Gulf Arabic dialects such as the Emirati dialect.
Implications: The compilation of traditional Emirati idioms and words presented in this study has potential practical effects in several domains such as medicine, education, and business. These implications mostly revolve around enhancing communication among and with individuals proficient in the Emirati language.
Originality/Value: This study distinguishes itself by concentrating on the compilation of an NLP corpus comprising traditional Emirati phrases and idioms, with a specific emphasis on the Gulf Emirati dialect. The dataset generated as a result of this effort may prove indispensable for further studies into Arabic dialects.
SpringerLink
2024-04-05T12:27:14Z
2024-04-05T12:27:14Z
2024
Conference paper
AlAzzam, B.A., Alkhatib, M., Shaalan, K. (2024). Towards Gulf Emirati Dialect Corpus from Social Media. In: Al Marri, K., Mir, F.A., David, S.A., Al-Emran, M. (eds) BUiD Doctoral Research Conference 2023. Lecture Notes in Civil Engineering, vol 473. Springer, Cham. https://doi.org/10.1007/978-3-031-56121-4_27
Print: 978-3031561207 Online: 978-3031561214
https://bspace.buid.ac.ae/handle/1234/2565
https://link.springer.com/chapter/10.1007/978-3-031-56121-4_27
en
https://link.springer.com/chapter/10.1007/978-3-031-56121-4_27
oai:bspace.buid.ac.ae:1234/2565
2024-04-16T05:23:25Z
spellingShingle Towards Gulf Emirati Dialect Corpus from Social Media
AlAzzam, Bayan A.
Emirati dialect, natural language processing, corpus
title Towards Gulf Emirati Dialect Corpus from Social Media
title_full Towards Gulf Emirati Dialect Corpus from Social Media
title_fullStr Towards Gulf Emirati Dialect Corpus from Social Media
title_full_unstemmed Towards Gulf Emirati Dialect Corpus from Social Media
title_short Towards Gulf Emirati Dialect Corpus from Social Media
title_sort Towards Gulf Emirati Dialect Corpus from Social Media
topic Emirati dialect, natural language processing, corpus
url https://bspace.buid.ac.ae/handle/1234/2565
https://link.springer.com/chapter/10.1007/978-3-031-56121-4_27