Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter

<p>In recent years, there has been an increased prevalence of adopting state-sponsored trolls by governments and political organizations to influence public opinion through disinformation campaigns on social media platforms. This phenomenon negatively affects the political process, causes dist...

Full description

Saved in:
Bibliographic Details
Main Author: Saleh Alhazbi (16869960) (author)
Published: 2020
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1864513561279070208
author Saleh Alhazbi (16869960)
author_facet Saleh Alhazbi (16869960)
author_role author
dc.creator.none.fl_str_mv Saleh Alhazbi (16869960)
dc.date.none.fl_str_mv 2020-10-26T00:00:00Z
dc.identifier.none.fl_str_mv 10.1109/access.2020.3033666
dc.relation.none.fl_str_mv https://figshare.com/articles/journal_contribution/Behavior-Based_Machine_Learning_Approaches_to_Identify_State-Sponsored_Trolls_on_Twitter/24015915
dc.rights.none.fl_str_mv CC BY 4.0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Human society
Political science
Information and computing sciences
Distributed computing and systems software
Human-centred computing
Machine learning
Twitter
Machine learning
Government
Feature extraction
Information integrity
State-sponsored trolls
Disinformation
Propaganda
Behavioral pattern
dc.title.none.fl_str_mv Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
dc.type.none.fl_str_mv Text
Journal contribution
info:eu-repo/semantics/publishedVersion
text
contribution to journal
description <p>In recent years, there has been an increased prevalence of adopting state-sponsored trolls by governments and political organizations to influence public opinion through disinformation campaigns on social media platforms. This phenomenon negatively affects the political process, causes distrust in the political systems, sows discord within societies, and hastens political polarization. Thus, there is a need to develop automated approaches to identify sponsored-troll accounts on social media in order to mitigate their impacts on the political process and to protect people against opinion manipulation. In this paper, we argue that behaviors of sponsored-troll accounts on social media are different from ordinary users' because of their extrinsic motivation, and they cannot completely hide their suspicious behaviors, therefore these accounts can be identified using machine learning approaches based solely on their behaviors on the social media platforms. We have proposed a set of behavioral features of users' activities on Twitter. Based on these features, we developed four classification models to identify political troll accounts, these models are based on decision tree, random forest, Adaboost, and gradient boost algorithms. The models were trained and evaluated on a set of Saudi trolls disclosed by Twitter in 2019, the overall classification accuracy reaches up to 94.4%. The models also are capable to identify the Russian trolls with accuracy up to 72.6% without training on this set of trolls. This indicates that although the strategies of coordinated trolls might vary from an organization to another, they are all just employees and have common behaviors that can be identified.</p><h2>Other Information</h2><p>Published in: IEEE Access<br>License: <a href="https://creativecommons.org/licenses/by/4.0/legalcode" target="_blank">https://creativecommons.org/licenses/by/4.0/</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1109/access.2020.3033666" target="_blank">https://dx.doi.org/10.1109/access.2020.3033666</a></p>
eu_rights_str_mv openAccess
id Manara2_738b3e2eddd5bd0b4ed35e15194aa94b
identifier_str_mv 10.1109/access.2020.3033666
network_acronym_str Manara2
network_name_str Manara2
oai_identifier_str oai:figshare.com:article/24015915
publishDate 2020
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv CC BY 4.0
spelling Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on TwitterSaleh Alhazbi (16869960)Human societyPolitical scienceInformation and computing sciencesDistributed computing and systems softwareHuman-centred computingMachine learningTwitterMachine learningGovernmentFeature extractionInformation integrityState-sponsored trollsDisinformationPropagandaBehavioral pattern<p>In recent years, there has been an increased prevalence of adopting state-sponsored trolls by governments and political organizations to influence public opinion through disinformation campaigns on social media platforms. This phenomenon negatively affects the political process, causes distrust in the political systems, sows discord within societies, and hastens political polarization. Thus, there is a need to develop automated approaches to identify sponsored-troll accounts on social media in order to mitigate their impacts on the political process and to protect people against opinion manipulation. In this paper, we argue that behaviors of sponsored-troll accounts on social media are different from ordinary users' because of their extrinsic motivation, and they cannot completely hide their suspicious behaviors, therefore these accounts can be identified using machine learning approaches based solely on their behaviors on the social media platforms. We have proposed a set of behavioral features of users' activities on Twitter. Based on these features, we developed four classification models to identify political troll accounts, these models are based on decision tree, random forest, Adaboost, and gradient boost algorithms. The models were trained and evaluated on a set of Saudi trolls disclosed by Twitter in 2019, the overall classification accuracy reaches up to 94.4%. The models also are capable to identify the Russian trolls with accuracy up to 72.6% without training on this set of trolls. This indicates that although the strategies of coordinated trolls might vary from an organization to another, they are all just employees and have common behaviors that can be identified.</p><h2>Other Information</h2><p>Published in: IEEE Access<br>License: <a href="https://creativecommons.org/licenses/by/4.0/legalcode" target="_blank">https://creativecommons.org/licenses/by/4.0/</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1109/access.2020.3033666" target="_blank">https://dx.doi.org/10.1109/access.2020.3033666</a></p>2020-10-26T00:00:00ZTextJournal contributioninfo:eu-repo/semantics/publishedVersiontextcontribution to journal10.1109/access.2020.3033666https://figshare.com/articles/journal_contribution/Behavior-Based_Machine_Learning_Approaches_to_Identify_State-Sponsored_Trolls_on_Twitter/24015915CC BY 4.0info:eu-repo/semantics/openAccessoai:figshare.com:article/240159152020-10-26T00:00:00Z
spellingShingle Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
Saleh Alhazbi (16869960)
Human society
Political science
Information and computing sciences
Distributed computing and systems software
Human-centred computing
Machine learning
Twitter
Machine learning
Government
Feature extraction
Information integrity
State-sponsored trolls
Disinformation
Propaganda
Behavioral pattern
status_str publishedVersion
title Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
title_full Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
title_fullStr Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
title_full_unstemmed Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
title_short Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
title_sort Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
topic Human society
Political science
Information and computing sciences
Distributed computing and systems software
Human-centred computing
Machine learning
Twitter
Machine learning
Government
Feature extraction
Information integrity
State-sponsored trolls
Disinformation
Propaganda
Behavioral pattern