Number of tweets collected per query and type.

<div><p>The main objective of this study is to describe the process of collecting data extracted from Twitter (X) during the Brazilian presidential elections in 2022, encompassing the post-election period and the event of the attack on the buildings of the executive, legislative, and jud...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Sylvia Iasulaitis (8301189) (author)
مؤلفون آخرون: Alan Demétrius Baria Valejo (20660639) (author), Bruno Cardoso Greco (20660642) (author), Vinicius Gonçalves Perillo (20660645) (author), Guilherme Henrique Messias (20660648) (author), Isabella Vicari (20660651) (author)
منشور في: 2025
الموضوعات:
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
_version_ 1852023070859984896
author Sylvia Iasulaitis (8301189)
author2 Alan Demétrius Baria Valejo (20660639)
Bruno Cardoso Greco (20660642)
Vinicius Gonçalves Perillo (20660645)
Guilherme Henrique Messias (20660648)
Isabella Vicari (20660651)
author2_role author
author
author
author
author
author_facet Sylvia Iasulaitis (8301189)
Alan Demétrius Baria Valejo (20660639)
Bruno Cardoso Greco (20660642)
Vinicius Gonçalves Perillo (20660645)
Guilherme Henrique Messias (20660648)
Isabella Vicari (20660651)
author_role author
dc.creator.none.fl_str_mv Sylvia Iasulaitis (8301189)
Alan Demétrius Baria Valejo (20660639)
Bruno Cardoso Greco (20660642)
Vinicius Gonçalves Perillo (20660645)
Guilherme Henrique Messias (20660648)
Isabella Vicari (20660651)
dc.date.none.fl_str_mv 2025-02-03T18:27:46Z
dc.identifier.none.fl_str_mv 10.1371/journal.pone.0316626.g003
dc.relation.none.fl_str_mv https://figshare.com/articles/figure/Number_of_tweets_collected_per_query_and_type_/28335977
dc.rights.none.fl_str_mv CC BY 4.0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Biotechnology
Science Policy
Biological Sciences not elsewhere classified
Information Systems not elsewhere classified
three major stages
primary collection type
extracting valuable information
brazilian presidential elections
available api keys
public &# 8221
data collection strategy
big social data
big data standards
named &# 8220
brazil </ p
282 million tweets
2022 presidential elections
collecting data extracted
&# 8220
xlink ">
trivial due
sociopolitical areas
several processes
python algorithms
preliminary analysis
political purposes
main objective
judiciary branches
january 2023
interdisciplinary nature
informational studies
extensive dataset
election period
dataset created
contextual analysis
br ),
dc.title.none.fl_str_mv Number of tweets collected per query and type.
dc.type.none.fl_str_mv Image
Figure
info:eu-repo/semantics/publishedVersion
image
description <div><p>The main objective of this study is to describe the process of collecting data extracted from Twitter (X) during the Brazilian presidential elections in 2022, encompassing the post-election period and the event of the attack on the buildings of the executive, legislative, and judiciary branches in January 2023. The work of collecting data took one year. Additionally, the study provides an overview of the general characteristics of the dataset created from 282 million tweets, named “The Interfaces Twitter Elections Dataset” (ITED-Br), the third most extensive dataset of tweets with political purposes. The process of collecting and creating the database for this study went through three major stages, subdivided into several processes: (1) A preliminary analysis of the platform and its operation; (2) Contextual analysis, creation of the conceptual model, and definition of Keywords and (3) Implementation of the Data Collection Strategy. Python algorithms were developed to model each primary collection type. The “token farm” algorithm, was employed to iterate over available API keys. While Twitter is generally a “public” access platform and fits into big data standards, extracting valuable information is not trivial due to the volume, speed, and heterogeneity of data. This study concludes that acquiring informational value requires expertise not only in sociopolitical areas but also in computational and informational studies, highlighting the interdisciplinary nature of such research.</p></div>
eu_rights_str_mv openAccess
id Manara_9a373ea9bdfd90d06f51b2cc3d0849ba
identifier_str_mv 10.1371/journal.pone.0316626.g003
network_acronym_str Manara
network_name_str ManaraRepo
oai_identifier_str oai:figshare.com:article/28335977
publishDate 2025
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv CC BY 4.0
spelling Number of tweets collected per query and type.Sylvia Iasulaitis (8301189)Alan Demétrius Baria Valejo (20660639)Bruno Cardoso Greco (20660642)Vinicius Gonçalves Perillo (20660645)Guilherme Henrique Messias (20660648)Isabella Vicari (20660651)BiotechnologyScience PolicyBiological Sciences not elsewhere classifiedInformation Systems not elsewhere classifiedthree major stagesprimary collection typeextracting valuable informationbrazilian presidential electionsavailable api keyspublic &# 8221data collection strategybig social databig data standardsnamed &# 8220brazil </ p282 million tweets2022 presidential electionscollecting data extracted&# 8220xlink ">trivial duesociopolitical areasseveral processespython algorithmspreliminary analysispolitical purposesmain objectivejudiciary branchesjanuary 2023interdisciplinary natureinformational studiesextensive datasetelection perioddataset createdcontextual analysisbr ),<div><p>The main objective of this study is to describe the process of collecting data extracted from Twitter (X) during the Brazilian presidential elections in 2022, encompassing the post-election period and the event of the attack on the buildings of the executive, legislative, and judiciary branches in January 2023. The work of collecting data took one year. Additionally, the study provides an overview of the general characteristics of the dataset created from 282 million tweets, named “The Interfaces Twitter Elections Dataset” (ITED-Br), the third most extensive dataset of tweets with political purposes. The process of collecting and creating the database for this study went through three major stages, subdivided into several processes: (1) A preliminary analysis of the platform and its operation; (2) Contextual analysis, creation of the conceptual model, and definition of Keywords and (3) Implementation of the Data Collection Strategy. Python algorithms were developed to model each primary collection type. The “token farm” algorithm, was employed to iterate over available API keys. While Twitter is generally a “public” access platform and fits into big data standards, extracting valuable information is not trivial due to the volume, speed, and heterogeneity of data. This study concludes that acquiring informational value requires expertise not only in sociopolitical areas but also in computational and informational studies, highlighting the interdisciplinary nature of such research.</p></div>2025-02-03T18:27:46ZImageFigureinfo:eu-repo/semantics/publishedVersionimage10.1371/journal.pone.0316626.g003https://figshare.com/articles/figure/Number_of_tweets_collected_per_query_and_type_/28335977CC BY 4.0info:eu-repo/semantics/openAccessoai:figshare.com:article/283359772025-02-03T18:27:46Z
spellingShingle Number of tweets collected per query and type.
Sylvia Iasulaitis (8301189)
Biotechnology
Science Policy
Biological Sciences not elsewhere classified
Information Systems not elsewhere classified
three major stages
primary collection type
extracting valuable information
brazilian presidential elections
available api keys
public &# 8221
data collection strategy
big social data
big data standards
named &# 8220
brazil </ p
282 million tweets
2022 presidential elections
collecting data extracted
&# 8220
xlink ">
trivial due
sociopolitical areas
several processes
python algorithms
preliminary analysis
political purposes
main objective
judiciary branches
january 2023
interdisciplinary nature
informational studies
extensive dataset
election period
dataset created
contextual analysis
br ),
status_str publishedVersion
title Number of tweets collected per query and type.
title_full Number of tweets collected per query and type.
title_fullStr Number of tweets collected per query and type.
title_full_unstemmed Number of tweets collected per query and type.
title_short Number of tweets collected per query and type.
title_sort Number of tweets collected per query and type.
topic Biotechnology
Science Policy
Biological Sciences not elsewhere classified
Information Systems not elsewhere classified
three major stages
primary collection type
extracting valuable information
brazilian presidential elections
available api keys
public &# 8221
data collection strategy
big social data
big data standards
named &# 8220
brazil </ p
282 million tweets
2022 presidential elections
collecting data extracted
&# 8220
xlink ">
trivial due
sociopolitical areas
several processes
python algorithms
preliminary analysis
political purposes
main objective
judiciary branches
january 2023
interdisciplinary nature
informational studies
extensive dataset
election period
dataset created
contextual analysis
br ),