Optimizing malicious website prediction: An advanced XGBoost-based machine learning model

<p dir="ltr">In the substantial area of the Internet, some websites can be quite harmful and troublesome for both individuals and businesses. Our methods for identifying and forecasting these malicious websites are not always reliable; they can be slow and inaccurate. What if you had...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Sumaira Hussain (19259669) (author)
مؤلفون آخرون: Islam Zada (21755819) (author), Moutaz Alazab (17730060) (author), Hessa Alfraihi (21755825) (author), Manal Aldhayan (22330930) (author), Inam Ullah (5227166) (author), Mohammad Asmat Ullah Khan (22330933) (author)
منشور في: 2025
الموضوعات:
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
_version_ 1864513537990197248
author Sumaira Hussain (19259669)
author2 Islam Zada (21755819)
Moutaz Alazab (17730060)
Hessa Alfraihi (21755825)
Manal Aldhayan (22330930)
Inam Ullah (5227166)
Mohammad Asmat Ullah Khan (22330933)
author2_role author
author
author
author
author
author
author_facet Sumaira Hussain (19259669)
Islam Zada (21755819)
Moutaz Alazab (17730060)
Hessa Alfraihi (21755825)
Manal Aldhayan (22330930)
Inam Ullah (5227166)
Mohammad Asmat Ullah Khan (22330933)
author_role author
dc.creator.none.fl_str_mv Sumaira Hussain (19259669)
Islam Zada (21755819)
Moutaz Alazab (17730060)
Hessa Alfraihi (21755825)
Manal Aldhayan (22330930)
Inam Ullah (5227166)
Mohammad Asmat Ullah Khan (22330933)
dc.date.none.fl_str_mv 2025-05-19T09:00:00Z
dc.identifier.none.fl_str_mv 10.1515/nleng-2024-0069
dc.relation.none.fl_str_mv https://figshare.com/articles/journal_contribution/Optimizing_malicious_website_prediction_An_advanced_XGBoost-based_machine_learning_model/30234547
dc.rights.none.fl_str_mv CC BY 4.0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Information and computing sciences
Applied computing
Cybersecurity and privacy
Data management and data science
Machine learning
software privacy
software defects
machine learning
classification and prediction
malicious websites
websites dataset
data processing and modeling
dc.title.none.fl_str_mv Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
dc.type.none.fl_str_mv Text
Journal contribution
info:eu-repo/semantics/publishedVersion
text
contribution to journal
description <p dir="ltr">In the substantial area of the Internet, some websites can be quite harmful and troublesome for both individuals and businesses. Our methods for identifying and forecasting these malicious websites are not always reliable; they can be slow and inaccurate. What if you had technology that could alert you to websites that pose a risk before issues arise? We are attempting to make that a reality. Despite the availability of online resources and additional research, there is a gap in our understanding. The existing approaches are not always the greatest; thus, we need a clear and consistent technique to use smart computers to forecast malicious websites. This makes it difficult for others attempting comparable tasks to compare our findings with theirs. Therefore, the study aims to close this disparity. With the help of an intelligent technology known as the XGBoost classifier and a set of data from the data-rich website Kaggle, we have devised a comprehensive strategy to address the issues identified with the existing methods for identifying and predicting malicious websites. The main objective of this study is to improve upon the current methods for identifying and predicting malicious websites. Our methodology includes data collection, data cleaning, and the application of the XGBoost classifier. To ensure accuracy, we validated our findings with thorough performance evaluations. Our approach achieved an impressive accuracy score of 95.5%, significantly outperforming previous methods. This study not only demonstrates the effectiveness of the XGBoost approach but also provides guidance for other researchers looking to identify and predict malicious websites more accurately, contributing to a safer internet environment.</p><h2>Other Information</h2><p dir="ltr">Published in: Nonlinear Engineering<br>License: <a href="http://creativecommons.org/licenses/by/4.0" target="_blank">http://creativecommons.org/licenses/by/4.0</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1515/nleng-2024-0069" target="_blank">https://dx.doi.org/10.1515/nleng-2024-0069</a></p>
eu_rights_str_mv openAccess
id Manara2_105b581fd4a71dcd1d5a954587043230
identifier_str_mv 10.1515/nleng-2024-0069
network_acronym_str Manara2
network_name_str Manara2
oai_identifier_str oai:figshare.com:article/30234547
publishDate 2025
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv CC BY 4.0
spelling Optimizing malicious website prediction: An advanced XGBoost-based machine learning modelSumaira Hussain (19259669)Islam Zada (21755819)Moutaz Alazab (17730060)Hessa Alfraihi (21755825)Manal Aldhayan (22330930)Inam Ullah (5227166)Mohammad Asmat Ullah Khan (22330933)Information and computing sciencesApplied computingCybersecurity and privacyData management and data scienceMachine learningsoftware privacysoftware defectsmachine learningclassification and predictionmalicious websiteswebsites datasetdata processing and modeling<p dir="ltr">In the substantial area of the Internet, some websites can be quite harmful and troublesome for both individuals and businesses. Our methods for identifying and forecasting these malicious websites are not always reliable; they can be slow and inaccurate. What if you had technology that could alert you to websites that pose a risk before issues arise? We are attempting to make that a reality. Despite the availability of online resources and additional research, there is a gap in our understanding. The existing approaches are not always the greatest; thus, we need a clear and consistent technique to use smart computers to forecast malicious websites. This makes it difficult for others attempting comparable tasks to compare our findings with theirs. Therefore, the study aims to close this disparity. With the help of an intelligent technology known as the XGBoost classifier and a set of data from the data-rich website Kaggle, we have devised a comprehensive strategy to address the issues identified with the existing methods for identifying and predicting malicious websites. The main objective of this study is to improve upon the current methods for identifying and predicting malicious websites. Our methodology includes data collection, data cleaning, and the application of the XGBoost classifier. To ensure accuracy, we validated our findings with thorough performance evaluations. Our approach achieved an impressive accuracy score of 95.5%, significantly outperforming previous methods. This study not only demonstrates the effectiveness of the XGBoost approach but also provides guidance for other researchers looking to identify and predict malicious websites more accurately, contributing to a safer internet environment.</p><h2>Other Information</h2><p dir="ltr">Published in: Nonlinear Engineering<br>License: <a href="http://creativecommons.org/licenses/by/4.0" target="_blank">http://creativecommons.org/licenses/by/4.0</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1515/nleng-2024-0069" target="_blank">https://dx.doi.org/10.1515/nleng-2024-0069</a></p>2025-05-19T09:00:00ZTextJournal contributioninfo:eu-repo/semantics/publishedVersiontextcontribution to journal10.1515/nleng-2024-0069https://figshare.com/articles/journal_contribution/Optimizing_malicious_website_prediction_An_advanced_XGBoost-based_machine_learning_model/30234547CC BY 4.0info:eu-repo/semantics/openAccessoai:figshare.com:article/302345472025-05-19T09:00:00Z
spellingShingle Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
Sumaira Hussain (19259669)
Information and computing sciences
Applied computing
Cybersecurity and privacy
Data management and data science
Machine learning
software privacy
software defects
machine learning
classification and prediction
malicious websites
websites dataset
data processing and modeling
status_str publishedVersion
title Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
title_full Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
title_fullStr Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
title_full_unstemmed Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
title_short Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
title_sort Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
topic Information and computing sciences
Applied computing
Cybersecurity and privacy
Data management and data science
Machine learning
software privacy
software defects
machine learning
classification and prediction
malicious websites
websites dataset
data processing and modeling