Optimizing malicious website prediction: An advanced XGBoost-based machine learning model

<p dir="ltr">In the substantial area of the Internet, some websites can be quite harmful and troublesome for both individuals and businesses. Our methods for identifying and forecasting these malicious websites are not always reliable; they can be slow and inaccurate. What if you had...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	Sumaira Hussain (19259669) (author)
مؤلفون آخرون:	Islam Zada (21755819) (author), Moutaz Alazab (17730060) (author), Hessa Alfraihi (21755825) (author), Manal Aldhayan (22330930) (author), Inam Ullah (5227166) (author), Mohammad Asmat Ullah Khan (22330933) (author)
منشور في:	2025
الموضوعات:	Information and computing sciences Applied computing Cybersecurity and privacy Data management and data science Machine learning software privacy software defects machine learning classification and prediction malicious websites websites dataset data processing and modeling
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!

_version_	1864513537990197248
author	Sumaira Hussain (19259669)
author2	Islam Zada (21755819) Moutaz Alazab (17730060) Hessa Alfraihi (21755825) Manal Aldhayan (22330930) Inam Ullah (5227166) Mohammad Asmat Ullah Khan (22330933)
author2_role	author author author author author author
author_facet	Sumaira Hussain (19259669) Islam Zada (21755819) Moutaz Alazab (17730060) Hessa Alfraihi (21755825) Manal Aldhayan (22330930) Inam Ullah (5227166) Mohammad Asmat Ullah Khan (22330933)
author_role	author
dc.creator.none.fl_str_mv	Sumaira Hussain (19259669) Islam Zada (21755819) Moutaz Alazab (17730060) Hessa Alfraihi (21755825) Manal Aldhayan (22330930) Inam Ullah (5227166) Mohammad Asmat Ullah Khan (22330933)
dc.date.none.fl_str_mv	2025-05-19T09:00:00Z
dc.identifier.none.fl_str_mv	10.1515/nleng-2024-0069
dc.relation.none.fl_str_mv	https://figshare.com/articles/journal_contribution/Optimizing_malicious_website_prediction_An_advanced_XGBoost-based_machine_learning_model/30234547
dc.rights.none.fl_str_mv	CC BY 4.0 info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv	Information and computing sciences Applied computing Cybersecurity and privacy Data management and data science Machine learning software privacy software defects machine learning classification and prediction malicious websites websites dataset data processing and modeling
dc.title.none.fl_str_mv	Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
dc.type.none.fl_str_mv	Text Journal contribution info:eu-repo/semantics/publishedVersion text contribution to journal
description	<p dir="ltr">In the substantial area of the Internet, some websites can be quite harmful and troublesome for both individuals and businesses. Our methods for identifying and forecasting these malicious websites are not always reliable; they can be slow and inaccurate. What if you had technology that could alert you to websites that pose a risk before issues arise? We are attempting to make that a reality. Despite the availability of online resources and additional research, there is a gap in our understanding. The existing approaches are not always the greatest; thus, we need a clear and consistent technique to use smart computers to forecast malicious websites. This makes it difficult for others attempting comparable tasks to compare our findings with theirs. Therefore, the study aims to close this disparity. With the help of an intelligent technology known as the XGBoost classifier and a set of data from the data-rich website Kaggle, we have devised a comprehensive strategy to address the issues identified with the existing methods for identifying and predicting malicious websites. The main objective of this study is to improve upon the current methods for identifying and predicting malicious websites. Our methodology includes data collection, data cleaning, and the application of the XGBoost classifier. To ensure accuracy, we validated our findings with thorough performance evaluations. Our approach achieved an impressive accuracy score of 95.5%, significantly outperforming previous methods. This study not only demonstrates the effectiveness of the XGBoost approach but also provides guidance for other researchers looking to identify and predict malicious websites more accurately, contributing to a safer internet environment.</p><h2>Other Information</h2><p dir="ltr">Published in: Nonlinear Engineering<br>License: <a href="http://creativecommons.org/licenses/by/4.0" target="_blank">http://creativecommons.org/licenses/by/4.0</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1515/nleng-2024-0069" target="_blank">https://dx.doi.org/10.1515/nleng-2024-0069</a></p>
eu_rights_str_mv	openAccess
id	Manara2_105b581fd4a71dcd1d5a954587043230
identifier_str_mv	10.1515/nleng-2024-0069
network_acronym_str	Manara2
network_name_str	Manara2
oai_identifier_str	oai:figshare.com:article/30234547
publishDate	2025
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv	CC BY 4.0
spelling	Optimizing malicious website prediction: An advanced XGBoost-based machine learning modelSumaira Hussain (19259669)Islam Zada (21755819)Moutaz Alazab (17730060)Hessa Alfraihi (21755825)Manal Aldhayan (22330930)Inam Ullah (5227166)Mohammad Asmat Ullah Khan (22330933)Information and computing sciencesApplied computingCybersecurity and privacyData management and data scienceMachine learningsoftware privacysoftware defectsmachine learningclassification and predictionmalicious websiteswebsites datasetdata processing and modeling<p dir="ltr">In the substantial area of the Internet, some websites can be quite harmful and troublesome for both individuals and businesses. Our methods for identifying and forecasting these malicious websites are not always reliable; they can be slow and inaccurate. What if you had technology that could alert you to websites that pose a risk before issues arise? We are attempting to make that a reality. Despite the availability of online resources and additional research, there is a gap in our understanding. The existing approaches are not always the greatest; thus, we need a clear and consistent technique to use smart computers to forecast malicious websites. This makes it difficult for others attempting comparable tasks to compare our findings with theirs. Therefore, the study aims to close this disparity. With the help of an intelligent technology known as the XGBoost classifier and a set of data from the data-rich website Kaggle, we have devised a comprehensive strategy to address the issues identified with the existing methods for identifying and predicting malicious websites. The main objective of this study is to improve upon the current methods for identifying and predicting malicious websites. Our methodology includes data collection, data cleaning, and the application of the XGBoost classifier. To ensure accuracy, we validated our findings with thorough performance evaluations. Our approach achieved an impressive accuracy score of 95.5%, significantly outperforming previous methods. This study not only demonstrates the effectiveness of the XGBoost approach but also provides guidance for other researchers looking to identify and predict malicious websites more accurately, contributing to a safer internet environment.</p><h2>Other Information</h2><p dir="ltr">Published in: Nonlinear Engineering<br>License: <a href="http://creativecommons.org/licenses/by/4.0" target="_blank">http://creativecommons.org/licenses/by/4.0</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1515/nleng-2024-0069" target="_blank">https://dx.doi.org/10.1515/nleng-2024-0069</a></p>2025-05-19T09:00:00ZTextJournal contributioninfo:eu-repo/semantics/publishedVersiontextcontribution to journal10.1515/nleng-2024-0069https://figshare.com/articles/journal_contribution/Optimizing_malicious_website_prediction_An_advanced_XGBoost-based_machine_learning_model/30234547CC BY 4.0info:eu-repo/semantics/openAccessoai:figshare.com:article/302345472025-05-19T09:00:00Z
spellingShingle	Optimizing malicious website prediction: An advanced XGBoost-based machine learning model Sumaira Hussain (19259669) Information and computing sciences Applied computing Cybersecurity and privacy Data management and data science Machine learning software privacy software defects machine learning classification and prediction malicious websites websites dataset data processing and modeling
status_str	publishedVersion
title	Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
title_full	Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
title_fullStr	Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
title_full_unstemmed	Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
title_short	Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
title_sort	Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
topic	Information and computing sciences Applied computing Cybersecurity and privacy Data management and data science Machine learning software privacy software defects machine learning classification and prediction malicious websites websites dataset data processing and modeling

Optimizing malicious website prediction: An advanced XGBoost-based machine learning model

مواد مشابهة