LDSVM: Leukemia Cancer Classification Using Machine Learning

<p dir="ltr">Leukemia is blood cancer, including bone marrow and lymphatic tissues, typically involving white blood cells. Leukemia produces an abnormal amount of white blood cells compared to normal blood. Deoxyribonucleic acid (DNA) microarrays provide reliable medical diagnostic s...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Abdul Karim (417009) (author)
مؤلفون آخرون: Azhari Azhari (19517461) (author), Mobeen Shahroz (19457371) (author), Samir Brahim Belhaouri (19517464) (author), Khabib Mustofa (19457374) (author)
منشور في: 2022
الموضوعات:
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
_version_ 1864513506521382912
author Abdul Karim (417009)
author2 Azhari Azhari (19517461)
Mobeen Shahroz (19457371)
Samir Brahim Belhaouri (19517464)
Khabib Mustofa (19457374)
author2_role author
author
author
author
author_facet Abdul Karim (417009)
Azhari Azhari (19517461)
Mobeen Shahroz (19457371)
Samir Brahim Belhaouri (19517464)
Khabib Mustofa (19457374)
author_role author
dc.creator.none.fl_str_mv Abdul Karim (417009)
Azhari Azhari (19517461)
Mobeen Shahroz (19457371)
Samir Brahim Belhaouri (19517464)
Khabib Mustofa (19457374)
dc.date.none.fl_str_mv 2022-12-07T09:00:00Z
dc.identifier.none.fl_str_mv 10.32604/cmc.2022.021218
dc.relation.none.fl_str_mv https://figshare.com/articles/journal_contribution/LDSVM_Leukemia_Cancer_Classification_Using_Machine_Learning/26889151
dc.rights.none.fl_str_mv CC BY 4.0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Biomedical and clinical sciences
Oncology and carcinogenesis
Health sciences
Health services and systems
Leukemia
GSE9476
cancer
genes
classification
machine learning
ensemble LDSVM classifier
dc.title.none.fl_str_mv LDSVM: Leukemia Cancer Classification Using Machine Learning
dc.type.none.fl_str_mv Text
Journal contribution
info:eu-repo/semantics/publishedVersion
text
contribution to journal
description <p dir="ltr">Leukemia is blood cancer, including bone marrow and lymphatic tissues, typically involving white blood cells. Leukemia produces an abnormal amount of white blood cells compared to normal blood. Deoxyribonucleic acid (DNA) microarrays provide reliable medical diagnostic services to help more patients find the proposed treatment for infections. DNA microarrays are also known as biochips that consist of microscopic DNA spots attached to a solid glass surface. Currently, it is difficult to classify cancers using microarray data. Nearly many data mining techniques have failed because of the small sample size, which has become more critical for organizations. However, they are not highly effective in improving results and are frequently employed by doctors for cancer diagnosis. This study proposes a novel method using machine learning algorithms based on microarrays of leukemia GSE9476 cells. The main aim was to predict the initial leukemia disease. Machine learning algorithms such as decision tree (DT), naive bayes (NB), random forest (RF), gradient boosting machine (GBM), linear regression (LinR), support vector machine (SVM), and novel approach based on the combination of Logistic Regression (LR), DT and SVM named as ensemble LDSVM model. The k-fold cross-validation and grid search optimization methods were used with the LDSVM model to classify leukemia in patients and comparatively analyze their impacts. The proposed approach evaluated better accuracy, precision, recall, and f1 scores than the other algorithms. Furthermore, the results were relatively assessed, which showed LDSVM performance. This study aims to successfully predict leukemia in patients and enhance prediction accuracy in minimum time. Moreover, a Synthetic minority oversampling technique (SMOTE) and Principal compenent analysis (PCA) approaches were implemented. This makes the records generalized and evaluates the outcomes well. PCA reduces the feature count without losing any information and deals with class imbalanced datasets, as well as faster model execution along with less computation cost. In this study, a novel process was used to reduce the column results to develop a faster and more rapid experiment execution.</p><h2>Other Information</h2><p dir="ltr">Published in: Computers, Materials & Continua<br>License: <a href="https://creativecommons.org/licenses/by/4.0" target="_blank">https://creativecommons.org/licenses/by/4.0</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.32604/cmc.2022.021218" target="_blank">https://dx.doi.org/10.32604/cmc.2022.021218</a></p>
eu_rights_str_mv openAccess
id Manara2_91eaf12cbcb1a74273eb214dab5ae308
identifier_str_mv 10.32604/cmc.2022.021218
network_acronym_str Manara2
network_name_str Manara2
oai_identifier_str oai:figshare.com:article/26889151
publishDate 2022
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv CC BY 4.0
spelling LDSVM: Leukemia Cancer Classification Using Machine LearningAbdul Karim (417009)Azhari Azhari (19517461)Mobeen Shahroz (19457371)Samir Brahim Belhaouri (19517464)Khabib Mustofa (19457374)Biomedical and clinical sciencesOncology and carcinogenesisHealth sciencesHealth services and systemsLeukemiaGSE9476cancergenesclassificationmachine learningensemble LDSVM classifier<p dir="ltr">Leukemia is blood cancer, including bone marrow and lymphatic tissues, typically involving white blood cells. Leukemia produces an abnormal amount of white blood cells compared to normal blood. Deoxyribonucleic acid (DNA) microarrays provide reliable medical diagnostic services to help more patients find the proposed treatment for infections. DNA microarrays are also known as biochips that consist of microscopic DNA spots attached to a solid glass surface. Currently, it is difficult to classify cancers using microarray data. Nearly many data mining techniques have failed because of the small sample size, which has become more critical for organizations. However, they are not highly effective in improving results and are frequently employed by doctors for cancer diagnosis. This study proposes a novel method using machine learning algorithms based on microarrays of leukemia GSE9476 cells. The main aim was to predict the initial leukemia disease. Machine learning algorithms such as decision tree (DT), naive bayes (NB), random forest (RF), gradient boosting machine (GBM), linear regression (LinR), support vector machine (SVM), and novel approach based on the combination of Logistic Regression (LR), DT and SVM named as ensemble LDSVM model. The k-fold cross-validation and grid search optimization methods were used with the LDSVM model to classify leukemia in patients and comparatively analyze their impacts. The proposed approach evaluated better accuracy, precision, recall, and f1 scores than the other algorithms. Furthermore, the results were relatively assessed, which showed LDSVM performance. This study aims to successfully predict leukemia in patients and enhance prediction accuracy in minimum time. Moreover, a Synthetic minority oversampling technique (SMOTE) and Principal compenent analysis (PCA) approaches were implemented. This makes the records generalized and evaluates the outcomes well. PCA reduces the feature count without losing any information and deals with class imbalanced datasets, as well as faster model execution along with less computation cost. In this study, a novel process was used to reduce the column results to develop a faster and more rapid experiment execution.</p><h2>Other Information</h2><p dir="ltr">Published in: Computers, Materials & Continua<br>License: <a href="https://creativecommons.org/licenses/by/4.0" target="_blank">https://creativecommons.org/licenses/by/4.0</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.32604/cmc.2022.021218" target="_blank">https://dx.doi.org/10.32604/cmc.2022.021218</a></p>2022-12-07T09:00:00ZTextJournal contributioninfo:eu-repo/semantics/publishedVersiontextcontribution to journal10.32604/cmc.2022.021218https://figshare.com/articles/journal_contribution/LDSVM_Leukemia_Cancer_Classification_Using_Machine_Learning/26889151CC BY 4.0info:eu-repo/semantics/openAccessoai:figshare.com:article/268891512022-12-07T09:00:00Z
spellingShingle LDSVM: Leukemia Cancer Classification Using Machine Learning
Abdul Karim (417009)
Biomedical and clinical sciences
Oncology and carcinogenesis
Health sciences
Health services and systems
Leukemia
GSE9476
cancer
genes
classification
machine learning
ensemble LDSVM classifier
status_str publishedVersion
title LDSVM: Leukemia Cancer Classification Using Machine Learning
title_full LDSVM: Leukemia Cancer Classification Using Machine Learning
title_fullStr LDSVM: Leukemia Cancer Classification Using Machine Learning
title_full_unstemmed LDSVM: Leukemia Cancer Classification Using Machine Learning
title_short LDSVM: Leukemia Cancer Classification Using Machine Learning
title_sort LDSVM: Leukemia Cancer Classification Using Machine Learning
topic Biomedical and clinical sciences
Oncology and carcinogenesis
Health sciences
Health services and systems
Leukemia
GSE9476
cancer
genes
classification
machine learning
ensemble LDSVM classifier