Genetic biomarkers and machine learning techniques for predicting diabetes: systematic review

<p dir="ltr">Diabetes mellitus is a long-term metabolic condition marked by high blood sugar levels due to issues with insulin production, insulin effectiveness, or a combination of both. It stands as one of the fastest-growing diseases worldwide, projected to afflict 693 million adu...

Full description

Saved in:
Bibliographic Details
Main Author: Sulaiman Khan (12585349) (author)
Other Authors: Farida Mohsen (16994682) (author), Zubair Shah (231886) (author)
Published: 2024
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1864513539440377856
author Sulaiman Khan (12585349)
author2 Farida Mohsen (16994682)
Zubair Shah (231886)
author2_role author
author
author_facet Sulaiman Khan (12585349)
Farida Mohsen (16994682)
Zubair Shah (231886)
author_role author
dc.creator.none.fl_str_mv Sulaiman Khan (12585349)
Farida Mohsen (16994682)
Zubair Shah (231886)
dc.date.none.fl_str_mv 2024-12-20T09:00:00Z
dc.identifier.none.fl_str_mv 10.1007/s10462-024-11020-w
dc.relation.none.fl_str_mv https://figshare.com/articles/journal_contribution/Genetic_biomarkers_and_machine_learning_techniques_for_predicting_diabetes_systematic_review/30173266
dc.rights.none.fl_str_mv CC BY 4.0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Biological sciences
Genetics
Biomedical and clinical sciences
Medical biochemistry and metabolomics
Health sciences
Health services and systems
Information and computing sciences
Artificial intelligence
Machine learning
Diabetes
Scoping review
Data modalities
Genetic data
Genetic risk factors
Clinical biomarkers
dc.title.none.fl_str_mv Genetic biomarkers and machine learning techniques for predicting diabetes: systematic review
dc.type.none.fl_str_mv Text
Journal contribution
info:eu-repo/semantics/publishedVersion
text
contribution to journal
description <p dir="ltr">Diabetes mellitus is a long-term metabolic condition marked by high blood sugar levels due to issues with insulin production, insulin effectiveness, or a combination of both. It stands as one of the fastest-growing diseases worldwide, projected to afflict 693 million adults by 2045. The escalating prevalence of diabetes and associated health complications (kidney disease, retinopathy, and neuropathy) underscore the imperative to devise predictive models for early diagnosis and intervention. These complications contribute to increased mortality rates, blindness, kidney failure, and an overall diminished quality of life in individuals living with diabetes. While clinical risk factors and glycemic control provide valuable insights, they alone cannot reliably predict the onset of vascular complications. Genetic biomarkers and machine learning techniques have emerged as promising tools for predicting diabetes development risk and associated complications. Despite the emergence of numerous smart AI models for diabetes prediction, there is still a need for a thorough review outlining their progress and challenges. To address this gap, this paper offers a systematic review of the literature on AI-based models for diabetes identification, following the PRISMA extension for scoping reviews guidelines. Our review revealed that multimodal diabetes prediction models outperformed unimodal models. Most studies focused on classical machine learning models, with SNPs being the most used data type, followed by gene expression profiles, while lipidomic and metabolomic data were the least utilized. Moreover, some studies focused on identifying genetic determinants of diabetes complications relied on familial linkage analysis, tailored for robust effect loci. However, these approaches had limitations, including susceptibility to false positives in candidate gene studies and underpowered AI models capabilities due to sample size constraints. The landscape shifted dramatically with the proliferation of genomic datasets, fueled by the emergence of biobanks and the amalgamation of global cohorts. This surge has led to a more than twofold increase in genetic discoveries related to both diabetes and its complications using AI. Our focus here is on these genetic breakthroughs, particularly those empowered by AI models. However, we also highlight the existing gaps in research and underscore the need for further advancements to propel genomic discovery to the next level.</p><h2>Other Information</h2><p dir="ltr">Published in: Artificial Intelligence Review<br>License: <a href="https://creativecommons.org/licenses/by/4.0" target="_blank">https://creativecommons.org/licenses/by/4.0</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1007/s10462-024-11020-w" target="_blank">https://dx.doi.org/10.1007/s10462-024-11020-w</a></p>
eu_rights_str_mv openAccess
id Manara2_a660a574eb99f9d4b5e24c91bd45b7b1
identifier_str_mv 10.1007/s10462-024-11020-w
network_acronym_str Manara2
network_name_str Manara2
oai_identifier_str oai:figshare.com:article/30173266
publishDate 2024
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv CC BY 4.0
spelling Genetic biomarkers and machine learning techniques for predicting diabetes: systematic reviewSulaiman Khan (12585349)Farida Mohsen (16994682)Zubair Shah (231886)Biological sciencesGeneticsBiomedical and clinical sciencesMedical biochemistry and metabolomicsHealth sciencesHealth services and systemsInformation and computing sciencesArtificial intelligenceMachine learningDiabetesScoping reviewData modalitiesGenetic dataGenetic risk factorsClinical biomarkers<p dir="ltr">Diabetes mellitus is a long-term metabolic condition marked by high blood sugar levels due to issues with insulin production, insulin effectiveness, or a combination of both. It stands as one of the fastest-growing diseases worldwide, projected to afflict 693 million adults by 2045. The escalating prevalence of diabetes and associated health complications (kidney disease, retinopathy, and neuropathy) underscore the imperative to devise predictive models for early diagnosis and intervention. These complications contribute to increased mortality rates, blindness, kidney failure, and an overall diminished quality of life in individuals living with diabetes. While clinical risk factors and glycemic control provide valuable insights, they alone cannot reliably predict the onset of vascular complications. Genetic biomarkers and machine learning techniques have emerged as promising tools for predicting diabetes development risk and associated complications. Despite the emergence of numerous smart AI models for diabetes prediction, there is still a need for a thorough review outlining their progress and challenges. To address this gap, this paper offers a systematic review of the literature on AI-based models for diabetes identification, following the PRISMA extension for scoping reviews guidelines. Our review revealed that multimodal diabetes prediction models outperformed unimodal models. Most studies focused on classical machine learning models, with SNPs being the most used data type, followed by gene expression profiles, while lipidomic and metabolomic data were the least utilized. Moreover, some studies focused on identifying genetic determinants of diabetes complications relied on familial linkage analysis, tailored for robust effect loci. However, these approaches had limitations, including susceptibility to false positives in candidate gene studies and underpowered AI models capabilities due to sample size constraints. The landscape shifted dramatically with the proliferation of genomic datasets, fueled by the emergence of biobanks and the amalgamation of global cohorts. This surge has led to a more than twofold increase in genetic discoveries related to both diabetes and its complications using AI. Our focus here is on these genetic breakthroughs, particularly those empowered by AI models. However, we also highlight the existing gaps in research and underscore the need for further advancements to propel genomic discovery to the next level.</p><h2>Other Information</h2><p dir="ltr">Published in: Artificial Intelligence Review<br>License: <a href="https://creativecommons.org/licenses/by/4.0" target="_blank">https://creativecommons.org/licenses/by/4.0</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1007/s10462-024-11020-w" target="_blank">https://dx.doi.org/10.1007/s10462-024-11020-w</a></p>2024-12-20T09:00:00ZTextJournal contributioninfo:eu-repo/semantics/publishedVersiontextcontribution to journal10.1007/s10462-024-11020-whttps://figshare.com/articles/journal_contribution/Genetic_biomarkers_and_machine_learning_techniques_for_predicting_diabetes_systematic_review/30173266CC BY 4.0info:eu-repo/semantics/openAccessoai:figshare.com:article/301732662024-12-20T09:00:00Z
spellingShingle Genetic biomarkers and machine learning techniques for predicting diabetes: systematic review
Sulaiman Khan (12585349)
Biological sciences
Genetics
Biomedical and clinical sciences
Medical biochemistry and metabolomics
Health sciences
Health services and systems
Information and computing sciences
Artificial intelligence
Machine learning
Diabetes
Scoping review
Data modalities
Genetic data
Genetic risk factors
Clinical biomarkers
status_str publishedVersion
title Genetic biomarkers and machine learning techniques for predicting diabetes: systematic review
title_full Genetic biomarkers and machine learning techniques for predicting diabetes: systematic review
title_fullStr Genetic biomarkers and machine learning techniques for predicting diabetes: systematic review
title_full_unstemmed Genetic biomarkers and machine learning techniques for predicting diabetes: systematic review
title_short Genetic biomarkers and machine learning techniques for predicting diabetes: systematic review
title_sort Genetic biomarkers and machine learning techniques for predicting diabetes: systematic review
topic Biological sciences
Genetics
Biomedical and clinical sciences
Medical biochemistry and metabolomics
Health sciences
Health services and systems
Information and computing sciences
Artificial intelligence
Machine learning
Diabetes
Scoping review
Data modalities
Genetic data
Genetic risk factors
Clinical biomarkers