<b>Minimal repeats identified by machine learning as novel cross-species markers of translation initiation sites</b>

<p dir="ltr">Statistical approaches indicate that tandem repeats (TRs) play a crucial regulatory role in translation initiation site (TIS) selection and proteomic diversity.<b> </b>By leveraging machine learning techniques, we enhanced the previous approaches and investig...

Descrizione completa

Salvato in:
Dettagli Bibliografici
Autore principale: Ali Mohammad Ali Maddi (11301486) (author)
Pubblicazione: 2025
Soggetti:
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
_version_ 1849927625216098304
author Ali Mohammad Ali Maddi (11301486)
author_facet Ali Mohammad Ali Maddi (11301486)
author_role author
dc.creator.none.fl_str_mv Ali Mohammad Ali Maddi (11301486)
dc.date.none.fl_str_mv 2025-11-26T00:17:48Z
dc.identifier.none.fl_str_mv 10.6084/m9.figshare.28934366.v2
dc.relation.none.fl_str_mv https://figshare.com/articles/dataset/_b_Minimal_repeats_identified_by_interpretable_machine_learning_as_novel_cross-species_markers_of_translation_initiation_sites_b_/28934366
dc.rights.none.fl_str_mv CC BY 4.0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Bioinformatic methods development
Minimal Repeat
Tandem Repeat
TIS Detection
Feature Selection
Machine Learning
Genomic Pattern Recognition
Translation Initiation Site
dc.title.none.fl_str_mv <b>Minimal repeats identified by machine learning as novel cross-species markers of translation initiation sites</b>
dc.type.none.fl_str_mv Dataset
info:eu-repo/semantics/publishedVersion
dataset
description <p dir="ltr">Statistical approaches indicate that tandem repeats (TRs) play a crucial regulatory role in translation initiation site (TIS) selection and proteomic diversity.<b> </b>By leveraging machine learning techniques, we enhanced the previous approaches and investigated the impact of the TR spectrum, ranging from 2 to 85 repeats on TIS selection across four species, consisting of human, mouse, bovine, and fruit fly.<b> </b>We identified a subset of 50 key motifs that effectively distinguish TIS-present from TIS-absent regions with high accuracy. The majority of these motif sequences (80 %) were minimal repeats (MRs), mainly ranging from 2 to 3 repeats. We found evolutionary conservation of the majority of these motifs, underscoring their functional importance, while certain species-specific MRs served as genomic fingerprints, reflecting unique regulatory adaptations. Additionally, the dense distribution of MRs around TISs highlighted their potential as genomic codes for identifying TIS hotspots.<b> </b>This study signified that alongside TRs, MRs are key genomic markers for TIS selection, offering insights into their biological roles and evolutionary significance. The mechanistic aspects of MRs/TRs are yet to be discovered in future studies.</p>
eu_rights_str_mv openAccess
id Manara_6eee9f1060b2213634d5c1c2ee0e6a16
identifier_str_mv 10.6084/m9.figshare.28934366.v2
network_acronym_str Manara
network_name_str ManaraRepo
oai_identifier_str oai:figshare.com:article/28934366
publishDate 2025
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv CC BY 4.0
spelling <b>Minimal repeats identified by machine learning as novel cross-species markers of translation initiation sites</b>Ali Mohammad Ali Maddi (11301486)Bioinformatic methods developmentMinimal RepeatTandem RepeatTIS DetectionFeature SelectionMachine LearningGenomic Pattern RecognitionTranslation Initiation Site<p dir="ltr">Statistical approaches indicate that tandem repeats (TRs) play a crucial regulatory role in translation initiation site (TIS) selection and proteomic diversity.<b> </b>By leveraging machine learning techniques, we enhanced the previous approaches and investigated the impact of the TR spectrum, ranging from 2 to 85 repeats on TIS selection across four species, consisting of human, mouse, bovine, and fruit fly.<b> </b>We identified a subset of 50 key motifs that effectively distinguish TIS-present from TIS-absent regions with high accuracy. The majority of these motif sequences (80 %) were minimal repeats (MRs), mainly ranging from 2 to 3 repeats. We found evolutionary conservation of the majority of these motifs, underscoring their functional importance, while certain species-specific MRs served as genomic fingerprints, reflecting unique regulatory adaptations. Additionally, the dense distribution of MRs around TISs highlighted their potential as genomic codes for identifying TIS hotspots.<b> </b>This study signified that alongside TRs, MRs are key genomic markers for TIS selection, offering insights into their biological roles and evolutionary significance. The mechanistic aspects of MRs/TRs are yet to be discovered in future studies.</p>2025-11-26T00:17:48ZDatasetinfo:eu-repo/semantics/publishedVersiondataset10.6084/m9.figshare.28934366.v2https://figshare.com/articles/dataset/_b_Minimal_repeats_identified_by_interpretable_machine_learning_as_novel_cross-species_markers_of_translation_initiation_sites_b_/28934366CC BY 4.0info:eu-repo/semantics/openAccessoai:figshare.com:article/289343662025-11-26T00:17:48Z
spellingShingle <b>Minimal repeats identified by machine learning as novel cross-species markers of translation initiation sites</b>
Ali Mohammad Ali Maddi (11301486)
Bioinformatic methods development
Minimal Repeat
Tandem Repeat
TIS Detection
Feature Selection
Machine Learning
Genomic Pattern Recognition
Translation Initiation Site
status_str publishedVersion
title <b>Minimal repeats identified by machine learning as novel cross-species markers of translation initiation sites</b>
title_full <b>Minimal repeats identified by machine learning as novel cross-species markers of translation initiation sites</b>
title_fullStr <b>Minimal repeats identified by machine learning as novel cross-species markers of translation initiation sites</b>
title_full_unstemmed <b>Minimal repeats identified by machine learning as novel cross-species markers of translation initiation sites</b>
title_short <b>Minimal repeats identified by machine learning as novel cross-species markers of translation initiation sites</b>
title_sort <b>Minimal repeats identified by machine learning as novel cross-species markers of translation initiation sites</b>
topic Bioinformatic methods development
Minimal Repeat
Tandem Repeat
TIS Detection
Feature Selection
Machine Learning
Genomic Pattern Recognition
Translation Initiation Site