liverRegression code.

<p dir="ltr">Enhancers are transcriptional regulatory elements that help drive phenotypic diversity, yet they often undergo rapid sequence evolution despite functional conservation, posing a challenge for predicting their function across species. Previous machine learning models have...

Description complète

Enregistré dans:
Détails bibliographiques
Auteur principal: Amy Stephen (22684448) (author)
Publié: 2025
Sujets:
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
Description
Résumé:<p dir="ltr">Enhancers are transcriptional regulatory elements that help drive phenotypic diversity, yet they often undergo rapid sequence evolution despite functional conservation, posing a challenge for predicting their function across species. Previous machine learning models have focused on the binary task of predicting the presence or absence of open chromatin, but yet to demonstrate an ability to predict continuous differences in signal.   Here, we trained convolutional neural networks (CNNs) on a regression task to predict chromatin accessibility, which is a proxy for enhancer activity, in the liver across five mammals, and we developed a novel framework to evaluate cross-species performance. We demonstrated that training on multiple species improves model generalization to both species used in training and held-out species. However, the models consistently achieved poor performance in predicting quantitative differences in accessibility between species at orthologous regions. Our study highlights the challenges in using regression models to predict chromatin accessibility changes between species. </p>