The Molecules Gateway: A Homogeneous, Searchable Database of 150k Annotated Molecules from Actinomycetes

Natural products are a sustainable resource for drug discovery, but their identification in complex mixtures remains a daunting task. We present an automated pipeline that compares, harmonizes and ranks the annotations of LC-HRMS data by different tools. When applied to 7,400 extracts derived from 6...

Full description

Saved in:
Bibliographic Details
Main Author: Matteo Simone (1475968) (author)
Other Authors: Marianna Iorio (1475977) (author), Paolo Monciardini (1475971) (author), Massimo Santini (32134) (author), Niccolò Cantù (19950877) (author), Arianna Tocchetti (252368) (author), Stefania Serina (19950880) (author), Cristina Brunati (1475974) (author), Thomas Vernay (19950883) (author), Andrea Gentile (16838135) (author), Mattia Aracne (19950886) (author), Marco Cozzi (19950889) (author), Justin J. J. van der Hooft (6904412) (author), Margherita Sosio (531104) (author), Stefano Donadio (774816) (author), Sonia I. Maffioli (1475965) (author)
Published: 2024
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Natural products are a sustainable resource for drug discovery, but their identification in complex mixtures remains a daunting task. We present an automated pipeline that compares, harmonizes and ranks the annotations of LC-HRMS data by different tools. When applied to 7,400 extracts derived from 6,566 strains belonging to 86 actinomycete genera, it yielded 150,000 molecules after processing over 50 million MS features. The web-based Molecules Gateway provides a highly interactive access to experimental and calculated data for these molecules, along with the metadata related to extracts and producer strains. We show how the Molecules Gateway can be used to rapidly identify known hard to find microbial products, unreported analogs of known families and not yet described metabolites. The Molecules Gateway, which complements available repositories, contains annotated MS data, both acquired and computationally processed under an identical workflow, making it suitable for global analyses which reveal a large and untapped chemical diversity afforded by actinomycetes.