Table 1_QSAR-guided discovery of novel KRAS inhibitors for lung cancer therapy.xlsx

Introduction<p>KRAS mutations are key oncogenic drivers in lung cancer, yet effective pharmacological targeting has remained a major challenge due to the protein's elusive and dynamic binding pockets. Computational modeling offers a promising route to identify novel inhibitors with improv...

Full description

Saved in:
Bibliographic Details
Main Author: Osasan Stephen Adebayo (22630082) (author)
Other Authors: George Oche Ambrose (17661786) (author), Daramola Olusola (22630085) (author), Adefolalu Oluwafemi (22630088) (author), Hind A. Alzahrani (17689671) (author), Abdulkarim Hasan (16378311) (author)
Published: 2025
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Introduction<p>KRAS mutations are key oncogenic drivers in lung cancer, yet effective pharmacological targeting has remained a major challenge due to the protein's elusive and dynamic binding pockets. Computational modeling offers a promising route to identify novel inhibitors with improved potency and selectivity.</p>Methods<p>A quantitative structure–activity relationship (QSAR) modeling approach was developed to predict the inhibitory potency (pIC<sub>50</sub>) of KRAS inhibitors and support de novo drug design. Molecular descriptors for 62 inhibitors retrieved from the ChEMBL database (CHEMBL4354832) were computed using Chemopy. Following descriptor normalization and dimensionality reduction, five machine learning algorithm spartial least squares (PLS), random forest (RF), stepwise multiple linear regression (MLR), genetic algorithm optimized MLR (GA-MLR), and XGBoost were applied. Model performance was evaluated using R<sup>2</sup>, RMSE, and MAE, while permutation-based importance and SHAP analyses provided feature interpretability.</p>Results<p>Among the models tested, PLS exhibited the best predictive performance (R<sup>2</sup> = 0.851; RMSE = 0.292), followed by RF (R<sup>2</sup> = 0.796). The GA-MLR model, based on eight optimized molecular descriptors, achieved good interpretability and robust internal validation (R<sup>2</sup> = 0.677). Virtual screening of 56 de novo designed compounds within the model's applicability domain identified compound C9 with a predicted pIC<sub>50</sub>) of 8.11 as the most promising hit.</p>Discussion<p>This integrative QSAR modeling and de novo design framework effectively predicted the bioactivity of KRAS inhibitors and facilitated the identification of novel candidate molecules. The findings demonstrate the utility of combining interpretable machine learning models with virtual screening to accelerate the discovery of potent KRAS inhibitors for lung cancer therapy.</p>