Chloroplast phylogenomics of Salacia chinensis L. (Celastraceae) with machine learning-assisted insights into anticancer drug discovery

Authors

  • Sheikh Sunzid Ahmed Department of Botany, University of Dhaka, Dhaka 1000, Bangladesh
  • M Oliur Rahman Department of Botany, University of Dhaka, Dhaka 1000, Bangladesh

Keywords:

Comparative plastomics; Simple sequence repeats; Phylogenetics; LightGBM; AKT-Scan AI; Tanimoto similarity; Molecular docking

Abstract

The present study reports the first complete chloroplast (Cp) genome of Salacia chinensis L. (Celastraceae), an important medicinal shrub native to Bangladesh, alongside a machine learning-driven exploration of its therapeutic potential. The circular plastome spans 157,454 bp, comprising a large single-copy of 85,757 bp, a small single-copy of 18,451 bp, and two inverted repeats of 26,623 bp each. The Cp genome encodes 127 genes, including 83 protein-coding genes, 36 tRNAs, and eight rRNAs. Comparative plastome analysis indicated a conserved genomic organization with no major structural rearrangements among the closely related members. A total of 95 simple sequence repeats were identified, predominantly mononucleotide motifs (69), suggesting potential markers for genetic diversity studies. Phylogenomic reconstruction confirmed the systematic placement of S. chinensis within Celastraceae. Complementing the genomic insights, a machine learning-guided anticancer drug discovery framework was employed targeting the AKT1 protein (RAC-alpha serine/threonine-protein kinase). A supervised LightGBM model achieved 90.4% accuracy with an AUC of 0.950, enabling the identification of two promising phytochemical leads, Regeol A and Carnaubadiol, exhibiting predicted bioactivities of 51.4% and 68.1%, respectively. Molecular docking analysis demonstrated strong binding affinities of –8.8 kcal/mol and –8.7 kcal/mol for Regeol A and Carnaubadiol, respectively, surpassing the reference drug (–8.0 kcal/mol), while ADMET profiling supported favorable pharmacokinetic properties with minimal toxicity concerns. In addition, we developed AKT-Scan AI (https://aktscanai.streamlit.app), a high-throughput machine learning platform for predicting AKT1-targeted bioactivity and assessing drug-likeness properties. Collectively, this integrative study enriches the genomic understanding of S. chinensis (GenBank Accession: PZ250435.1) and underscores its potential as a promising source of bioactive compounds for targeted therapeutic applications.

Bangladesh J. Plant Taxon. 33(1): 1-19, 2026 (June)

Abstract
0
PDF
0

Downloads

Published

2026-06-28

How to Cite

Chloroplast phylogenomics of Salacia chinensis L. (Celastraceae) with machine learning-assisted insights into anticancer drug discovery. (2026). Bangladesh Journal of Plant Taxonomy, 33(1), 1-19. https://www.banglajol.info/index.php/BJPT/article/view/91030

Issue

Section

Articles

How to Cite

Chloroplast phylogenomics of Salacia chinensis L. (Celastraceae) with machine learning-assisted insights into anticancer drug discovery. (2026). Bangladesh Journal of Plant Taxonomy, 33(1), 1-19. https://www.banglajol.info/index.php/BJPT/article/view/91030