Abstract
HIV-1 protease (PR) inhibitors are crucial for antiviral therapies targeting acquired immunodeficiency syndrome. Hundreds of PR complexes with various ligands have been resolved and deposited in the Protein Data Bank. However, binding affinity measurements for these ligands are not always available. This gap hinders a comprehensive understanding of inhibitor efficacy. To address this challenge, machine learning models are constructed and validated based on the crystallographic coordinates of 291 PR–inhibitor complexes, leveraging over 2500 molecular descriptors. The models achieved accuracy scores exceeding 0.85, and applied to predict the binding affinity of 274 additional complexes for which inhibition constants are not experimentally measured. The analysis is focused on three models, each with 8–9 features, and based on KBest with random forest, recursive feature elimination with random forest, and sequential feature selection with support vector machine. The findings revealed key predictive features, including properties of PR inhibitors like charge distribution, hydrogen-bonding capability, and 3D topology, as well as intrinsic properties of PR, such as active site symmetry and flap mutations. The study highlights the contribution of a comprehensive analysis of accumulated experimental data to enhance the structural understanding of this important molecular system.
Original language | English |
---|---|
Pages (from-to) | e2500277 |
Journal | ChemMedChem |
DOIs | |
State | Published - 28 May 2025 |
Keywords
- crystal structure
- human immunodeficiency virus
- ligand binding
- machine learning
- symmetry
All Science Journal Classification (ASJC) codes
- Biochemistry
- Molecular Medicine
- Pharmacology
- Drug Discovery
- General Pharmacology, Toxicology and Pharmaceutics
- Organic Chemistry