학술논문

Overview of the SAMPL5 host–guest challenge: Are we doing better?
Document Type
article
Source
Journal of Computer-Aided Molecular Design. 31(1)
Subject
Medicinal and Biomolecular Chemistry
Chemical Sciences
Clinical Research
Bioengineering
Generic health relevance
Binding Sites
Drug Design
Ligands
Molecular Dynamics Simulation
Molecular Structure
Protein Binding
Proteins
Solvents
Structure-Activity Relationship
Thermodynamics
Host-guest
Molecular recognition
Computer-aided drug design
Blind challenge
Binding affinity
Host–guest
Theoretical and Computational Chemistry
Medicinal & Biomolecular Chemistry
Medicinal and biomolecular chemistry
Theoretical and computational chemistry
Language
Abstract
The ability to computationally predict protein-small molecule binding affinities with high accuracy would accelerate drug discovery and reduce its cost by eliminating rounds of trial-and-error synthesis and experimental evaluation of candidate ligands. As academic and industrial groups work toward this capability, there is an ongoing need for datasets that can be used to rigorously test new computational methods. Although protein-ligand data are clearly important for this purpose, their size and complexity make it difficult to obtain well-converged results and to troubleshoot computational methods. Host-guest systems offer a valuable alternative class of test cases, as they exemplify noncovalent molecular recognition but are far smaller and simpler. As a consequence, host-guest systems have been part of the prior two rounds of SAMPL prediction exercises, and they also figure in the present SAMPL5 round. In addition to being blinded, and thus avoiding biases that may arise in retrospective studies, the SAMPL challenges have the merit of focusing multiple researchers on a common set of molecular systems, so that methods may be compared and ideas exchanged. The present paper provides an overview of the host-guest component of SAMPL5, which centers on three different hosts, two octa-acids and a glycoluril-based molecular clip, and two different sets of guest molecules, in aqueous solution. A range of methods were applied, including electronic structure calculations with implicit solvent models; methods that combine empirical force fields with implicit solvent models; and explicit solvent free energy simulations. The most reliable methods tend to fall in the latter class, consistent with results in prior SAMPL rounds, but the level of accuracy is still below that sought for reliable computer-aided drug design. Advances in force field accuracy, modeling of protonation equilibria, electronic structure methods, and solvent models, hold promise for future improvements.