Tuesday, November 7, 2023

Graph Neural Networks for Identifying Protein-Reactive Compounds

Cano Gil, V. H.; Rowley, C. N.

ChemRxiv 2023

https://doi.org/10.26434/chemrxiv-2023-d0dqp

The identification of protein-reactive electrophilic compounds is critical to the design of new covalent modifier drugs, screening for toxic compounds, and the exclusion of reactive compounds from high throughput screening. In this work, we employ traditional and graph machine learning algorithms to classify molecules being reactive towards proteins or nonreactive. For training data, we built a new dataset, ProteinReactiveDB, comprised primarily of covalent and noncovalent inhibitors from DrugBank, BindingDB, and CovalentInDB databases. To assess the transferability of the trained models, we created a custom set of covalent and noncovalent inhibitors, which was constructed from recent literature. Baseline models were developed using Morgan fingerprints as training inputs, but they performed poorly when applied to compounds outside the training set. We then trained various Graph Neural Networks (GNNs), with the best GNN model achieving an Area Under the Receiver Operator Characteristic (AUROC) curve of 0.84, precision of 0.92, and recall of 0.73. We also explore the interpretability of these GNNs using Gradient Activation Mapping (GradCAM), which shows regions of the molecules GNNs deem most relevant when making a prediction. These maps indicated that our trained models can identify electrophilic functional groups in a molecule and classify molecules as protein-reactive based on their presence.



Discovery of a Tunable Heterocyclic Electrophile 4-Chloro-pyrazolopyridine That Defines a Unique Subset of Ligandable Cysteines

Hong-Rae Kim, David P. Byun, Kalyani Thakur, Jennifer Ritchie, Yixin Xie, Ronald Holewinski, Kiall F. Suazo, Mckayla Stevens, Hope Liechty, ...