3DProtDTA a new deep learning model for drug-target affinity prediction
Naveen Kumar Yethirajula
Junior Scientific Writer at Sanofi || Crafting Impactful Pharma Content | Bioinformatics Enthusiast Exploring Vaccinology Tools
Introduction
- Accurate prediction of drug-target binding affinity (DTA) is critical in drug discovery to filter out ineffective drug candidates early and reduce failures in clinical trials
- Computational DTA techniques aim to strike a balance between accuracy and efficiency compared to detailed simulations
- Recent advances use deep learning and graph representations to better capture molecule connectivity
Components
- Represents proteins as residue-level graphs with amino acid nodes, covalent and non-covalent edge features
- Uses AlphaFold structures to enable modeling of graph representations of proteins
- Ligands represented as molecular graphs plus morgan fingerprints to combine complementary information
- Graph neural networks (GNNs) employed to encode both protein and ligand graphs, with fully connected layers and graph pooling to enable end-to-end learning for affinity prediction
- Fully connected layers combined with graph pooling used for affinity prediction
- It represents both proteins and ligands (drug candidates) as graphs, retaining more information about their connectivity and 3D arrangements compared to linear representations like sequences or SMILES strings
- For proteins, it generates residue-level graphs with nodes representing amino acids and edges encoding covalent and non-covalent interactions i.e features capture sequence, structure, and binding properties
Methods
- Benchmarked on Davis and KIBA datasets against state-of-the-art methods
- Compared multiple GNN architectures and graph pooling techniques
- Used grid search and cross-validation to select optimal model architecture
- Trained end-to-end model on interaction datasets to predict binding affinities
Key Findings
- Outperforms previous methods on all evaluation metrics on both benchmarks
领英推荐
- Particularly significant gains on Davis set over next best method
- Ablation studies indicate combined ligand features and graph pooling choices help most
- Specific GNN architecture less crucial than representations and end-to-end integration
- Analysis of different components indicates combined ligand features and graph pooling choices contribute most to performance gains
Limitations
- Relies on AlphaFold structures - biases possible compared to experimental structures
- Residue-level graphs simplify protein complexity compared to atomic-level encodings
-The GNN architecture itself is less impactful
Future Prospects
- Integrate experimental structures where available to complement AlphaFold
- Construct atomic-level protein graphs
- Encode more physics-based descriptors e.g. flexibility, energies
Conclusion
- 3DProtDTA advances state-of-the-art in DTA prediction through integrated, end-to-end learning
- Results highlight value of graph representations and importance of modelling connectivity
- Provides strong baseline with room for improvement as more structured data becomes available
In conclusion, the use of AlphaFold structures, graph representations, and end-to-end learning appear to enable superior modelling of binding affinities
Happy reading. Would you add anything, please add in the comments section
Intern at Scry AI
10 个月Great share. AI is playing a transformative role in drug discovery and addressing challenges in predicting protein structures, de novo drug design, drug-protein interactions, repurposing drugs, toxicity prediction, and physicochemical property assessment. Predicting protein structures, led by DLNs like AlphaFold2 and RoseTTAFold, aids in identifying compounds that can bind to disease-causing proteins. De novo drug design involves combining organic chemistry rules with AI, thus accelerating the drug discovery process. AI algorithms, including DLNs, Support Vector Machines, and Logistic Regression, are increasingly being used to repurpose existing drugs and hence potentially saving costs. AI models like DeepTox surpass traditional toxicity prediction methods, thereby enhancing accuracy. Predicting physicochemical properties achieved through AI algorithms accurately assessing parameters. Additionally, AI contributes to clinical trial improvement by harmonizing diverse datasets, creating comprehensive patient views, and optimizing patient matching for trials. These applications collectively showcase AI's pivotal role in revolutionizing drug discovery and development processes. More about this topic: https://lnkd.in/gPjFMgy7