3DICE: Interpretable 3D Cross-Modal Learning for Drug–Target Interaction Prediction and Large-Scale Drug Discovery
Austin Zi Rui Liu, Nguyen Quoc Khanh Le, Matthew Chin Heng ChuaAbstract
Motivation
Drug–target interaction (DTI) prediction is a crucial step in modern drug discovery. Accurate and efficient predictions can substantially reduce costs and development time. Applications of deep learning methods for this purpose have been extensively studied in recent years, yielding instrumental contributions to this field. However, existing methods face issues pertaining to efficient learning of drug and target feature representations, which is detrimental to generalisability and performance in cold-start scenarios. Most approaches extract representations from SMILES strings for drugs and FASTA sequences for target proteins, which encode limited 3D structural information. Additionally, many models lack explainability, being black boxes that provide little physical insight into the underlying mechanisms behind such interactions.
Results
We propose 3DICE, a novel framework leveraging co-attention-based fusion and massively pre-trained 3D structural encoders for both drugs and proteins. Uni-Mol and ESM-IF1 are employed to generate high-fidelity, 3D structure-aware embeddings which enable richer geometric and chemical understanding. Cross-modal fusion modules further augment representations to model intermolecular binding relationships. Importantly, this mechanism also provides intrinsic interpretability, highlighting and enabling qualitative analysis of most influential atoms or residues. Experiments conducted on two canonical benchmark datasets display the competitiveness of our model in real-world scenarios. 3DICE outperformed state-of-the-art models across multiple metrics on the DrugBank and KIBA datasets. Additional experiments provide a more rigorous analysis of interpretability than is typically reported in prior DTI studies, and we find that attention consistently highlights decision-critical regions which is not intrinsically class-specific.
Availability
Our model and dataset are freely available at: https://github.com/austinatose/3DICE.
Supplementary information
Supplementary data are available at Bioinformatics online.