research-article

Globalized bipartite local model for drug-target interaction prediction

Authors:
Jian-Ping Mei

Nanyang Technological University, Singapore

Nanyang Technological University, Singapore
View Profile

,
Chee-Keong Kwoh

Nanyang Technological University, Singapore

Nanyang Technological University, Singapore
View Profile

,
Peng Yang

Nanyang Technological University, Singapore

Nanyang Technological University, Singapore
View Profile

,
Xiao-Li Li

Institute for Infocomm Research, Connexis, Singapore

Institute for Infocomm Research, Connexis, Singapore
View Profile

,
Jie Zheng

Nanyang Technological University, Singapore

Nanyang Technological University, Singapore
View Profile

BIOKDD '12: Proceedings of the 11th International Workshop on Data Mining in BioinformaticsAugust 2012Pages 8–14https://doi.org/10.1145/2350176.2350178

Published:12 August 2012Publication History

BIOKDD '12: Proceedings of the 11th International Workshop on Data Mining in Bioinformatics

Pages 8–14

ABSTRACT

In pharmacology, it is essential to identify the interactions between drug and targets to understand its effects. Supervised learning with Bipartite Local Model (BLM) recently has been shown to be effective for prediction of drug-target interactions by first predicting target proteins of a given known drug, then predicting drugs targeting a known protein. However, this pure "local" model is inapplicable to new drug or target candidates that currently have no known interactions. In this paper, we extend the existing BLM method by integrating a strategy for handling new drug and target candidates. Based on the assumption that similar drugs and targets have similar interaction profiles, we present a simple neighbor-based training data inferring procedure and integrate it into the frame work of BLM. This globalized BLM called bipartite local model with neighbor-based inferring (BLMN) then has an extended functionality for prediction interactions between new drug candidates and new target candidates. Good performance of BLMN has been observed in the experiment of predicting interactions between drugs and four important categories of targets. For the Nuclear Receptors dataset, where there are more chances for the presented strategy to be applied, 20% improvement in terms of AUPR was achieved. This demonstrates the effectiveness of BLMN and its potential in prediction of drug-target interactions.

References

K. Bleakley and Y. Yamanishi. Supervised prediction of drug-target interactions using bipartite local models. Bioinformatics, 25(18):2397--2403, 2009. Google ScholarDigital Library
M. Campillos et al. Drug target identification using side-effect similarity. Science, 321(5886):263--266, 2008.Google ScholarCross Ref
P. R. Caron et al. Chemogeominc approaches to drug discovery. Curr. Opin. Chem. Biol., 5:464--470, 2001.Google ScholarCross Ref
X. Chen et al. Drug-target interaction prediction by random walk on the heterogeneous network. Molecular BioSystems, 2012.Google ScholarCross Ref
J. Davis and M. Goadrich. The relationship between precision-recall and ROC curves. In Proc. 23rd International Conference on Machine Learning, pages 233--240, 2006. Google ScholarDigital Library
S. Gnther et al. SuperTarget and Matador: resources for exploring drug-target relationships. Nucleic acids res., 36(Database issue):D919--D922, 2008.Google Scholar
M. Hattori et al. Development of a chemical structure comparison method for integrated analysis of chemical and genomic information in the metabolic pathways. J. Am. Chem Soc., 125(39):11853--11865, 2003.Google ScholarCross Ref
V. J. Haupt and M. Schroeder. Old friends in new guise: repositioning of known drugs with structural bioinformatics. Briefings in Bioinformatics, 2011.Google ScholarCross Ref
L. Jacob and J.-P. Vert. Protein-ligand interaction prediction: an improved chemogenomics approach. Bioinformatics, 24(19):2149--2156, 2008. Google ScholarDigital Library
M. Kanehisa et al. From genomics to chemical genomics: new developments in KEGG. Nucleic acids res., 34(Database):D354--357, 2006.Google Scholar
M. J. Keiser et al. Predicting new molecular targets for known drugs. Nature, 462:175--181, 2009.Google ScholarCross Ref
H. Kubinyi and G. Müller. Chemogenomics in Drug Discovery. Wiley-VCH, Weinheim, 2004.Google ScholarCross Ref
T. V. Laarhoven, S. B. Nabuurs, and E. Marchiori. Gaussian interaction profile kernels for predicting drug-target interaction. Bioinformatics, 2011. Google ScholarDigital Library
Y. C. Martin et al. Do structurally similar molecules have similar biological activity? J. Med. Chem, 45:4350--4358, 2002.Google ScholarCross Ref
L. Perlman et al. Combining drug and gene similarity measures for drug-target elucidation. Journal of computational biology, 18:133--145, 2011.Google Scholar
D. Rognan. Chemogenomic approaches to rational drug design. British Journal of Pharmacology, 152:38--52, 2007.Google ScholarCross Ref
I. Schomburg et al. BRENDA, the enzyme database: updates and major new developments. Nucleic Asids Res., 32(supl-1):D431--433, 2004.Google Scholar
D. S. Wishart et al. DrugBank: a knowledgebase for drugs, drug actions and drug targets. Nucleic acids res., 36(Database issue):D901--906, 2008.Google Scholar
Z. Xia et al. Semi-supervised drug-protein interaction prediction from heterogeneous biological spaces. BMC Systems Biology, 4 (Suppl 2):S6, 2010.Google ScholarCross Ref
Y. Yamanishi et al. Prediction of drug-target interaction networks from the integration of chemical and genomic spaces. Bioinformatics, 24:i232--i240, 2008. Google ScholarDigital Library

Index Terms

Globalized bipartite local model for drug-target interaction prediction
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Supervised learning
        Supervised learning by classification
    2. Machine learning approaches
      1. Classification and regression trees

Recommendations

Prediction of Compound-Target Interactions of Natural Products Using Large-scale Drug and Protein Information
DTMBIO '15: Proceedings of the ACM Ninth International Workshop on Data and Text Mining in Biomedical Informatics

Verifying the proteins that are targeted by compounds of natural herbs will help select natural herb-based drug candidates. However, this entails a great deal of effort to clarify the interaction throughout in vitro or in vivo experiments. In this light,...
Read More
Supervised prediction of drug–target interactions using bipartite local models

Motivation: In silico prediction of drug–target interactions from heterogeneous biological data is critical in the search for drugs for known diseases. This problem is currently being attacked from many different points of view, a strong indication ...
Read More
Multi-dimensional search for drug–target interaction prediction by preserving the consistency of attention distribution
Abstract
Predicting drug–target interaction (DTI) is a crucial step in the process of drug repurposing and new drug development. Although the attention mechanism has been widely used to capture the interactions between drugs and targets, it mainly uses ...
Graphical abstract

Display Omitted
Highlights
- Predicting drug–target interaction from 2D substructures and 3D structure of the drug.
- Improved Chemistry-inspired Molecular Fragments (ICMF) strategy decompose the drugs.
- Building the 3D spatial feature matrix for drugs from the ...
Read More

Reviews

Reviewer: You Chen

The identification of drug-target interactions is very useful for understanding drug effects. In recent years, researchers have designed many models and systems to predict drug-target interactions. Among those models, the bipartite local model (BLM) is well known for its high accuracy of prediction. BLM treats drug-target interactions as a bipartite graph, with the two sides of the graph depicting drugs and targets respectively. BLM is a supervised learning model and does not resolve the important problem of predicting drug-target interactions for new drugs. The authors of this paper propose a new bipartite local model with neighbor-based inferring (BLMN). The significant difference between BLM and BLMN is that BLMN predicts drug-target interactions not only for existing drugs, but also for new drugs; this is shown in the first equation. For a new drug, BLMN first finds its nearest neighbors using pairwise similarity. It then predicts drug-target interactions for the new drug based on the drug-target interactions of its nearest neighbors. The inputs to BLMN are pairwise similarities for all drugs, pairwise similarities for all targets, and drug-target interactions for all drugs and targets. Pairwise similarities of drugs and targets are calculated through chem-seq, network-based, and hybrid inputs: "chem-seq denotes that chemical similarity is used for the drug and sequence similarity is used for the target; network-based denotes that the drug-drug similarity and target-target similarity are derived from the existing interaction network; [and] hybrid denotes that the drug-drug similarity and target-target similarity are combinations of the two types of similarities." A binary matrix A represents interactions between a set of drugs and a set of targets. If drug i has an interaction with target j , then the cell value of A ( i , j ) will be one. If drug i has no interaction with target j , then the value of A ( i , j ) will be zero. BLMN was evaluated on four different datasets: enzyme, ion channel, G protein coupled receptor (GPCR), and nuclear receptor. The experimental results indicate that BLMN has the highest scores for the area under the receiver operating characteristic (ROC) curve (AUC) and the area under the precision-recall curve (AUPR), based on hybrid similarity calculations for all four datasets. In addition, the authors compared BLM and BLMN on the nuclear receptor database with three different types of similarity: chem-seq, network-based, and hybrid. BLMN outperforms BLM for all three types of similarity. I did find three drawbacks to the paper. First, BLMN is a supervised learning method, which relies on a labeled training dataset. As we know, supervised learning methods perform better than semi-supervised and unsupervised methods, but they require many more resources. Second, in the third equation, there is a parameter ? to focus on the neighbors with the highest similarity; however, the experimental section neither provides a value for ? nor explains how different values of ? influence the model's performance. Third, the paper mostly relies on an earlier paper on BLM [1]. Without reading the BLM-related paper, readers cannot easily understand how BLMN works. Readers whose research interests are data mining and social analysis in bioinformatics are the paper's target audience. Online Computing Reviews Service

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
BIOKDD '12: Proceedings of the 11th International Workshop on Data Mining in Bioinformatics
August 2012
38 pages
ISBN:9781450315524
DOI:10.1145/2350176
General Chairs:
Jake Chen
Indiana University-Purdue University Indianapolis, Indianapolis, IN
,
Mohammed J. Zaki
Rensselaer Polytechnic Institute, Troy, NY
,
Program Chairs:
Tamer Kahveci
University of Florida, Gainesville, FL
,
Saeed Salem
North Dakota State University, Fargo, ND
,
Mehmet Koyutürk
Case Western Reserve University, Cleveland, OH
Copyright © 2012 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 12 August 2012
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
drug-target interaction
local model
neighbor-based
new candidate
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate7of16submissions,44%
Upcoming Conference
KDD '24

Sponsor:

sigkdd

sigkdd

The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 25 - 29, 2024

Barcelona , Spain
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 2
  Total Citations
  View Citations
- 211
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Globalized bipartite local model for drug-target interaction prediction

BIOKDD '12: Proceedings of the 11th International Workshop on Data Mining in Bioinformatics

ABSTRACT

References

Cited By

Index Terms

Recommendations

Prediction of Compound-Target Interactions of Natural Products Using Large-scale Drug and Protein Information

Supervised prediction of drug–target interactions using bipartite local models

Multi-dimensional search for drug–target interaction prediction by preserving the consistency of attention distribution

Reviews

Access critical reviews of Computing literature here

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Globalized bipartite local model for drug-target interaction prediction

BIOKDD '12: Proceedings of the 11th International Workshop on Data Mining in Bioinformatics

ABSTRACT

References

Cited By

Index Terms

Recommendations

Prediction of Compound-Target Interactions of Natural Products Using Large-scale Drug and Protein Information

Supervised prediction of drug–target interactions using bipartite local models

Multi-dimensional search for drug–target interaction prediction by preserving the consistency of attention distribution

Reviews

Access critical reviews of Computing literature here

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media