Genome Informatics
Online ISSN : 2185-842X
Print ISSN : 0919-9454
ISSN-L : 0919-9454
Large Scale Statistical Prediction of Protein-Protein Interaction by Potentially Interacting Domain (PID) Pair
Kyu Kim Wan, Jong Park, Jung Keun Suh
JOURNAL FREE ACCESS

2002 Volume 13 Pages 42-50

Abstract

Protein-protein interaction plays a critical role in biological processes. The identification of interacting proteins by computational methods can provide new leads in functional studies of uncharacterized proteins without performing extensive experiments. We developed a database of potentially interacting domain (PID) pairs extracted from a dataset of experimentally identified interacting protein pairs (DIP: database of interacting proteins) with InterPro, an integrated database of protein families, domains and functional sites. In developing protein interaction databases and predictive methods, a sensitive statistical scoring system is critical to provide a reliability index for accurate functional analysis of interaction networks. We present a statistical scoring system, named “PID matrix score”, as a measure of the interaction probability (interactability) between domains. This system provides a valuable tool for functional prediction of unknown proteins. To evaluate the PID matrix, cross-validation was performed with subsets of the DIP data. The prediction system gives about 50% sensitivity and more than 98% specificity, which implies that the information for interacting protein pairs could be enriched about 30-fold with the PID matrix. It is demonstrated that mapping of the genome-wide interaction network can be achieved by using the PID matrix.
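
The relationship between the reported sensitivity, specificity and fold enrichment can be illustrated with a short Python sketch. The abstract does not state how the enrichment was computed; the sketch assumes one plausible reading, namely the ratio of the true positive rate to the false positive rate. The function names and the confusion-matrix counts are hypothetical and are not taken from the paper.

```python
def confusion_rates(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)  # fraction of true interactions recovered
    specificity = tn / (tn + fp)  # fraction of non-interactions rejected
    return sensitivity, specificity


def enrichment_factor(sensitivity, specificity):
    """Assumed definition: true positive rate divided by false positive rate."""
    false_positive_rate = 1.0 - specificity
    if false_positive_rate <= 0.0:
        raise ValueError("enrichment is undefined when specificity is 1")
    return sensitivity / false_positive_rate


# Hypothetical counts chosen to match "about 50% sensitivity and
# more than 98% specificity"; they are not data from the paper.
sens, spec = confusion_rates(tp=50, fn=50, tn=983, fp=17)
print(f"sensitivity={sens:.3f} specificity={spec:.3f} "
      f"enrichment={enrichment_factor(sens, spec):.1f}")
```

Under this assumed definition, 50% sensitivity at exactly 98% specificity corresponds to a 25-fold enrichment, and the factor rises as specificity moves above 98%, which is consistent with the "about 30-fold" figure quoted in the abstract.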

© Japanese Society for Bioinformatics