Genome Informatics
Online ISSN : 2185-842X
Print ISSN : 0919-9454
ISSN-L : 0919-9454
Predicting Protein Disorder for N-, C- and Internal Regions
Xiaohong LiPedro RomeroMeeta RaniA. Keith DunkerZoran Obradovic
著者情報
ジャーナル フリー

1999 年 10 巻 p. 30-40

詳細
抄録

Logistic regression (LR), discriminant analysis (DA), and neural networks (NN) were used to predict ordered and disordered regions in proteins. Training data were from a set of non-redundant X-ray crystal structures, with the data being partitioned into N-terminal, C-terminal and internal (I) regions. The DA and LR methods gave almost identical 5-cross validation accuracies that averaged to the following values: 75.9±3.1%(N-regions), 70.7±1.5%(I-regions), and 74.6±4.4%(C-regions). NN predictions gave slightly higher scores: 78.8±1.2%(N-regions), 72.5±1.2%(I-regions), and 75.3±3.3%(C-regions). Predictions improved with length of the disordered regions. Averaged over the three methods, values ranged from 52% to 78% for length=9-14 to≥21, respectively, for I-regions, from 72% to 81% for length=5 to 12-15, respectively, for N-regions, and from 70% to 80% for length=5 to 12-15, respectively, for C-regions. These data support the hypothesis that disorder is encoded by the amino acid sequence.

著者関連情報
© Japanese Society for Bioinformatics
前の記事 次の記事
feedback
Top