Genome Informatics
Online ISSN : 2185-842X
Print ISSN : 0919-9454
ISSN-L : 0919-9454
Finding Coding Region Using Secondary Hexamer Measure and Two-Dimensional Linear Discriminant Analysis
Katsuhiko MurakamiToshihisa Takagi
著者情報
ジャーナル フリー

1996 年 7 巻 p. 256-257

詳細
抄録
We have developed a coding region prediction system. It is constructed from several measures that indicate exonness of a region in DNA sequence. The system includes a new statistical measure called secondary hexamer measure which we have developed. In addition to the measure, several measures are combined by two-dimensional linear discriminant analysis (2D-LDA). Then the system outputs a best gene model, that is a model with the best score accumulated by phase-specific dynamic programming. Our test of this program on 568 vertebrate complete gene sequences had 61% accuracy at exon level for exact match and 95% accuracy at nucleotide level. The average correlation coefficient (CC) between prediction and actual structure was 0.80.
著者関連情報
© Japanese Society for Bioinformatics
前の記事 次の記事
feedback
Top