2009 年 84 巻 6 号 p. 425-430
Promoters, the genomic regions proximal to the transcriptional start sites (TSSs) play pivotal roles in determining the rate of transcription initiation by serving as direct docking platforms for the RNA polymerase II complex. In the post-genomic era, correct gene prediction has become one of the biggest challenges in genome annotation. Species-independent promoter prediction tools could also be useful in meta-genomics, since transcription data will not be available for micro-organisms which are not cultivated. Promoter prediction in prokaryotic genomes presents unique challenges owing to their organizational properties. Several methods have been developed to predict the promoter regions of genomes in prokaryotes, including algorithms for recognition of sequence motifs, artificial neural networks, and algorithms based on genome’s structure. However, none of them satisfies both criteria of sensitivity and precision. In this work, we present a modified artificial neural network fed by nearest neighbors based on DNA duplex stability, named N4, which can predict the transcription start sites of Escherichia coli with sensitivity and precision both above 94%, better than most of the existed algorithms.