IPSJ Transactions on Bioinformatics
Online ISSN : 1882-6679
ISSN-L : 1882-6679
 
AtLASS: A Scheme for End-to-End Prediction of Splice Sites Using Attention-based Bi-LSTM
Ryo HaradaKeitaro KumeKazumasa HorieTakuro NakayamaYuji InagakiToshiyuki Amagasa
著者情報
ジャーナル フリー

2023 年 16 巻 p. 20-27

詳細
抄録

Eukaryotic genomes contain exons and introns, and it is necessary to accurately identify exon-intron boundaries, i.e., splice sites, to annotate genomes. To address this problem, many previous works have proposed annotation methods/tools based on RNA-seq evidence. Many recent works exploit neural networks (NNs) as their prediction models, but only a few can be used to generate new genome annotation in practice. In this study, we propose AtLASS, a fully automated method for predicting splice sites from genomic and RNA-seq data using attention-based Bi-LSTM (Bidirectional Long Short-Term Memory). We exploit two-stage training on RNA-seq data to address the problem of biased label problem, thereby reducing the false positives. The experiments on the genomes of three species show that the performance of the proposed method itself is comparable to that of existing methods, but we can achieve better performance by combining the outputs of the proposed method and the existing method. The proposed method is the first program specialized in end-to-end splice site prediction using NNs.

著者関連情報
© 2023 by the Information Processing Society of Japan
前の記事 次の記事
feedback
Top