Genome Informatics
Online ISSN : 2185-842X
Print ISSN : 0919-9454
ISSN-L : 0919-9454
A Compression Algorithm for DNA Sequences and Its Applications in Genome Comparison
Xin ChenSam KwongMing Li
著者情報
ジャーナル フリー

1999 年 10 巻 p. 51-61

詳細
抄録
We present a lossless compression algorithm, GenCompress, for genetic sequences, based on searching for approximate repeats. Our algorithm achieves the best compression ratios for benchmark DNA sequences. Significantly better compression results show that the approximate repeats are one of the main hidden regularities in DNA sequences.
We then describe a theory of measuring the relatedness between two DNA sequences. Using our algorithm, we present strong experimental support for this theory, and demonstrate its application in comparing genomes and constructing evolutionary trees.
著者関連情報
© Japanese Society for Bioinformatics
前の記事 次の記事
feedback
Top