直接最適化法direct optimizationの問題点

吉澤 和徳

doi:10.19004/taxa.30.0_17

Abstract

Problems in direct optimization method as implemented in the software POY were reviewed. Alignment of highly variable regions using POY is problematic because it likely provides artificial maximization of pseudo-homology within the alignment. If the variable regions are aligned with conserved regions, then the bias could be emerged in two ways. When the conserved regions contain only weak phylogenetic signal, then stochastic similarities within the variable regions strongly affect to the final alignment. This type of bias can be emerged even if the variable regions are aligned independently. As a result, aligned variable sequences may provide superior amount of pseudo-signal which could even mask true weak signal contained in the conserved regions. When the conserved regions contain significant phylogenetic signal, then the variable regions will play as adherents of the conserved data even if the variable regions contain no or even contradicting phylogenetic signal. The adherent bias is especially problematic when data set contains heterogeneous sequences by gene introgression, recombination, or lineage sorting. Structure- or similarity-based approaches should be used in aligning DNA sequences.

Content from these authors

2011 The Japanese Society of Systematic Zoology

Favorites & Alerts

Corresponding author

Register with J-STAGE for free!