ausblenden:
Schlagwörter:
phylogenetics; population genetics; mutation distance; match length; suffix tree
Zusammenfassung:
Phylogenetics and population genetics are central disciplines in evolutionary biology. Both are based on comparative
data, today usually DNA sequences. These have become so plentiful that alignment-free sequence comparison is of
growing importance in the race between scientists and sequencing machines. In phylogenetics, efficient distance
computation is the major contribution of alignment-free methods. A distance measure should reflect the number
of substitutions per site, which underlies classical alignment-based phylogeny reconstruction. Alignment-free distance
measures are either based on word counts or on match lengths, and I apply examples of both approaches to
simulated and real data to assess their accuracy and efficiency. While phylogeny reconstruction is based on the
number of substitutions, in population genetics, the distribution of mutations along a sequence is also considered.
This distribution can be explored by match lengths, thus opening the prospect of alignment-free population
genomics.