DNA sequence evolution with neighbor-dependent mutation

Arndt, Peter F.; Burge, Christopher B.; Hwa, Terence

doi:10.1089/10665270360688039

Journal Article

MPS-Authors

Arndt, Peter F.
Evolutionary Genomics (Peter Arndt), Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society;

Citation

Arndt, P. F., Burge, C. B., & Hwa, T. (2003). DNA sequence evolution with neighbor-dependent mutation. Journal of Computational Biology, 10(3-4), 313-322. doi:10.1089/10665270360688039.

Cite as: https://hdl.handle.net/11858/00-001M-0000-0010-8A40-D

Abstract

We introduce a model of DNA sequence evolution which can account for biases in mutation rates that depend on the identity of the neighboring bases. An analytic solution for this class of models is developed by adopting well-known methods of nonlinear dynamics. Results are presented for the CpG-methylation-deamination process, which dominates point substitutions in vertebrates. The dinucleotide frequencies generated by the model (using empirically obtained mutation rates) match the overall pattern observed in noncoding DNA. A web-based tool has been constructed to compute single- and dinucleotide frequencies for arbitrary neighbor-dependent mutation rates. Also provided is the backward procedure to infer the mutation rates using maximum likelihood analysis given the observed single- and dinucleotide frequencies. Reasonable estimates of the mutation rates can be obtained very efficiently, using generic noncoding DNA sequences as input, after masking out long homonucleotide subsequences. Our method is much more convenient and versatile to use than the traditional method of deducing mutation rates by counting mutation events in carefully chosen sequences. More generally, our approach provides a more realistic but still tractable description of noncoding genomic DNA and may be used as a null model for various sequence analysis applications.
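The idea of a neighbor-dependent substitution process and its effect on dinucleotide frequencies can be illustrated with a small simulation. The following is a toy Monte Carlo sketch, not the analytic solution or the web-based tool described in the abstract. The function names (`evolve`, `dinucleotide_frequencies`, `cpg_obs_over_exp`) and the rate values are hypothetical and chosen only for illustration; they are not the empirically obtained rates used in the paper. The sketch applies a uniform neighbor-independent substitution rate plus an elevated CpG decay rate (CpG to TpG, and CpG to CpA on the opposite strand), then measures how strongly CpG is depleted relative to the product of the C and G frequencies.

```python
import random
from collections import Counter

BASES = "ACGT"


def dinucleotide_frequencies(seq):
    """Relative frequencies of all 16 overlapping dinucleotides in seq."""
    counts = Counter(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {a + b: counts.get(a + b, 0) / total for a in BASES for b in BASES}


def cpg_obs_over_exp(seq):
    """Observed CpG frequency divided by f(C) * f(G)."""
    n = len(seq)
    f_c = seq.count("C") / n
    f_g = seq.count("G") / n
    return dinucleotide_frequencies(seq)["CG"] / (f_c * f_g)


def evolve(seq, steps, base_rate=0.01, cpg_rate=0.4, rng=None):
    """Toy discrete-time evolution of a sequence.

    base_rate and cpg_rate are illustrative assumptions, not measured rates.
    At each step one random site is visited. With probability base_rate it
    is replaced by a different base (neighbor-independent substitution).
    With probability cpg_rate, a C followed by G becomes T, or a G preceded
    by C becomes A (neighbor-dependent CpG decay on either strand).
    """
    rng = rng or random.Random(0)
    s = list(seq)
    n = len(s)
    for _ in range(steps):
        i = rng.randrange(n)
        if rng.random() < base_rate:
            s[i] = rng.choice([b for b in BASES if b != s[i]])
        if rng.random() < cpg_rate:
            if s[i] == "C" and i + 1 < n and s[i + 1] == "G":
                s[i] = "T"
            elif s[i] == "G" and i > 0 and s[i - 1] == "C":
                s[i] = "A"
    return "".join(s)


if __name__ == "__main__":
    rng = random.Random(42)
    start = "".join(rng.choice(BASES) for _ in range(1000))
    end = evolve(start, 50000, rng=rng)
    print("CpG obs/exp before:", round(cpg_obs_over_exp(start), 3))
    print("CpG obs/exp after: ", round(cpg_obs_over_exp(end), 3))
```

In this sketch the CpG observed-to-expected ratio starts near 1 for a random sequence and drops well below 1 after evolution, which mirrors in a qualitative way the CpG depletion the abstract attributes to the methylation-deamination process. Inferring rates from observed frequencies, as the abstract's maximum likelihood procedure does, is not attempted here.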