Multiscale DNA partitioning: Statistical evidence for segments.

Futschik, A.; Hotz, T.; Munk, A.; Sieling, H.

doi:10.1093/bioinformatics/btu180

Local TagsRelease HistoryDetailsSummary

Multiscale DNA partitioning: Statistical evidence for segments.

Futschik, A., Hotz, T., Munk, A., & Sieling, H. (2014). Multiscale DNA partitioning: Statistical evidence for segments. Bioinformatics, 30(16), 2255-2262. doi:10.1093/bioinformatics/btu180.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-0023-C068-1 Version Permalink: https://hdl.handle.net/11858/00-001M-0000-0028-3600-E

Genre: Journal Article

Files

show Files

hide Files

:

2051429.pdf (Publisher version), 487KB

View Save

File Permalink:
https://hdl.handle.net/11858/00-001M-0000-0024-171A-B

Name:
2051429.pdf

Description:
-

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
-

:

2051429_Suppl.pdf (Supplementary material), 276KB

View Save

File Permalink:
https://hdl.handle.net/11858/00-001M-0000-0024-1719-D

Name:
2051429_Suppl.pdf

Description:
-

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
-

Locators

show

hide

Locator:
http://bioinformatics.oxfordjournals.org/content/30/16/2255.full.pdf+html (Publisher version) Open Access status unknown

Description:
-

OA-Status:

Creators

show

hide

Creators:
Futschik, A., Author
Hotz, T., Author
Munk, A.¹, Author
Sieling, H., Author

Affiliations:
1Research Group of Statistical Inverse-Problems in Biophysics, MPI for biophysical chemistry, Max Planck Society, ou_1113580

Content

show

hide

Free keywords: -

Abstract: Motivation: DNA segmentation, i.e. the partitioning of DNA in compositionally homogeneous segments, is a basic task in bioinformatics. Different algorithms have been proposed for various partitioning criteria such as Guanine/Cytosine (GC) content, local ancestry in population genetics or copy number variation. A critical component of any such method is the choice of an appropriate number of segments. Some methods use model selection criteria and do not provide a suitable error control. Other methods that are based on simulating a statistic under a null model provide suitable error control only if the correct null model is chosen. Results: Here, we focus on partitioning with respect to GC content and propose a new approach that provides statistical error control: as in statistical hypothesis testing, it guarantees with a user-specified probability Graphic that the number of identified segments does not exceed the number of actually present segments. The method is based on a statistical multiscale criterion, rendering this as a segmentation method that searches segments of any length (on all scales) simultaneously. It is also accurate in localizing segments: under benchmark scenarios, our approach leads to a segmentation that is more accurate than the approaches discussed in the comparative review of Elhaik et al. In our real data examples, we find segments that often correspond well to features taken from standard University of California at Santa Cruz (UCSC) genome annotation tracks.

Details

show

hide

Language(s): eng - English

Dates: Published Online: 2014-04-21Date issued: 2014-08-15

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: Peer

Identifiers: DOI: 10.1093/bioinformatics/btu180

Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show

hide

Title: Bioinformatics

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: -

Pages: - Volume / Issue: 30 (16) Sequence Number: - Start / End Page: 2255 - 2262 Identifier: -