de.mpg.escidoc.pubman.appbase.FacesBean
English
 
Help Guide Disclaimer Contact us Login
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Detection of copy number variants in sequencing data.

Neubert, K. (2010). Detection of copy number variants in sequencing data. Master Thesis, FREIE UNIVERSITAET BERLIN, Berlin.

Item is

Basic

show hide
Item Permalink: http://hdl.handle.net/11858/00-001M-0000-0010-7A5F-E Version Permalink: http://hdl.handle.net/11858/00-001M-0000-0010-7A60-8
Genre: Thesis

Files

show Files
hide Files
:
neubert_master2010.pdf (Any fulltext), 2MB
Description:
-
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
eDoc_access: PUBLIC
License:
-

Locators

show

Creators

show
hide
 Creators:
Neubert, Kerstin1, Author              
Affiliations:
1Bioinformatics (Ralf Herwig), Dept. of Vertebrate Genomics (Head: Hans Lehrach), Max Planck Institute for Molecular Genetics, Max Planck Society, escidoc:1479648              

Content

show
hide
Free keywords: -
 Abstract: In this work a program for detection of CNVs in sequencing data based on depth of coverage was implemented in C++ (copyDOC). Single steps in the pipeline, the acquisition of DOC signals in windows, the event calling and merging are implemented using generic programming techniques that enable the future integration of other algorithms in the pipeline. Furthermore, a testing environment was implemented, the copySim platform, which is very useful for testing and evaluation of different algorithms. CopyDOC was successfully applied to synthetic and real data using constant sized windows. Dynamic windows, that adapt according to the local mappability of the sequence, are implemented in the pipeline, but could not be tested in this work. They might be advantageous in datasets that contain uniquely mapped reads. However, CNVs have been shown to be overrepresented in segmental duplications (Nguyen et al. 2006; Cooper et al. 2007) and by a general exclusion of multireads those CNVs might be difficult to ascertain. In the application of copyDOC to a 1000 genomes dataset the overlap of predicted variants was considerable higer using multireads compared to uniquely mapped reads. Thus there is a requirement for tools that can handle multireads. Futher improvements of copyDOC might be done for the CNV calling algorithm and the merging step. For example the program workflow could be tested with a direct comparison of the DOC signals in two datasets via log ratios instead of appling a t-test on DOC signals in the two datasets. CopyDOC and copySim could be used as platform for the implementation and evaluation of futher CNV detection algorithms.

Details

show
hide
Language(s): eng - English
 Dates: 2010-09-10
 Publication Status: Accepted / In Press
 Pages: 54
 Publishing info: Berlin : FREIE UNIVERSITAET BERLIN
 Table of Contents: -
 Rev. Method: -
 Identifiers: eDoc: 557058
 Degree: Master

Event

show

Legal Case

show

Project information

show

Source

show