English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  An improved compound Poisson model for the number of motif hits in DNA sequences

Kopp, W., & Vingron, M. (2017). An improved compound Poisson model for the number of motif hits in DNA sequences. Bioinformatics, 33(24), 3929-3937. doi:10.1093/bioinformatics/btx539.

Item is

Files

show Files
hide Files
:
Kopp.pdf (Publisher version), 728KB
Name:
Kopp.pdf
Description:
-
OA-Status:
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
© The Author(s) 2017. Published by Oxford University Press.
License:
ttp://creativecommons.org/licenses/by/ 4.0/

Locators

show
hide
Description:
-
OA-Status:

Creators

show
hide
 Creators:
Kopp, W.1, Author           
Vingron, M.2, Author           
Affiliations:
1IMPRS for Computational Biology and Scientific Computing - IMPRS-CBSC (Kirsten Kelleher), Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_1479666              
2Gene regulation (Martin Vingron), Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_1479639              

Content

show
hide
Free keywords: -
 Abstract: Motivation: Transcription factors play a crucial role in gene regulation by binding to specific regulatory sequences. The sequence motifs recognized by a transcription factor can be described in terms of position frequency matrices. When scanning a sequence for matches to a position frequency matrix, one needs to determine a cut-off, which then in turn results in a certain number of hits. In this paper we describe how to compute the distribution of match scores and of the number of motif hits, which are the prerequisites to perform motif hit enrichment analysis. Results: We put forward an improved compound Poisson model that supports general order- d Markov background models and which computes the number of motif-hits more accurately than earlier models. We compared the accuracy of the improved compound Poisson model with previously proposed models across a range of parameters and motifs, demonstrating the improvement. The importance of the order- d model is supported in a case study using CpG-island sequences. Availability: The method is available as a Bioconductor package named ' motifcounter ' https://bioconductor.org/packages/motifcounter. Supplementary information: Supplementary data are available at Bioinformatics online.

Details

show
hide
Language(s): eng - English
 Dates: 2017-08-252017-08-282017-12-15
 Publication Status: Issued
 Pages: 9
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.1093/bioinformatics/btx539
ISSN: 1367-4811 (Electronic)1367-4803 (Print)
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Bioinformatics
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: Oxford : Oxford University Press
Pages: - Volume / Issue: 33 (24) Sequence Number: - Start / End Page: 3929 - 3937 Identifier: ISSN: 1367-4803
CoNE: https://pure.mpg.de/cone/journals/resource/954926969991