非表示:
キーワード:
position weight matrix; binding site clusters; count statistics; number of occurrences; overlapping occurrences
要旨:
Transcription factors (TFs) play a key role in gene regulation by binding to target sequences.
In silico prediction of potential binding to a sequence is a main task in computational biology.
Although many methods have been proposed to tackle this problem, the statistical significance of
the prediction is still not solved. We propose an approach to give a good approximation for the
potential of a sequence to be bound by a TF. Instead of assessing distinct binding sites, we motivate
to focus on the number of binding sites. Based on a suitable statistical model, probabilities for
scoring are approximated for a TF to bind to a sequence. Two examples show the necessity of such
a model as well as the superiority of the proposed method compared to standard approaches.