English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Conference Paper

Accurate Splice site Prediction Using Support Vector Machines

MPS-Authors
/persons/resource/persons84204

Schweikert,  G
Max Planck Institute for Biological Cybernetics, Max Planck Society;
Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society;
Friedrich Miescher Laboratory, Max Planck Society;

Philips ,  P
Friedrich Miescher Laboratory, Max Planck Society;

Behr,  J
Friedrich Miescher Laboratory, Max Planck Society;

/persons/resource/persons84153

Rätsch,  G
Friedrich Miescher Laboratory, Max Planck Society;

Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)
There are no public fulltexts stored in PuRe
Supplementary Material (public)
There is no public supplementary material available
Citation

Sonnenburg, S., Schweikert, G., Philips, P., Behr, J., & Rätsch, G. (2007). Accurate Splice site Prediction Using Support Vector Machines. BMC Bioinformatics, 8(Supplement 10), 1-16.


Cite as: https://hdl.handle.net/11858/00-001M-0000-0013-CAE1-8
Abstract
Background: For splice site recognition, one has to solve two classification problems:
discriminating true from decoy splice sites for both acceptor and donor sites. Gene finding systems
typically rely on Markov Chains to solve these tasks.
Results: In this work we consider Support Vector Machines for splice site recognition. We employ
the so-called weighted degree kernel which turns out well suited for this task, as we will illustrate in
several experiments where we compare its prediction accuracy with that of recently proposed
systems. We apply our method to the genome-wide recognition of splice sites in Caenorhabditis
elegans, Drosophila melanogaster, Arabidopsis thaliana, Danio rerio, and Homo sapiens. Our
performance estimates indicate that splice sites can be recognized very accurately in these genomes
and that our method outperforms many other methods including Markov Chains, GeneSplicer and
SpliceMachine. We provide genome-wide predictions of splice sites and a stand-alone prediction
tool ready to be used for incorporation in a gene finder.
Availability: Data, splits, additional information on the model selection, the whole genome
predictions, as well as the stand-alone prediction tool are available for download at http://
www.fml.mpg.de/raetsch/projects/splice.