  Large Scale Genomic Sequence SVM Classifiers

Sonnenburg, S., Rätsch, G., & Schölkopf, B. (2005). Large Scale Genomic Sequence SVM Classifiers. In ICML Bonn (pp. 849). USA: ANY PUBLISHER.

Creators

 Creators:
Sonnenburg, S., Author
Rätsch, G. (1), Author
Schölkopf, B. (1), Author
De Raedt, L., Editor
Wrobel, S., Editor
Affiliations:
(1) Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795

Content

Free keywords: -
 Abstract: In genomic sequence analysis tasks such as splice site recognition or promoter identification, large amounts of training sequences are available, and indeed needed, to achieve sufficiently high classification performance. In this work we study two recently proposed and successfully used kernels, namely the Spectrum kernel and the Weighted Degree (WD) kernel. In particular, we suggest several extensions using suffix trees and modifications of an SMO-like SVM training algorithm in order to accelerate the training of the SVMs and their evaluation on test sequences. Our simulations show that for the Spectrum kernel and the WD kernel, large-scale SVM training can be accelerated by factors of 20 and 4, respectively, while using much less memory (e.g. no kernel caching). The evaluation on new sequences is often several thousand times faster using the new techniques (depending on the number of support vectors). Our method allows us to train on sets as large as one million sequences.
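
For readers unfamiliar with the two kernels named in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes the commonly used definitions: the Spectrum kernel as the inner product of k-mer count vectors, and the WD kernel as a weighted count of position-wise matching k-mers with weights beta_k = 2(d-k+1)/(d(d+1)). That weighting is an assumption here, since the abstract does not state it. The last two functions illustrate why evaluation can become independent of the number of support vectors: all support vectors are folded into one k-mer weight table. A plain Python dict stands in for the suffix tree mentioned in the abstract, the bias term is omitted, and all function and parameter names are hypothetical.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count every length-k substring (k-mer) of seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(x, y, k):
    """Spectrum kernel: inner product of the k-mer count vectors of x and y."""
    cx, cy = kmer_counts(x, k), kmer_counts(y, k)
    if len(cx) > len(cy):
        cx, cy = cy, cx  # iterate over the smaller table
    return sum(c * cy[u] for u, c in cx.items())


def wd_kernel(x, y, d):
    """Weighted Degree kernel of degree d for two equal-length sequences.

    Counts k-mers (k = 1..d) that match at the same position in x and y.
    The weights beta_k are an assumed, commonly used choice.
    """
    if len(x) != len(y):
        raise ValueError("WD kernel needs sequences of equal length")
    total = 0.0
    for k in range(1, d + 1):
        beta = 2.0 * (d - k + 1) / (d * (d + 1))
        matches = sum(x[i:i + k] == y[i:i + k] for i in range(len(x) - k + 1))
        total += beta * matches
    return total


def build_weight_table(support_seqs, coeffs, k):
    """Fold all support vectors into one k-mer -> weight table.

    coeffs[i] plays the role of alpha_i * y_i for support vector i.
    A dict is used here as a simple stand-in for a suffix tree.
    """
    table = {}
    for seq, a in zip(support_seqs, coeffs):
        for u, c in kmer_counts(seq, k).items():
            table[u] = table.get(u, 0.0) + a * c
    return table


def score(seq, table, k):
    """Spectrum-kernel SVM output (without bias) via table lookups only."""
    return sum(c * table.get(u, 0.0) for u, c in kmer_counts(seq, k).items())
```

With this layout, score(seq, table, k) gives the same value as summing coeffs[i] * spectrum_kernel(support_seqs[i], seq, k) over all support vectors, but its cost depends only on the length of seq, which is the intuition behind the large evaluation speedups reported above.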

Details

Language(s):
 Dates: 2005
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: BibTex Citekey: 3627
 Degree: -

Event

Title: ICML Bonn
Place of Event: -
Start-/End Date: -

Source 1

Title: ICML Bonn
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: USA : ANY PUBLISHER
Pages: -
Volume / Issue: -
Sequence Number: -
Start / End Page: 849
Identifier: -