de.mpg.escidoc.pubman.appbase.FacesBean
English
 
Help Guide Disclaimer Contact us Login
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Book Chapter

Efficient string mining under constraints via the deferred frequency index

MPS-Authors

Schulz,  Marcel H.
Max Planck Society;

Locator
There are no locators available
Fulltext (public)
There are no public fulltexts available
Supplementary Material (public)
There is no public supplementary material available
Citation

Weese, D., & Schulz, M. H. (2008). Efficient string mining under constraints via the deferred frequency index. In P. Perner (Ed.), Advances in Data Mining. Medical Applications, E-Commerce, Marketing, and Theoretical Aspects. Berlin/Heidelberg: Springer.


Cite as: http://hdl.handle.net/11858/00-001M-0000-0010-7F86-8
Abstract
We propose a general approach for frequency based string mining, which has many applications, e.g. in contrast data mining. Our contribution is a novel algorithm based on a deferred data structure. Despite its simplicity, our approach is up to 4 times faster and uses about half the memory compared to the best-known algorithm of Fischer et al. Applications in various string domains, e.g. natural language, DNA or protein sequences, demonstrate the improvement of our algorithm.