Conference Paper

On Designing an Automated Malaysian Stemmer for the Malay Language

MPS-Authors
http://pubman.mpdl.mpg.de/cone/persons/resource/persons84118

Ong, CS
Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society;

Citation

Tai, S., Ong, C., & Abdullah, N. (2000). On Designing an Automated Malaysian Stemmer for the Malay Language. Proceedings of the Fifth International Workshop on Information Retrieval with Asian Languages, 207-208.


Cite as: http://hdl.handle.net/11858/00-001M-0000-0013-E447-3
Abstract
Online and interactive information retrieval systems are likely to play an increasing role in the Malay-language community. To facilitate and automate the matching of morphological term variants, a stemmer based on common affix-removal algorithms is proposed as part of the design of an information retrieval system for the Malay language. Stemming is the morphological process of normalizing word tokens to their essential roots. The proposed stemmer strips prefixes and suffixes from each word. An experiment conducted on web sites selected from the World Wide Web showed substantial improvements in the number of words indexed.
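To make the idea of affix removal concrete, the following is a minimal sketch of a prefix-and-suffix stripper in Python. It is not the stemmer described in the paper: the affix lists, the minimum stem length, and the function name `strip_affixes` are all assumptions chosen for illustration, and a real Malay stemmer would also need a root-word dictionary and spelling-variation rules.

```python
# Illustrative affix lists (assumed for this sketch, not taken from the paper).
# Each tuple is ordered longest first so that "mem" is tried before "me".
PREFIXES = ("meng", "mem", "men", "ber", "ter", "me", "di", "ke", "se")
SUFFIXES = ("kan", "nya", "lah", "kah", "an", "i")

# Assumed minimum length of the remaining stem, to limit over-stripping.
MIN_STEM = 4


def strip_affixes(word: str) -> str:
    """Remove at most one prefix and one suffix from a word token."""
    stem = word.lower()

    # Strip the first matching prefix that leaves a long enough stem.
    for prefix in PREFIXES:
        if stem.startswith(prefix) and len(stem) - len(prefix) >= MIN_STEM:
            stem = stem[len(prefix):]
            break

    # Strip the first matching suffix that leaves a long enough stem.
    for suffix in SUFFIXES:
        if stem.endswith(suffix) and len(stem) - len(suffix) >= MIN_STEM:
            stem = stem[: -len(suffix)]
            break

    return stem


if __name__ == "__main__":
    # Term variants that an index would ideally group under one root.
    for token in ("makan", "makanan", "dimakan", "membaca", "dibacakan"):
        print(token, "->", strip_affixes(token))
```

With these assumed lists, "makan", "makanan", and "dimakan" all reduce to "makan", and "membaca" and "dibacakan" both reduce to "baca", so an index keyed on the stripped form would match the variants against one another. The minimum stem length is a crude guard; without it, a word such as "jalan" would lose its final "an".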