IMPROVED PERFORMANCE OF STEMMING USING EFFICIENT STEMMER ALGORITHM FOR INFORMATION RETRIEVAL

Ramalingam Sugumar

Abstract


with the vast amount of digital text available in many languages, it has become important to develop several language processing tools that could efficiently manage the large text databases. In many Natural Language Processing (NLP) and Information Retrieval (IR) applications, building of vocabulary of words and language models is an important task in text mining. But a large number of morphological variations in the words, especially in morphologically rich languages, pose a great challenge. In Text mining, stemming is an important pre-processing technique that can handle these variations. The Efficient Stemmer algorithm is extension version of enhanced porter stemmer. The Efficient Stemmer algorithm performance is compared with several algorithms such as porter, new porter and etc. The performance of the Enhanced Porter Stemmer is better than others.                                                                              

Keywords— Text Classification, Pre-processing, Stemming Techniques, Enhanced Porter Stemmer Algorithm, Efficient stemmer

Full Text:

PDF

Refbacks

  • There are currently no refbacks.


© 2017 International Journal of Global Research in Computer Science (JGRCS)
Copyright Agreement & Authorship Responsibility