Wednesday, March 31, 2021

International Journal of Network Security & Its Applications (IJNSA)

ISSN: 0974 - 9330 (Online); 0975 - 2307 (Print)

http://airccse.org/journal/ijnsa.html

A New Stemmer to Improve Information Retrieval

Wahiba Ben Abdessalem Karaa, University of Tunis, Tunisia

ABSTRACT

Stemming is a technique that reduces words to their root form by removing derivational and inflectional affixes. It is widely used in information retrieval tasks, and many researchers have demonstrated that it improves the performance of information retrieval systems. The Porter stemmer is the most common algorithm for English stemming. However, it has several drawbacks, since its simple rules cannot fully describe English morphology. Errors made by this stemmer may affect information retrieval performance.
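To make the idea of affix removal concrete, here is a minimal sketch of a suffix-stripping stemmer. It is a toy illustration only, not the Porter algorithm and not the stemmer proposed in the paper; the function name `toy_stem` and the rule list `SUFFIXES` are assumptions made for this example.

```python
# Toy suffix-stripping stemmer: an illustration only, NOT the Porter algorithm.
# SUFFIXES is a hypothetical rule list chosen for this example.
SUFFIXES = ("ing", "ed", "es", "s")


def toy_stem(word: str) -> str:
    """Strip the first matching suffix, keeping at least three characters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


for w in ["connected", "connecting", "connects", "run", "running", "new", "news"]:
    print(w, "->", toy_stem(w))
# connected -> connect
# connecting -> connect
# connects -> connect
# run -> run
# running -> runn
# new -> new
# news -> new
```

Even this small rule set shows how simple rules can go wrong: "run" and "running" end up with different stems, while the unrelated words "new" and "news" are merged into one.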

This paper proposes an improved version of the original Porter stemming algorithm for the English language. The proposed stemmer is evaluated using the error counting method, in which the performance of a stemmer is measured by counting its understemming and overstemming errors. The results show an improvement in stemming accuracy compared with the original stemmer and with other stemmers such as the Paice and Lovins stemmers. We also show that the new version of the Porter stemmer affects information retrieval performance.
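The counting of understemming and overstemming errors can be sketched as follows. This is a simplified pair-counting version, assuming the test words are already sorted into groups of related words; it is not necessarily the exact procedure used in the paper. The names `count_stemming_errors` and `strip_s`, and the sample word groups, are hypothetical.

```python
from itertools import combinations


def count_stemming_errors(groups, stem):
    """Return (understemming_index, overstemming_index) by pair counting.

    groups: list of word lists; words in one list should share a stem.
    stem:   any function mapping a word to its stem.
    """
    labelled = [(word, gid) for gid, words in enumerate(groups) for word in words]
    under = same = over = diff = 0
    for (w1, g1), (w2, g2) in combinations(labelled, 2):
        merged = stem(w1) == stem(w2)
        if g1 == g2:
            same += 1
            if not merged:  # related words left apart: understemming
                under += 1
        else:
            diff += 1
            if merged:  # unrelated words conflated: overstemming
                over += 1
    ui = under / same if same else 0.0
    oi = over / diff if diff else 0.0
    return ui, oi


def strip_s(word):
    """Hypothetical stand-in stemmer: drop a trailing 's' from longer words."""
    return word[:-1] if word.endswith("s") and len(word) > 3 else word


# Made-up sample groups, for illustration only.
groups = [["run", "runs", "running"], ["new", "newer"], ["news"]]
ui, oi = count_stemming_errors(groups, strip_s)
print(ui, oi)
```

With these sample groups, three of the four same-group pairs stay unmerged (understemming index 0.75), and one of the eleven cross-group pairs, "new" and "news", is wrongly merged (overstemming index 1/11). Lower values on both indices indicate a more accurate stemmer.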

Keywords

Stemming, Porter stemmer, information retrieval

Original Source URL: http://airccse.org/journal/nsa/5413nsa11.pdf

Volume Link: http://airccse.org/journal/jnsa13_current.html


International Journal of Network Security & Its Applications (IJNSA) - ERA, WJCI Indexed
