T-Stem - A Superior Stemmer and Temporal Extractor for Arabic Texts

Stemming has a large effect on Arabic information indexing and retrieval, at least partially due to the highly inflected nature of the language. Our work demonstrates the process of improving other stemmers, mainly that of [1]. We reached a recall difference of 28% over the work of [1]. The main par...

Full description

Saved in:
Bibliographic Details
Main Author: Haraty, Ramzi A. (author)
Other Authors: Khatib, Samer A. (author)
Format: article
Published: 2005
Online Access:http://hdl.handle.net/10725/5119
http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php
http://www.dirf.org/jdim/v3n303.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Stemming has a large effect on Arabic information indexing and retrieval, at least partially due to the highly inflected nature of the language. Our work demonstrates the process of improving other stemmers, mainly that of [1]. We reached a recall difference of 28% over the work of [1]. The main part of improvement was due to the addition of more grammatical rules that facilitate the process of stemming.Following this part, we implemented a procedure that extracts the temporal references from the texts. This procedure is highly dependable on the stemming process. A list of all the temporal references is used. The type of the temporal word decides the procedure to treat this word and gives the importance of this temporal reference. These conditions, with the help of the stemmer, produced an excellent result of 95% precision rate and of 91% recall rate.