Birkute, Abraham Hailu (2013) Amharic Document Categorization Using Item sets Method. Masters thesis, Addis Ababa University.
PDF (Amharic Document Categorization Using Item sets Method)
Hailu, Abraham.pdf - Accepted Version Restricted to Repository staff only Download (691kB) | Request a copy |
Abstract
Document categorization or document classification is the process of assigning a document to one or more classes or categories. Many researches are conducted in the area of Amharic document categorization. The main focus of those studies is to examine different document categorization techniques and measuring their performance however itemsets method is not so far examined. This study focused to extend Apriori algorithm which is traditionally used for the purpose of knowledge mining in the form of association rules. The research focused on the basic principles of applying itemsets method to categorize Amharic documents. In addition to that the implementation of all the required tools which helps to carry out automatic Amharic Document categorization using itemsets method is developed and the algorithm is examined. Experiment results show itemsets method is an efficient method to categorize Amharic documents. The effectiveness and accuracy of the method to categorize Amharic documents is also evaluated and reported. Finally, factors affecting the performance of the proposed system and the importance of preprocessing training dataset in finding useful information are discussed.
Item Type: | Thesis (Masters) |
---|---|
Subjects: | P Language and Literature > P Philology. Linguistics P Language and Literature > PL Languages and literatures of Eastern Asia, Africa, Oceania T Technology > T Technology (General) |
Divisions: | Africana |
Depositing User: | Selom Ghislain |
Date Deposited: | 18 Jun 2018 09:45 |
Last Modified: | 18 Jun 2018 09:45 |
URI: | http://thesisbank.jhia.ac.ke/id/eprint/4352 |
Actions (login required)
View Item |