Shebeshe, Fasika Tesfaye (2010) Phrasal Translation for Amharic English Cross Language Information Retrieval (CLIR). Masters thesis, Addis Ababa University.
PDF (Phrasal Translation for Amharic English Cross Language Information Retrieval (CLIR))
Fasika, Tesfaye Shebeshe.pdf - Accepted Version Restricted to Repository staff only Download (715kB) | Request a copy |
Abstract
Amharic is a language most widely used in Ethiopia and serve as the official working language of the Federal Democratic Republic of Ethiopia. Despite this fact, English serves as medium of instruction and communication in academic environment, working language in some governmental and nongovernmental organizations in Ethiopia. This fact showed that there is a language barrier between what most peoples of Ethiopia are familiar with and expected to use in their working and academic environment. Hence, experimenting on the applicability of a cross language information retrieval system for Amharic-English which can break the language barrier is important. This research is mainly conducted to break the language barrier that Amharic speaking users face in obtaining and utilizing documents available in English. The experimentation conduct is employed a corpus based approach which make use of phrasal query translation. This approach requires accessibility of a large volume of parallel documents prepared in Amharic and English. News article were used to conduct this research. The performance of the system was measured by average precision and recall. The result of the experimentation is recall value of 0.248 for translated Amharic queries, 0.463 for Amharic queries 0.436 for the baseline English queries. This showed that the result of the translated queries was low compared to the baseline queries. The performance of such system is highly dependent on the phrase translation system. Hence coming up with a good translation model will have a paramount impact on the performance of the system. Therefore, with the use of adequately large and cleaned parallel Amharic-English corpus, it is possible to develop a phrasal query translation for Amharic English a cross language information retrieval.
Item Type: | Thesis (Masters) |
---|---|
Uncontrolled Keywords: | phrasal query translation, Cross Language Information Retrieval, phrase alignment |
Subjects: | P Language and Literature > PE English P Language and Literature > PL Languages and literatures of Eastern Asia, Africa, Oceania Z Bibliography. Library Science. Information Resources > Z665 Library Science. Information Science |
Divisions: | Africana |
Depositing User: | Selom Ghislain |
Date Deposited: | 05 Oct 2018 13:50 |
Last Modified: | 05 Oct 2018 13:50 |
URI: | http://thesisbank.jhia.ac.ke/id/eprint/6759 |
Actions (login required)
View Item |