Automatic Construction of Amharic Semantic Networks (ASNet)

Tefera, Alelgn (2013) Automatic Construction of Amharic Semantic Networks (ASNet). Masters thesis, Addis Ababa University.

[img] PDF (Automatic Construction of Amharic Semantic Networks (ASNet))
Alelign, Tefera.pdf - Accepted Version
Restricted to Repository staff only

Download (1MB) | Request a copy

Abstract

Semantic networks are becoming popular issues these days. Even though this popularity is mostly related to the idea of semantic web, it is also related to the natural language applications. Semantic networks allow search engines to search not only for the key words given by the user but also for the related concepts, and show how this relation is made. Knowledge stored as semantic networks can be used by programs that generate text from structured data. Semantic networks are also used for document summarization by compressing the data semantically and document classification using the knowledge stored in it. As a result, semantic networks have become key components in many NLP applications. In this thesis, we focused on the construction of semantic networks for Amharic text. We have developed Amharic WordNet as initial knowledge base for the system and extracted intervening word patterns between pairs of concepts in the WordNet for a specific relation from free text. For each pair of concepts which we know the relationship contained in Amharic WordNet, we search the corpus for some text snapshot between these concepts. The returned text snapshot is processed to extract all the patterns having n-gram words between the two concepts. We have used the WordSpace model for extraction of semantically related concepts. The process of relation identification in among these concepts utilizes the extracted text patterns. “Part-of” and “type-of” relations are very popular and frequently found between concepts in any corpus. We have designed our system to extract “part-of” and “type-of” relations between concepts. The system was tested in three different phases with different datasets from Ethiopian News Agency and Walta Information Center. The accuracy of the system to extract pairs of concepts having “type-of” and “part-of” relations is 68.5% and 71.7% respectively.

Item Type: Thesis (Masters)
Uncontrolled Keywords: Concept, Semantic Relation, Distributional Semantics, Semantic Network, Knowledge base
Subjects: P Language and Literature > PL Languages and literatures of Eastern Asia, Africa, Oceania
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76 Computer software
T Technology > T Technology (General)
Divisions: Africana
Depositing User: Selom Ghislain
Date Deposited: 26 Jun 2018 08:24
Last Modified: 26 Jun 2018 08:24
URI: http://thesisbank.jhia.ac.ke/id/eprint/4579

Actions (login required)

View Item View Item