Mining Patients’ Data for Effective Tuberculosis Diagnosis: The Case of Menelik II Hospital

Nesredin, Asia (2012) Mining Patients’ Data for Effective Tuberculosis Diagnosis: The Case of Menelik II Hospital. Masters thesis, Addis Ababa University.

[img] PDF (Mining Patients’ Data for Effective Tuberculosis Diagnosis: The Case of Menelik II Hospital)
Asia, Nesredin.pdf - Accepted Version
Restricted to Repository staff only

Download (837kB) | Request a copy

Abstract

TB is a common and deadly infectious disease that can occur at any age. The incidence of TB has more than doubled in Africa during the last two decades. Ethiopia is also ranked 8th among the 22 countries with the highest TB burden in the world. On the other hand, the advances in computing and information storage have provided vast amounts of data. However, the challenge has been to extract knowledge from this raw data. This has lead to new methods and techniques such as data mining that can bridge the knowledge gap. Data mining can be used to model health care problems. This research aimed to apply data mining techniques to patients’ data to establish meaningful relationships or patterns for effective TB diagnosis. In general, 7069 data sets were extracted from Menelik II hospital. The data set contains patients’ detail information. The research establishes whether patients’ data are classified using various data mining techniques for predicting purpose. The research specifically look at the use of clustering algorithm followed by classification for a data mining approach to help identify patients patterns and speed up the process of TB diagnosis system. The study has tried to apply k-means clustering with some enhancements to aid in the process of segmenting the dataset into TB-positive and TB-negative. The resulting cluster is then used for developing the classification model. Classification was employed in the study to identify patterns and predict the occurrence of TB. The classification task was made using J48 decision tree and Naïve Bayes classification algorithms and different experimentations was conducted. The model developed for predicting purpose has an accuracy of 85.93%. The discovered knowledge from the J48 decision tree is presented by traversing the tree for the ease of understanding. The outcome of the research can have many benefits, to the organization especially for TB diagnosis activities.

Item Type: Thesis (Masters)
Subjects: H Social Sciences > H Social Sciences (General)
R Medicine > RA Public aspects of medicine
R Medicine > RA Public aspects of medicine > RA0421 Public health. Hygiene. Preventive Medicine
Z Bibliography. Library Science. Information Resources > Z665 Library Science. Information Science
Divisions: Africana
Depositing User: Selom Ghislain
Date Deposited: 24 Aug 2018 12:08
Last Modified: 24 Aug 2018 12:08
URI: http://thesisbank.jhia.ac.ke/id/eprint/4900

Actions (login required)

View Item View Item