A Comparison of Data Mining Algorithms for Liver Disease Prediction on Imbalanced Data

Ain Najwa Arbain; B. Yushalinie Pillay Balakrishnan

doi:10.69511/ijdsaa.v1i1.2

A Comparison of Data Mining Algorithms for Liver Disease Prediction on Imbalanced Data

Authors

Ain Najwa Arbain Asia Pacific University of Technology & Innovation, Kuala Lumpur, Malaysia
B. Yushalinie Pillay Balakrishnan Asia Pacific University of Technology & Innovation, Kuala Lumpur, Malaysia

DOI:

https://doi.org/10.69511/ijdsaa.v1i1.2

Keywords:

Liver disease Prediction, Imbalanced Data, Data Mining, Classification

Abstract

Liver is one of the most important organs in the human body but due to unhealthy lifestyle and excessive alcohol intake, liver disease has been increasing at an alarming rate globally hence it calls for an immediate attention to predict the disease before it is too late. However, medical data is often associated to be imbalanced and complex. Hence, the aim of this project is to investigate the data mining algorithm to predict liver disease on imbalanced data through random sampling. Results are compared and analysed based on accuracy and ROC index. K-Nearest Neighbour (k-NN) outperforms the other algorithms such as Logistic Regression, AutoNeural and Random Forest with the accuracy of 99.794%. As a conclusion, the model proposed in this research is performing better than past researchers conducted on Andhra Pradesh liver disease dataset.

Downloads

Published

2019-02-09

How to Cite

Arbain, A. N., & Balakrishnan, B. Y. P. (2019). A Comparison of Data Mining Algorithms for Liver Disease Prediction on Imbalanced Data. International Journal of Data Science and Advanced Analytics, 1(1), 1–11. https://doi.org/10.69511/ijdsaa.v1i1.2

Download Citation

Issue

Vol. 1 No. 1 (2019)

Section

Articles

License

International Journal of Data Science and Advanced Analytics (IJDSAA) is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. This license allows users to copy, distribute and transmit an article, adapt the article as long as the author is attributed and the article is not used for commercial purposes.

The author(s) confirms

The manuscript submission has not been previously published, nor is it before another journal for consideration (or an explanation has been provided in Comments to the Editor).
The published materials used in the manuscript were obtained permission for reproduction. (if any)

A Comparison of Data Mining Algorithms for Liver Disease Prediction on Imbalanced Data

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Similar Articles

Make a Submission