Comparative Evaluation of Predictive Models on Kidney, Lung Cancer and Heart Disease

  • Prof. Pradnya Bhangale
  • Aarya Shah
  • Dev Patel
  • Parth Shah
  • Sagar Salvi
Keywords: Machine learning, Lung cancer, Cardiovascular Disease, Kidney Disease, Prediction Accuracy, Hybrid Model

Abstract

This study supports advances in machine learning to improve early detection and treatment planning for lung cancer, cardiovascular disease, and kidney disease. We compare traditional models such as decision trees and logistic regression with complex techniques such as support vector machines, random forests, and KNN and evaluate them on publicly available data. This hybrid approach uses random forest and decision tree classifiers, leveraging adaptive learning to improve model accuracy. Results showed high prediction accuracy for kidney disease and lung cancer , while prediction accuracy for heart disease was average . This difference indicates the need for better work and more information. Future studies will focus on improving cardiovascular models, addressing data uncertainty, and integrating predictive models into clinical practice to support early diagnosis and personalized treatment to improve patient outcomes. This study demonstrates the potential for machine learning to have a major impact on diagnosis and patient management.

References

[1] Manurung, J., Perwira, Y., & Sinaga, B. (2022). Expert System to Diagnose Dental and Oral Disease Using Naive Bayes Method. In Proceedings of the 2022 IEEE International Conference of Computer Science and Information Technology (ICOSNIKOM) (pp. 1-5). Medan, Indonesia. DOI: 10.1109/ICOSNIKOM56551.2022.10034871.
[2] Xu, H., Kong, Y., & Tan, S. (2023). Predictive Modeling of Diabetic Kidney Disease using Random Forest Algorithm along with Features Selection. In Proceedings of the 2023 3rd International Conference on Intelligent Technologies (CONIT) (pp. 1-5).
[3] Harshini, P. S., Naresh, K., Pamulapati, S. R., & Lavanya, A. (2023). Diagnosis of Liver Diseases Using Machine Learning Algorithms and their Prediction Using Logistic Regression and ANN. In Proceedings of the 2023 3rd International Conference on Intelligent Technologies (CONIT) (pp. 1-5). Hyderabad, Telangana, India. DOI: 10.1109/CONIT59222.2023.10205819.
[4] Zhang, J., Jia, H., & Zhang, N. (2023). Alternate Support Vector Machine Decision Trees for Power Systems Rule Extractions. IEEE Transactions on Power Systems, 38(1), 1-5.
[5] Raju, C. G., Amudha, V., & S. G. (2023). Comparison of Linear Regression and Logistic Regression Algorithms for Ground Water Level Detection with Improved Accuracy. In Proceedings of the 2023 Eighth International Conference on Science Technology Engineering and Mathematics (ICONSTEM) (pp. 1-5). DOI: 10.1109/ICONSTEM56934.2023.10142495.
[6] Liu, J., Zhu, X., & Zhang, Y. (2020). Application of DE-GWO-SVM Algorithm in Business Order Prediction Model. In 2020 IEEE 11th International Conference on Software Engineering and Service Science (ICSESS) (pp. 1-5). DOI: 10.1109/ICSESS49938.2020.9237714.
[7] Wen, Y., He, X., Lv, D., & Li, F. (2023). Hybrid Algorithm of Gradient Boosted Decision Tree and Multiple Linear Regression and Its Application on Decision Prediction. In 2023 IEEE 3rd International Conference on Electronic Communications Internet of Things and Big Data (ICEIB) (pp. 1-5). DOI: 10.1109/ICEIB57887.2023.10170440.
[8] Patil, R., Devkar, A.,Patil, S., Raut, R., & Todakari, N. (2023). Earthquake Depth & Magnitude Prediction Model Using Artificial Neural Network. In 2023 4th International Conference for Emerging Technology (INCET) (pp. 1-5). DOI: 10.1109/INCET57972.2023.10170413.
[9] Liu, J., & Liu, F. (2023). A novel method for predicting the dynamics of carbon emissions for air transport processes. In Proceedings of the 42nd Chinese Control Conference (pp. 1-5). Tianjin, China. DOI: 10.1109/CCC.2023.1234567.
[10] Chandra, A., & Roy, S. (2023). On the Detection of Alzheimer’s Disease using Naïve Bayes Classifier. In 2023 International Conference on Microwave, Optical, and Communication Engineering (ICMOCE) (pp. 1-5). DOI: 10.1109/ICMOCE57812.2023.10166516.
[11] Kaur, M., Thacker, C., Goswami, L., Thamizhvani, T. R., Abdulrahman, I. S., & Raj, A. S. (2023). Alzheimer’s Disease Detection using Weighted KNN Classifier in Comparison with Medium KNN Classifier with Improved Accuracy. In 2023 3rd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE) (pp. 1-5). DOI: 10.1109/ICACITE57410.2023.10183208.
[12] Vasu, V. N., Madhusundar, N., Surendran, R., & Saravanan, M. S. (2022). Prediction of Defective Products Using Logistic Regression Algorithm against Linear Regression Algorithm for Better Accuracy. In 2022 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT) (pp. 1-5). DOI: 10.1109/3ICT56508.2022.9990653.
[13] Mostafi, S., Alghamdi, T., & Elgazzar, K. (2021). A Bayesian Linear Regression Approach to Predict Traffic Congestion. In 2021 IEEE 7th World Forum on Internet of Things (WF-IoT) (pp. 1-6). DOI: 10.1109/WF-IoT51360.2021.9595298.
[14] Alanezi, M. A., Mohamed, Z. S., Homeed, M. T., & Zeki, A. M. (2020). Comparing Naïve Bayes, Decision Tree and Logistic Regression Methods in Fraudulent Credit Card Transactions. In 2020 International Conference on Data Analytics for Business and Industry: Way Towards a Sustainable Economy (ICDABI) (pp. 1-5). DOI: 10.1109/ICDABI51230.2020.9325705.
[15] Kan, N., Li, C., Yang, C., Dai, W., Zou, J., & Xiong, H. (2021). Uncertainty-Aware Robust Adaptive Video Streaming with Bayesian Neural Network and Model Predictive Control. Big Data Mining and Analytics, 4(2), 116-123. DOI: 10.26599/BDMA.2020.9020016.
[16] Wu, J., Li, Z., & Yang, S. (2021). COVID-19 Dynamics Prediction by Improved Multi-Polynomial Regression Model. In 2021 International Conference on Data Science (CONFCDS) (pp. 1-5). DOI: 10.1145/3448734.3450847.
[17] Villavicencio, C. N., Jeng, J. H., & Hsieh, J. G. (2021). Support Vector Machine Modelling for COVID-19 Prediction based on Symptoms using R Programming Language. In 2021 International Conference on Machine Learning and Machine Intelligence (MLMI) (pp. 1-5). DOI: 10.1145/3490725.3490735.
[18] Gupta, V. K., Gupta, A., Kumar, D., & Sardana, A. (2021). Prediction of COVID-19 Confirmed, Death, and Cured Cases in India Using Random Forest Model. Big Data Mining and Analytics, 4(2), 116-123. DOI: 10.26599/BDMA.2020.9020016.
[19] Aaboub, F., Chamlal, H., & Ouaderhman, T. (2023). Analysis of the prediction performance of decision tree-based algorithms. In 2023 International Conference on Decision Aid Sciences and Applications (DASA) (pp. 1-5). DOI: 10.1109/DASA59624.2023.10286809.
[20] Bhadle, R. V., & Rathod, D. P. (2023). Support Vector Machine, Naïve Bayes, and Recurrent Neural Network to Detect Data Poisoning Attacks on Dataset. In 2023 5th Biennial International Conference on Nascent Technologies in Engineering (ICNTE) (pp. 1-5). DOI: 10.1109/ICNTE56631.2023.10146665.
[21] Yadav, K., & Singh, S. (2023). Loan Status Prediction using SVM and Logistic Regression. In 2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT) (pp. 1-5). DOI: 10.1109/ICCCNT56998.2023.10307473.
[22] Saisundar, A., & Devi, T. (2023). Accurate Human Palm Recognition System in Cybercrime Analysis using Naive Bayes in comparison with Decision Tree. In 2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF) (pp. 1-5). DOI: 10.1109/ICECON57129.2023.10083899.
[23] Lung Cancer Dataset
Link:https://www.kaggle.com/code/sandragracenelson/lung-cancer-prediction/input?select=survey+lung+cancer.csv
[24] Chronic Kidney Disease Dataset
Link:https://www.kaggle.com/code/mahmoudlimam/chronic-kidney-disease-clustering-and-prediction/input
[25] Cardiovascular Disease Dataset
Link:https://www.kaggle.com/datasets/sulianova/cardiovascular-diseasedataset
Published
2024-12-18
How to Cite
Bhangale, P. P., Shah, A., Patel, D., Shah, P., & Salvi, S. (2024). Comparative Evaluation of Predictive Models on Kidney, Lung Cancer and Heart Disease. Asian Journal For Convergence In Technology (AJCT) ISSN -2350-1146, 10(3), 1-6. https://doi.org/10.33130/AJCT.2024v10i03.004

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.