Classification of Diabetes Mellitus using the K-Nearest Neighbor (KNN) Algorithm: A Case Study of Patient Data at Salatiga Regional Hospital

Gwen Theresia Grandis Aritonang, Magdalena A. Ineke Pakereng

Abstract


One of the most common metabolic diseases in Indonesia, including at RSUD Salatiga, is Diabetes Mellitus. Early diagnosis of this disease is crucial to prevent the development of more severe complications; however, this process is often challenging because the initial symptoms are difficult to recognize. This study implements the K-Nearest Neighbor (KNN) algorithm as a classification technique for predicting diabetes mellitus risk based on clinical data from patients at RSUD Salatiga. The dataset used in this research included variables such as gender, age, hypertension history, glucose level, HbA1c level, smoking status, liver disease history, and Body Mass Index (BMI). The research stages consisted of data collection, preprocessing (data cleaning, normalization, and variable encoding), feature selection, and optimization of the k parameter. Model evaluation was conducted using a confusion matrix with performance metrics including accuracy, precision, recall, and F1-score. The results indicate that the KNN algorithm achieved the highest accuracy of 92.08% when feature selection was applied with k = 6. This improvement demonstrates that both k-parameter optimization and feature selection significantly affect model performance. Therefore, the KNN algorithm can serve as an effective early prediction tool to assist medical personnel in identifying diabetes mellitus patients, enabling faster and more accurate interventions.

Keywords


classification; diabetes mellitus; early prediction; feature selection; k-nearest neighbor (KNN)

Full Text:

PDF

References


I. A. K. Adi and W. A. E. Prabowo, “Klasifikasi Penyakit Diabetes menggunakan Pendekatan Pembelajaran Mesin dengan Model Non-linier,” JURIKOM (Jurnal Ris. Komputer), Vol. 12, No. 3, pp. 262–268, 2025, DOI: 10.30865/jurikom.v12i3.8586.

Lestari, Zulkarnain, Sijid, and S. Aisyah, “Diabetes Melitus: Review Etiologi, Patofisiologi, Gejala, Penyebab, Cara Pemeriksaan, Cara Pengobatan dan Cara Pencegahan,” UIN Alauddin Makassar, Vol. 1, No. 2, pp. 237–241, 2021, [Online]. Available: http://journal.uin-alauddin.ac.id/index.php/psb

S. S. Bhat, V. Selvam, and G. A. Ansari, “Predicting Life Style of Early Diabetes Mellitus using Machine Learning Technique,” Int. J. Comput., Vol. 22, No. 3, pp. 345–351, 2023, DOI: 10.47839/ijc.22.3.3230.

S. S. Putro et al., “Classification of Diabetes Mellitus Disease at Rato Ebuh Hospital-Indonesia using the K-Nearest Neighbors Method based on Missing Value,” BIO Web Conf., Vol. 146, 2024, DOI: 10.1051/bioconf/202414601081.

M. Saputra, J. P. Sidabuke, R. P. Sinulingga, and R. B. Tamba, “Analisis Metode Algoritma K-Nearest Neighbor (KNN) dan Naive Bayes untuk Klasifikasi Diabetes Mellitus,” J. TEKINKOM, Vol. 6, No. 2, p. 2023, 2023, DOI: 10.37600/tekinkom.v6i2.942.

D. Devian, P. Nurul Sabrina, and A. Komarudin, “Prediksi Penyakit Diabetes dengan Metode K-Nearest Neighbor (KNN) dan Seleksi Fitur Information Gain,” JATI (Jurnal Mhs. Tek. Inform., Vol. 8, No. 6, pp. 11320–11326, 2024, DOI: 10.36040/jati.v8i6.11364.

A. Mulyani, S. Khoerunisa, and D. Kurniadi, “Comparison of KNN and SVM Algorithms Performance using SMOTE to Classify Diabetes,” J. Nas. Tek. Elektro dan Teknol. Inf., Vol. 14, No. 1, pp. 25–34, 2025.

Dewi Nasien et al., “Perbandingan Implementasi Machine Learning menggunakan Metode KNN, Naive Bayes, dan Logistik Regression untuk mengklasifikasi Penyakit Diabetes,” JEKIN - J. Tek. Inform., Vol. 4, No. 1, pp. 10–17, 2024, DOI: 10.58794/jekin.v4i1.640.

D. Fabiyanto and Y. Rianto, “Performance Evaluation of Multiple Machine Learning Models for Wine Quality Prediction Evaluasi Kinerja Multiple Model Machine Learning untuk Prediksi Kualitas Wine,” J. Inform. dan Teknol. Inf., Vol. 21, No. 2, pp. 209–223, 2024, DOI: 10.31515/telematika.v21i2.

M. D. Nurmalasari, K. Kusrini, and S. Sudarmawan, “Komparasi Algoritma Naive Bayes dan K-Nearest Neighbor untuk membangun Pengetahuan Diagnosa Penyakit Diabetes,” J. Komtika (Komputasi dan Inform., Vol. 5, No. 1, pp. 52–59, 2021, DOI: 10.31603/komtika.v5i1.5140.

N. Trisna and A. Mahessya, “Prediksi Risiko Diabetes Tahap Awal menggunakan Machine Learning dengan Algoritma K-Nearest Neighbor,” Vol. 5, No. 2, pp. 481–487, 2025, [Online]. Available: https://jurnal.pustakagalerimandiri.co.id/index.php/pustakadataDOI:https://doi.org/10.55382/jurnalpustakadata.v5i2.1550

D. Kuswandani and R. Umar, “Perbandingan Metode Seleksi Fitur Filter dan Wrapper pada Klasifikasi Risiko Penyakit Jantung menggunakan k - NN Comparison of Filter and Wrapper Feature Selection Methods for Heart Disease Risk Classification using K - Nearest Neighbors ( k - NN ),” Vol. 15, pp. 871–885, 2026.

R. Hadi, N. L. G. P. Suwirmayanti, I. G. N. A. Kusuma, I. G. A. D. Saryanti, and P. D. Novayanti, “Implementasi Metode Normalisasi dan Seleksi Fitur dalam Optimasi Algoritma K-Nearest Neighbor (KNN) untuk Klasifikasi Data Bank,” J. Nas. Komputasi dan Teknol. Inf., Vol. 7, No. 5, pp. 1064–1071, 2024, DOI: 10.32672/jnkti.v7i5.7989.

U. I. Lestari, “Penerapan Metode K-Nearest Neighbor untuk Sistem Pendukung Keputusan Identifikasi Penyakit Diabetes Melitus,” JATISI (Jurnal Tek. Inform. dan Sist. Informasi), Vol. 8, No. 4, pp. 2071–2082, 2021, DOI: 10.35957/jatisi.v8i4.1235.

E. Kavlakoglu, “Apa Algoritma K-Tetangga Terdekat (KNN)?,” IBM Research. [Online]. Available: https://www.ibm.com/id-id/think/topics/knn




DOI: https://doi.org/10.32520/stmsi.v15i5.6306

Article Metrics

Abstract view : 0 times
PDF - 0 times

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.