Indexing metadata

Detection of Graduation Potential in Prospective Students using the Random Forest Algorithm


 
Dublin Core PKP Metadata Items Metadata for this Document
 
1. Title Title of document Detection of Graduation Potential in Prospective Students using the Random Forest Algorithm
 
2. Creator Author's name, affiliation, country Puguh Hasta Gunawan; Indonesia
 
2. Creator Author's name, affiliation, country Irving Vitra Paputungan; Universitas Islam Indonesia; Indonesia
 
3. Subject Discipline(s) Informatika
 
3. Subject Keyword(s) Random Forest; Confusion Matrix; Graduation Detection
 
4. Description Abstract Detecting students’ graduation potential is commonly performed by evaluating various academic and non-academic factors. This study aims to develop a predictive model for student graduation from the beginning of their academic journey, utilizing high school academic data such as grades, attendance, study hours, as well as demographic and social factors. The goal is to enable universities to identify students who are at risk of delayed graduation. With accurate predictions, institutions are expected to design more targeted academic interventions, such as tutoring, counseling, or other forms of academic support. A total of 396 student records were used in this study and processed through a series of preprocessing steps, including the removal of irrelevant data and the encoding of categorical variables. The model was developed using the Random Forest algorithm with parameters set to max_depth = 15 and random_state = 42. Model performance was evaluated using accuracy, recall, F1-score, and the ROC curve. The results show that the model achieved an accuracy of 89%, with the Pass class having a recall of 87% and an F1-score of 91%, and the Fail class showing a recall of 92% and an F1-score of 84%. Additionally, the Area Under the Curve (AUC) value of 0.94 indicates excellent model performance in distinguishing between students likely to graduate and those at risk of not graduating.
This study confirms that the model is effective in classifying graduation outcomes based on early academic data. For further development, it is recommended to include additional variables such as psychological factors, learning motivation, and socioeconomic conditions. Moreover, tuning the model by adding other parameters—such as n_estimators, min_samples_split, and max_features—is suggested to improve the model’s accuracy and generalizability.
 
5. Publisher Organizing agency, location Program Studi Sistem Informasi Fakultas Teknik dan Ilmu Komputer
 
6. Contributor Sponsor(s) Universitas Islam Indonesia
 
7. Date (YYYY-MM-DD) 2025-09-01
 
8. Type Status & genre Peer-reviewed Article
 
8. Type Type
 
9. Format File format PDF
 
10. Identifier Uniform Resource Identifier https://sistemasi.org/index.php/stmsi/article/view/5331
 
11. Source Title; vol., no. (year) Sistemasi: Jurnal Sistem Informasi; Vol 14, No 5 (2025): Sistemasi: Jurnal Sistem Informasi
 
12. Language English=en id
 
13. Relation Supp. Files pemeriksaan turnitin (709KB)
 
14. Coverage Geo-spatial location, chronological period, research sample (gender, age, etc.)
 
15. Rights Copyright and permissions Copyright (c) 2025 Sistemasi: Jurnal Sistem Informasi