Imbalanced Multi-class Prediction of Student Drop-out and Graduation
A Systematic Literature Review
DOI:
https://doi.org/10.7160/eriesj.2026.190106Keywords:
Higher Education, Imbalanced Classification, Learning Analytics, Machine Learning, Responsibility, Student DropoutAbstract
Student study status prediction, including drop-out and graduation, is a widely studied topic in higher education. Yet, evidence across studies remains difficult to compare due to differences in targets, imbalance treatment, metrics, and validation strategies. This systematic literature review synthesizes 70 peer‑reviewed articles published between 2017 and 2025 that apply machine learning or deep learning to predict study outcomes under class imbalance. Results reveal a strong dominance of binary targets, while multi‑class experiments are relatively rare, though they better reflect institutional categories and expose larger performance gaps across classes. Reported imbalance handling includes data‑level resampling, algorithm‑level class weighting, and ensemble or hybrid designs, but many studies lack sufficient procedural detail. Evaluation practices vary considerably; studies reporting per-class measures and imbalance-aware metrics, such as macro F1 and balanced accuracy, provide more decision-relevant evidence than those relying mainly on accuracy. Validation strategies range from hold‑out and stratified cross‑validation to nested validation, temporal splits, and external testing, shaping the credibility of reported performance for deployment. We propose an integrative taxonomy linking target formulation, imbalance degree, handling strategy, and evaluation design to enhance intervention efficiency through capacity‑aware prioritization, while strengthening responsibility through transparent reporting, defensible validation, and explicit attention to minority class performance.
References
Alvarado-Uribe, J., Mejía-Almada, P., Masetto Herrera, A. L., Molontay, R., Hilliger, I., Hegde, V., Montemayor Gallegos, J. E., Ramírez Díaz, R. A. and Ceballos, H. G. (2022) ‘Student dataset from Tecnologico de Monterrey in Mexico to predict dropout in higher education’, Data, Vol. 7, No. 9, p. 119. https://doi.org/10.3390/data7090119
Alvarez, N. L., Callejas, Z. and Griol, D. (2020) ‘Predicting computer engineering students’ dropout in Cuban higher education with pre-enrollment and early performance data’, Journal of Technology and Science Education, Vol. 10, No. 2, pp. 241–258. https://doi.org/10.3926/jotse.922
Anagnostopoulos, T., Papakyriakopoulos, D., Psaromiligkos, Y. and Retalis, S. (2024) ‘Exploiting LSTM neural network algorithm potentiality for early identification of delayed graduation in higher education’, WSEAS Transactions on Information Science and Applications, Vol. 21, pp. 524–532. https://doi.org/10.37394/23209.2024.21.48
Andrade-Girón, D., Sandivar-Rosas, J., Marín-Rodriguez, W., Susanibar-Ramirez, E., Toro-Dextre, E., Ausejo-Sanchez, J., Villarreal-Torres, H. and Angeles-Morales, J. (2023) ‘Predicting student dropout based on machine learning and deep learning: A systematic review’, EAI Endorsed Transactions on Scalable Information Systems, Vol. 10, No. 5, pp. 1–11. https://doi.org/10.4108/eetsis.3586
Anggrawan, A., Hairani, H. and Satria, C. (2023) ‘Improving SVM classification performance on unbalanced student graduation time data using SMOTE’, International Journal of Information and Education Technology, Vol. 13, No. 2, pp. 289–295. https://doi.org/10.18178/ijiet.2023.13.2.1806
Arumugam, S., Vinodhini, G. and Chandrasekaran, R. M. (2018) ‘Predicting students’ academic performance in the university using meta decision tree classifiers’, Journal of Computer Science, Vol. 14, No. 5, pp. 654–662. https://doi.org/10.3844/jcssp.2018.654.662
Baas, J., Schotten, M., Plume, A., Côté, G. and Karimi, R. (2020) ‘Scopus as a curated, high-quality bibliometric data source for academic research in quantitative science studies’, Quantitative Science Studies, Vol. 1, No. 1, pp. 377–386. https://doi.org/10.1162/qss_a_00019
Barramuño, M., Meza-Narváez, C. and Gálvez-García, G. (2022) ‘Prediction of student attrition risk using machine learning’, Journal of Applied Research in Higher Education, Vol. 14, No. 3, pp. 974–986. https://doi.org/10.1108/JARHE-02-2021-0073
Bedregal-Alpaca, N., Cornejo-Aparicio, V., Zárate-Valderrama, J. and Yanque-Churo, P. (2020) ‘Classification models for determining types of academic risk and predicting dropout in university students’, International Journal of Advanced Computer Science and Applications, Vol. 11, No. 1, pp. 266–272. https://doi.org/10.14569/IJACSA.2020.0110133
Budiman, F., Saputro, I. A., Purwanto, P. and Andono, P. N. (2022) ‘Optimization of classification results by minimizing class imbalance on decision tree algorithm’, in: International Seminar on Machine Learning, Optimization, and Data Science (ISMODE 2021), pp. 6–11. https://doi.org/10.1109/ISMODE53584.2022.9743062
Cañete-Sifuentes, L., Robles, V., Menasalvas, E. and Monroy, R. (2023) ‘Comparing automated machine learning against an off-the-shelf pattern-based classifier in a class imbalance problem: Predicting university dropout’, IEEE Access, Vol. 11, pp. 139147–139156. https://doi.org/10.1109/ACCESS.2023.3336596
Cannistrà, M., Masci, C., Ieva, F., Agasisti, T. and Paganoni, A. M. (2022) ‘Early-predicting dropout of university students: An application of innovative multilevel machine learning and statistical techniques’, Studies in Higher Education, Vol. 47, No. 9, pp. 1935–1956. https://doi.org/10.1080/03075079.2021.2018415
Canto, N. G., De Oliveira, M. A. and De Mattos Veroneze, G. (2022) ‘Supervised learning applied to graduation forecast of industrial engineering students’, European Journal of Educational Research, Vol. 11, No. 1, pp. 325–337. https://doi.org/10.12973/eu-jer.11.1.325
Cho, C. H., Yu, Y. W. and Kim, H. G. (2023) ‘A study on dropout prediction for university students using machine learning’, Applied Sciences, Vol. 13, No. 21, p. 12004. https://doi.org/10.3390/app132112004
Csalódi, R. and Abonyi, J. (2021) ‘Integrated survival analysis and frequent pattern mining for course failure-based prediction of student dropout’, Mathematics, Vol. 9, No. 5, p. 463. https://doi.org/10.3390/math9050463
Cuizon, J. C. (2021) ‘Ensemble predictive model for academic churn risk using plurality voting’, Mindanao Journal of Science and Technology, Vol. 19, No. 1, pp. 224–235. https://doi.org/10.61310/mndjsteect.1028.21
Darenoh, N. V., Bachtiar, F. A. and Perdana, R. S. (2024) ‘Prediction of on-time student graduation with deep learning method’, Journal of ICT Research and Applications, Vol. 18, No. 1, pp. 1–20. https://doi.org/10.5614/itbj.ict.res.appl.2023.18.1.1
Daza, A., Guerra, C., Cervera, N. and Burgos, E. (2022) ‘Predicting academic performance through data mining: A systematic literature review’, TEM Journal, Vol. 11, No. 2, pp. 939–949. https://doi.org/10.18421/TEM112-57
Delen, D., Davazdahemami, B. and Rasouli Dezfouli, E. (2024) ‘Predicting and mitigating freshmen student attrition: A local-explainable machine learning framework’, Information Systems Frontiers, Vol. 26, No. 2, pp. 641–662. https://doi.org/10.1007/s10796-023-10397-3
Deleña, R. D., Dia, N. J., Sacayan, R. R., Sieras, J. C., Khalid, S. A., Macatotong, A. H. T. and Gulam, S. B. (2025) ‘Predicting student retention: A comparative study of machine learning approach utilizing sociodemographic and academic factors’, Systems and Soft Computing, Vol. 7, p. 200352. https://doi.org/10.1016/j.sasc.2025.200352
de la Cruz Huayanay, A., Bazán, J. L. and Russo, C. M. (2024) ‘Performance of evaluation metrics for classification in imbalanced data’, Computational Statistics, Vol. 39, No. 3, pp. 1447–1473. https://doi.org/10.1007/s00180-024-01539-5
Delogu, M., Lagravinese, R., Paolini, D. and Resce, G. (2024) ‘Predicting dropout from higher education: Evidence from Italy’, Economic Modelling, Vol. 130, p. 106583. https://doi.org/10.1016/j.econmod.2023.106583
Febro, J. D. (2019) ‘Utilizing feature selection in identifying predicting factors of student retention’, International Journal of Advanced Computer Science and Applications, Vol. 10, No. 9, pp. 269–274. https://doi.org/10.14569/IJACSA.2019.0100934
Fernandez-Garcia, A. J., Preciado, J. C., Melchor, F., Rodriguez-Echeverria, R., Conejero, J. M. and Sanchez-Figueroa, F. (2021) ‘A real-life machine learning experience for predicting university dropout at different stages using academic data’, IEEE Access, Vol. 9, pp. 133076–133090. https://doi.org/10.1109/ACCESS.2021.3115851
Fontana, L., Masci, C., Ieva, F. and Paganoni, A. M. (2021) ‘Performing learning analytics via generalised mixed-effects trees’, Data, Vol. 6, No. 7, p. 74. https://doi.org/10.3390/data6070074
Freitas, F. A. D. S., Vasconcelos, F. F. X., Peixoto, S. A., Hassan, M. M., Ali Akber Dewan, M., de Albuquerque, V. H. C. and Rebouças Filho, P. P. (2020) ‘IoT system for school dropout prediction using machine learning techniques based on socioeconomic data’, Electronics, Vol. 9, No. 10, p. 1613. https://doi.org/10.3390/electronics9101613
Gonzalez-Nucamendi, A., Noguez, J., Neri, L., Robledo-Rella, V. and García-Castelán, R. M. G. (2023) ‘Predictive analytics study to determine undergraduate students at risk of dropout’, Frontiers in Education, Vol. 8, p. 1244686. https://doi.org/10.3389/feduc.2023.1244686
Goran, R., Jovanovic, L., Bacanin, N., Stanković, M. S., Simic, V., Antonijevic, M. and Zivkovic, M. (2024) ‘Identifying and understanding student dropouts using metaheuristic optimized classifiers and explainable artificial intelligence techniques’, IEEE Access, Vol. 12, pp. 122377–122400. https://doi.org/10.1109/ACCESS.2024.3446653
Gutierrez-Pachas, D. A., Garcia-Zanabria, G., Cuadros-Vargas, E., Camara-Chavez, G. and Gomez-Nieto, E. (2023) ‘Supporting decision-making process on higher education dropout by analyzing academic, socioeconomic, and equity factors through machine learning and survival analysis methods in the Latin American context’, Education Sciences, Vol. 13, No. 2, p. 154. https://doi.org/10.3390/educsci13020154
Haerani, E., Syafria, F., Lestari, F., Novriyanto, N. and Marzuki, I. (2023) ‘Classification academic data using machine learning for decision making process’, Journal of Applied Engineering and Technological Science (JAETS), Vol. 4, No. 2, pp. 955–968. https://doi.org/10.37385/jaets.v4i2.1983
Hammoodi, M. S. and Al-Azawei, A. (2022) ‘Using socio-demographic information in predicting students’ degree completion based on a dynamic model’, International Journal of Intelligent Engineering and Systems, Vol. 15, No. 2, pp. 107–115. https://doi.org/10.22266/ijies2022.0430.11
Hammoudi Halat, D., Abdel-Salam, A.-S. G., Bensaid, A., Soltani, A., Alsarraj, L., Dalli, R. and Malki, A. (2023) ‘Use of machine learning to assess factors affecting progression, retention, and graduation in first-year health professions students in Qatar: A longitudinal study’, BMC Medical Education, Vol. 23, No. 1, p. 909. https://doi.org/10.1186/s12909-023-04887-w
Helbach, J., Pieper, D., Mathes, T., Rombey, T., Zeeb, H., Allers, K. and Hoffmann, F. (2022) ‘Restrictions and their reporting in systematic reviews of effectiveness: An observational study’, BMC Medical Research Methodology, Vol. 22, No. 1, p. 230. https://doi.org/10.1186/s12874-022-01710-w
Herianto, H., Kurniawan, B., Hartomi, Z. H., Irawan, Y. and Anam, M. K. (2024) ‘Machine learning algorithm optimization using stacking technique for graduation prediction’, Journal of Applied Data Sciences, Vol. 5, No. 3, pp. 1272–1285. https://doi.org/10.47738/jads.v5i3.316
Hooper, S. E., Ragland, N. and Artemiou, E. (2025) ‘Random forest models reveal academic and financial factors outweigh demographics in predicting completion of a year-round veterinary program’, Journal of the American Veterinary Medical Association, Vol. 263, No. 2, pp. 1–9. https://doi.org/10.2460/javma.24.08.0501
Hoyos Osorio, J. K. and Daza Santacoloma, G. (2023) ‘Predictive model to identify college students with high dropout rates’, Revista Electrónica de Investigación Educativa, Vol. 25, pp. 1–10. https://doi.org/10.24320/redie.2023.25.e13.5398
Kaensar, C. and Wongnin, W. (2023) ‘Predicting new student performances and identifying important attributes of admission data using machine learning techniques with hyperparameter tuning’, Eurasia Journal of Mathematics, Science and Technology Education, Vol. 19, No. 12, p. 2369. https://doi.org/10.29333/ejmste/13863
Kim, S., Choi, E., Jun, Y.-K. and Lee, S. (2023) ‘Student dropout prediction for university with high precision and recall’, Applied Sciences, Vol. 13, No. 10, p. 6275. https://doi.org/10.3390/app13106275
Kitchenham, B. (2004) Procedures for performing systematic reviews, Keele: Keele University, pp. 1–26.
Kurniadi, D., Abdurachman, E., Warnars, H. L. H. S. and Suparta, W. (2021) ‘Predicting student performance with multi-level representation in an intelligent academic recommender system using backpropagation neural network’, ICIC Express Letters, Part B: Applications, Vol. 12, No. 10, pp. 883–890. https://doi.org/10.24507/icicelb.12.10.883
Luque, A., Carrasco, A., Martín, A. and de las Heras, A. (2019) ‘The impact of class imbalance in classification performance metrics based on the binary confusion matrix’, Pattern Recognition, Vol. 91, pp. 216–231. https://doi.org/10.1016/j.patcog.2019.02.023
Martins, M. V., Baptista, L., Machado, J. and Realinho, V. (2023) ‘Multi-class phased prediction of academic performance and dropout in higher education’, Applied Sciences, Vol. 13, No. 8, p. 4702. https://doi.org/10.3390/app13084702
Martins, M. V., Tolledo, D., Machado, J., Baptista, L. M. T. and Realinho, V. (2021) ‘Early prediction of student’s performance in higher education: A case study’, in: Rocha, Á., Adeli, H., Reis, L. P. and Costanzo, S. (eds.), Trends and Applications in Information Systems and Technologies, Cham: Springer, pp. 166–175. https://doi.org/10.1007/978-3-030-72657-7_16
Matz, S. C., Bukow, C. S., Peters, H., Deacons, C., Dinu, A. and Stachl, C. (2023) ‘Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics’, Scientific Reports, Vol. 13, No. 1, p. 5705. https://doi.org/10.1038/s41598-023-32484-w
Mongeon, P. and Paul-Hus, A. (2016) ‘The journal coverage of Web of Science and Scopus: A comparative analysis’, Scientometrics, Vol. 106, No. 1, pp. 213–228. https://doi.org/10.1007/s11192-015-1765-5
Moreira da Silva, D. E., Solteiro Pires, E. J., Reis, A., de Moura Oliveira, P. B. and Barroso, J. (2022) ‘Forecasting students dropout: A UTAD university study’, Future Internet, Vol. 14, No. 3, p. 76. https://doi.org/10.3390/fi14030076
Mouchantaf, N. and Chamoun, M. (2023) ‘Predicting student dropout with minimal information’, Iraqi Journal of Science, Vol. 64, No. 10, pp. 5265–5279. https://doi.org/10.24996/ijs.2023.64.10.33
Nagy, M. and Molontay, R. (2024) ‘Interpretable dropout prediction: Towards XAI-based personalized intervention’, International Journal of Artificial Intelligence in Education, Vol. 34, No. 2, pp. 274–300. https://doi.org/10.1007/s40593-023-00331-8
Nanglae, L., Iam-On, N., Boongoen, T., Kaewchay, K. and Mullaney, J. (2021) ‘Determining patterns of student graduation using a bi-level learning framework’, Bulletin of Electrical Engineering and Informatics, Vol. 10, No. 4, pp. 2201–2211. https://doi.org/10.11591/eei.v10i4.2502
Ndunagu, J. N., Oyewola, D. O., Garki, F. S., Onyeakazi, J. C., Ezeanya, C. U. and Ukwandu, E. (2024) ‘Deep learning for predicting attrition rate in open and distance learning (ODL) institutions’, Computers, Vol. 13, No. 9, p. 229. https://doi.org/10.3390/computers13090229
Nguyen Thi Cam, H., Sarlan, A. and Arshad, N. I. (2024) ‘A hybrid model integrating recurrent neural networks and the semi-supervised support vector machine for identification of early student dropout risk’, PeerJ Computer Science, Vol. 10, p. e2572. https://doi.org/10.7717/peerj-cs.2572
Niyogisubizo, J., Liao, L., Nziyumva, E., Murwanashyaka, E. and Nshimyumukiza, P. C. (2022) ‘Predicting student’s dropout in university classes using two-layer ensemble machine learning approach: A novel stacked generalization’, Computers and Education: Artificial Intelligence, Vol. 3, p. 100066. https://doi.org/10.1016/j.caeai.2022.100066
Nuanmeesri, S., Poomhiran, L., Chopvitayakun, S. and Kadmateekarun, P. (2022) ‘Improving dropout forecasting during the COVID-19 pandemic through feature selection and multilayer perceptron neural network’, International Journal of Information and Education Technology, Vol. 12, No. 9, pp. 851–857. https://doi.org/10.18178/ijiet.2022.12.9.1693
Okewu, E., Adewole, P., Misra, S., Maskeliunas, R. and Damasevicius, R. (2021) ‘Artificial neural networks for educational data mining in higher education: A systematic literature review’, Applied Artificial Intelligence, Vol. 35, No. 13, pp. 983–1021. https://doi.org/10.1080/08839514.2021.1922847
Okoye, K., Nganji, J. T., Escamilla, J. and Hosseini, S. (2024) ‘Machine learning model (RG-DMML) and ensemble algorithm for prediction of students’ retention and graduation in education’, Computers and Education: Artificial Intelligence, Vol. 6, p. 100205. https://doi.org/10.1016/j.caeai.2024.100205
de Oliveira, C. F., Sobral, S. R., Ferreira, M. J. and Moreira, F. (2021) ‘How does learning analytics contribute to prevent students’ dropout in higher education: A systematic literature review’, Big Data and Cognitive Computing, Vol. 5, No. 4, p. 64. https://doi.org/10.3390/bdcc5040064
Opazo, D., Moreno, S., Álvarez-Miranda, E. and Pereira, J. (2021) ‘Analysis of first-year university student dropout through machine learning models: A comparison between universities’, Mathematics, Vol. 9, No. 20, p. 2599. https://doi.org/10.3390/math9202599
Oqaidi, K., Aouhassi, S. and Mansouri, K. (2025) ‘Predicting graduation in Moroccan open-access bachelors: Early indicators and re-enrollment data’, Bulletin of Electrical Engineering and Informatics, Vol. 14, No. 1, pp. 524–532. https://doi.org/10.11591/eei.v14i1.8580
Ortigosa, A., Carro, R. M., Bravo-Agapito, J., Lizcano, D., Alcolea, J. J. and Blanco, Ó. (2019) ‘From lab to production: Lessons learnt and real-life challenges of an early student-dropout prevention system’, IEEE Transactions on Learning Technologies, Vol. 12, No. 2, pp. 264–277. https://doi.org/10.1109/TLT.2019.2911608
Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., Shamseer, L., Tetzlaff, J. M., Akl, E. A., Brennan, S. E., Chou, R., Glanville, J., Grimshaw, J. M., Hróbjartsson, A., Lalu, M. M., Li, T., Loder, E. W., Mayo-Wilson, E., McDonald, S., McGuinness, L. A., Stewart, L. A., Thomas, J., Tricco, A. C., Welch, V. A., Whiting, P. and Moher, D. (2021) ‘The PRISMA 2020 statement: An updated guideline for reporting systematic reviews’, BMJ, p. n71. https://doi.org/10.1136/bmj.n71
Palacios, C. A., Reyes-Suárez, J. A., Bearzotti, L. A., Leiva, V. and Marchant, C. (2021) ‘Knowledge discovery for higher education student retention based on data mining: Machine learning algorithms and case study in Chile’, Entropy, Vol. 23, No. 4, p. 485. https://doi.org/10.3390/e23040485
Pelima, L. R., Sukmana, Y. and Rosmansyah, Y. (2024) ‘Predicting university student graduation using academic performance and machine learning: A systematic literature review’, IEEE Access, Vol. 12, pp. 23451–23465. https://doi.org/10.1109/ACCESS.2024.3361479
Phan, M., De Caigny, A. and Coussement, K. (2023) ‘A decision support framework to incorporate textual data for early student dropout prediction in higher education’, Decision Support Systems, Vol. 168, p. 113940. https://doi.org/10.1016/j.dss.2023.113940
Quimiz-Moreira, M., Delgadillo, R., Parraga-Alava, J., Maculan, N. and Mauricio, D. (2025) ‘Factors, prediction, explainability, and simulating university dropout through machine learning: A systematic review, 2012–2024’, Computation, Vol. 13, No. 8, p. 198. https://doi.org/10.3390/computation13080198
Rabelo, A. M. and Zárate, L. E. (2025) ‘A model for predicting dropout of higher education students’, Data Science and Management, Vol. 8, No. 1, pp. 72–85. https://doi.org/10.1016/j.dsm.2024.07.001
Realinho, V., Martins, M. V., Machado, J. and Baptista, L. (2021) Predict students’ dropout and academic success [Dataset], UCI Machine Learning Repository. https://doi.org/10.24432/C5MC89
Rethlefsen, M. L., Kirtley, S., Waffenschmidt, S., Ayala, A. P., Moher, D., Page, M. J., Koffel, J. B., Blunt, H., Brigham, T., Chang, S., Clark, J., Conway, A., Couban, R., de Kock, S., Farrah, K., Fehrmann, P., Foster, M., Fowler, S. A., Glanville, J., Harris, E., Hoffecker, L., Isojarvi, J., Kaunelis, D., Ket, H., Levay, P., Lyon, J., McGowan, J., Murad, M. H., Nicholson, J., Pannabecker, V., Paynter, R., Pinotti, R., Ross-White, A., Sampson, M., Shields, T., Stevens, A., Sutton, A., Weinfurter, E., Wright, K. and Young, S. (2021) ‘PRISMA-S: An extension to the PRISMA statement for reporting literature searches in systematic reviews’, Systematic Reviews, Vol. 10, No. 1, p. 39. https://doi.org/10.1186/s13643-020-01542-z
Rodríguez-Muñiz, L. J., Bernardo, A. B., Esteban, M. and Díaz, I. (2019) ‘Dropout and transfer paths: What are the risky profiles when analyzing university persistence with machine learning techniques?’, PLoS ONE, Vol. 14, No. 6, p. e0218796. https://doi.org/10.1371/journal.pone.0218796
Rose, A. L. P. J. and Mary, A. C. (2022) ‘An early intervention technique for at-risk prediction of higher education students in cloud-based virtual learning environment using classification algorithms during COVID-19’, International Journal of Advanced Computer Science and Applications, Vol. 13, No. 1, pp. 612–621. https://doi.org/10.14569/IJACSA.2022.0130174
Roslan, N., Jamil, J. M., Shaharanee, I. N. M. and Sultan Alawi, S. J. (2024) ‘Prediction of student dropout in Malaysian’s private higher education institute using data mining application’, Journal of Advanced Research in Applied Sciences and Engineering Technology, Vol. 45, No. 2, pp. 168–176. https://doi.org/10.37934/araset.45.2.168176
Rovira, S., Puertas, E. and Igual, L. (2017) ‘Data-driven system to predict academic grades and dropout’, PLoS ONE, Vol. 12, No. 2, p. e0171207. https://doi.org/10.1371/journal.pone.0171207
Salam, A. and Zeniarja, J. (2023) ‘Classification of deep learning convolutional neural network feature extraction for student graduation prediction’, Indonesian Journal of Electrical Engineering and Computer Science, Vol. 32, No. 1, p. 335. https://doi.org/10.11591/ijeecs.v32.i1.pp335-341
Salinas-Chipana, J., Obregon-Palomino, L., Iparraguirre-Villanueva, O. and Cabanillas-Carbonell, M. (2024) ‘Machine learning models for predicting student dropout—a review’, in: Yang, X.-S., Sherratt, R. S., Dey, N. and Joshi, A. (eds.), Proceedings of Eighth International Congress on Information and Communication Technology, Singapore: Springer Nature Singapore, pp. 1003–1014. https://doi.org/10.1007/978-981-99-3043-2_83
Sandoval-Palis, I., Naranjo, D., Vidal, J. and Gilar-Corbi, R. (2020) ‘Early dropout prediction model: A case study of university leveling course students’, Sustainability, Vol. 12, No. 22, p. 9314. https://doi.org/10.3390/su12229314
Sani, N. S., Fikri, A., Ali, Z., Zakree, M. and Nadiyah, K. (2020) ‘Drop-out prediction in higher education among B40 students’, International Journal of Advanced Computer Science and Applications, Vol. 11, No. 11, pp. 550–559. https://doi.org/10.14569/IJACSA.2020.0111169
Sayed, M. (2024) ‘Student progression and dropout rates using convolutional neural network: A case study of the Arab Open University’, Journal of Advanced Computational Intelligence and Intelligent Informatics, Vol. 28, No. 3, pp. 668–678. https://doi.org/10.20965/jaciii.2024.p0668
Segura, M., Mello, J. and Hernández, A. (2022) ‘Machine learning prediction of university student dropout: Does preference play a key role?’, Mathematics, Vol. 10, No. 18, p. 3359. https://doi.org/10.3390/math10183359
Setiadi, H., Sanjaya, K., Wijayanto, A., Wardhani, D. W. and Cahyono, H. D. (2024) ‘Comparative analysis of classification algorithms using feature selection techniques to predict on-time student graduation’, Ingénierie des systèmes d’information, Vol. 29, No. 4, pp. 1365–1379. https://doi.org/10.18280/isi.290412
Setiawan, R., Nursasongko, E., Syukur, A., Budiman, F. and Kurniadi, D. (2025) ‘Handling class imbalance in student success prediction using machine learning: A comparison of SMOTE and SMOTETomek’, in: 2025 International Conference on Smart Computing, IoT and Machine Learning (SIML), pp. 1–6. https://doi.org/10.1109/SIML65326.2025.11081128
Song, Z., Sung, S.-H., Park, D.-M. and Park, B.-K. (2023) ‘All-year dropout prediction modeling and analysis for university students’, Applied Sciences, Vol. 13, No. 2, p. 1143. https://doi.org/10.3390/app13021143
Tsai, S.-C., Chen, C.-H., Shiao, Y.-T., Ciou, J.-S. and Wu, T.-N. (2020) ‘Precision education with statistical learning and deep learning: A case study in Taiwan’, International Journal of Educational Technology in Higher Education, Vol. 17, No. 1, p. 12. https://doi.org/10.1186/s41239-020-00186-2
Uliyan, D., Aljaloud, A. S., Alkhalil, A., Amer, H. S. Al, Mohamed, M. A. E. A. and Alogali, A. F. M. (2021) ‘Deep learning model to predict students retention using BLSTM and CRF’, IEEE Access, Vol. 9, pp. 135550–135558. https://doi.org/10.1109/ACCESS.2021.3117117
Vaarma, M. and Li, H. (2024) ‘Predicting student dropouts with machine learning: An empirical study in Finnish higher education’, Technology in Society, Vol. 76, p. 102474. https://doi.org/10.1016/j.techsoc.2024.102474
Vega, H., Sanez, E., De La Cruz, P., Moquillaza, S. and Pretell, J. (2022) ‘Intelligent system to predict university students dropout’, International Journal of Online and Biomedical Engineering (iJOE), Vol. 18, No. 7, pp. 27–43. https://doi.org/10.3991/ijoe.v18i07.30195
Véliz Palomino, J. C. and Ortega, A. M. (2023) ‘Dropout intentions in higher education: Systematic literature review’, Journal on Efficiency and Responsibility in Education and Science, Vol. 16, No. 2, pp. 149–158. https://doi.org/10.7160/eriesj.2023.160206
Vidal, J., Gilar-Corbi, R., Pozo-Rico, T., Castejón, J.-L. and Sánchez-Almeida, T. (2022) ‘Predictors of university attrition: Looking for an equitable and sustainable higher education’, Sustainability, Vol. 14, No. 17, p. 10994. https://doi.org/10.3390/su141710994
Villar, A. and de Andrade, C. R. V. (2024) ‘Supervised machine learning algorithms for predicting student dropout and academic success: A comparative study’, Discover Artificial Intelligence, Vol. 4, No. 1, p. 2. https://doi.org/10.1007/s44163-023-00079-z
Villegas-Ch, W., Govea, J. and Revelo-Tapia, S. (2023) ‘Improving student retention in institutions of higher education through machine learning: A sustainable approach’, Sustainability, Vol. 15, No. 19, p. 14512. https://doi.org/10.3390/su151914512
Won, H.-S., Kim, M.-J., Kim, D., Kim, H.-S. and Kim, K.-M. (2023) ‘University student dropout prediction using pretrained language models’, Applied Sciences, Vol. 13, No. 12, p. 7073. https://doi.org/10.3390/app13127073
Yaqin, A., Laksito, A. D. and Fatonah, S. (2021) ‘Evaluation of backpropagation neural network models for early prediction of student’s graduation in XYZ University’, International Journal on Advanced Science, Engineering and Information Technology, Vol. 11, No. 2, pp. 610–617. https://doi.org/10.18517/ijaseit.11.2.11152
Yaqin, A., Rahardi, M. and Abdulloh, F. F. (2022) ‘Accuracy enhancement of prediction method using SMOTE for early prediction student’s graduation in XYZ University’, International Journal of Advanced Computer Science and Applications, Vol. 13, No. 6, pp. 418–424. https://doi.org/10.14569/IJACSA.2022.0130652
Zanellati, A., Zingaro, S. P. and Gabbrielli, M. (2024) ‘Balancing performance and explainability in academic dropout prediction’, IEEE Transactions on Learning Technologies, Vol. 17, pp. 2086–2099. https://doi.org/10.1109/TLT.2024.3425959
Additional Files
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Ridwan Setiawan, Edi Noersasongko, Abdul Syukur, Fikri Budiman, Dede Kurniadi

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors declare with this manuscript intended for publication to ERIES Journal that:
- all co-authors agree with the publication of the manuscript even after amendments arising from peer review;
- all co-authors agree with the posting of the full text of this work on the web page of ERIES Journal and to the inclusion of references in databases accessible on the internet;
- no results of other researchers were used in the submitted manuscript without their consent, proper citation, or acknowledgement of their cooperation or material provided;
- the results (or any part of them) used in the manuscript have not been sent for publication to any other journal nor have they already been published (or if so, that the relevant works are cited in this manuscript);
- submission of the manuscript for publication was completed in accordance with the publishing regulations pertaining to place of work;
- experiments performed comply with current laws and written consent of the Scientific Ethics Committee / National Animal Care Authority (as is mentioned in the manuscript submitted);
- grant holders confirm that they have been informed of the submitted manuscript and they agree to its publication.
Authors retain copyright and grant ERIES Journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the published work with an acknowledgement of its initial publication in ERIES Journal. Moreover, authors are able to post the published work in an institutional repository with an acknowledgement of its initial publication in ERIES Journal. In addition, authors are permitted and encouraged to post the published work online (e.g. institutional repositories or on their website) as it can lead to productive exchanges, as well as earlier and greater citation of published work.



