Imbalanced Multi-class Prediction of Student Drop-out and Graduation

A Systematic Literature Review

Authors

  • Ridwan Setiawan Institut Teknologi Garut, Indonesia; Universitas Dian Nuswantoro, Indonesia https://orcid.org/0000-0002-5229-8561
  • Edi Noersasongko Universitas Dian Nuswantoro
  • Abdul Syukur Universitas Dian Nuswantoro
  • Fikri Budiman Universitas Dian Nuswantoro
  • Dede Kurniadi Institut Teknologi Garut

DOI:

https://doi.org/10.7160/eriesj.2026.190106

Keywords:

Higher Education, Imbalanced Classification, Learning Analytics, Machine Learning, Responsibility, Student Dropout

Abstract

Student study status prediction, including drop-out and graduation, is a widely studied topic in higher education. Yet, evidence across studies remains difficult to compare due to differences in targets, imbalance treatment, metrics, and validation strategies. This systematic literature review synthesizes 70 peer‑reviewed articles published between 2017 and 2025 that apply machine learning or deep learning to predict study outcomes under class imbalance. Results reveal a strong dominance of binary targets, while multi‑class experiments are relatively rare, though they better reflect institutional categories and expose larger performance gaps across classes. Reported imbalance handling includes data‑level resampling, algorithm‑level class weighting, and ensemble or hybrid designs, but many studies lack sufficient procedural detail. Evaluation practices vary considerably; studies reporting per-class measures and imbalance-aware metrics, such as macro F1 and balanced accuracy, provide more decision-relevant evidence than those relying mainly on accuracy. Validation strategies range from hold‑out and stratified cross‑validation to nested validation, temporal splits, and external testing, shaping the credibility of reported performance for deployment. We propose an integrative taxonomy linking target formulation, imbalance degree, handling strategy, and evaluation design to enhance intervention efficiency through capacity‑aware prioritization, while strengthening responsibility through transparent reporting, defensible validation, and explicit attention to minority class performance.

References

Alvarado-Uribe, J., Mejía-Almada, P., Masetto Herrera, A. L., Molontay, R., Hilliger, I., Hegde, V., Montemayor Gallegos, J. E., Ramírez Díaz, R. A. and Ceballos, H. G. (2022) ‘Student dataset from Tecnologico de Monterrey in Mexico to predict dropout in higher education’, Data, Vol. 7, No. 9, p. 119. https://doi.org/10.3390/data7090119

Alvarez, N. L., Callejas, Z. and Griol, D. (2020) ‘Predicting computer engineering students’ dropout in Cuban higher education with pre-enrollment and early performance data’, Journal of Technology and Science Education, Vol. 10, No. 2, pp. 241–258. https://doi.org/10.3926/jotse.922

Anagnostopoulos, T., Papakyriakopoulos, D., Psaromiligkos, Y. and Retalis, S. (2024) ‘Exploiting LSTM neural network algorithm potentiality for early identification of delayed graduation in higher education’, WSEAS Transactions on Information Science and Applications, Vol. 21, pp. 524–532. https://doi.org/10.37394/23209.2024.21.48

Andrade-Girón, D., Sandivar-Rosas, J., Marín-Rodriguez, W., Susanibar-Ramirez, E., Toro-Dextre, E., Ausejo-Sanchez, J., Villarreal-Torres, H. and Angeles-Morales, J. (2023) ‘Predicting student dropout based on machine learning and deep learning: A systematic review’, EAI Endorsed Transactions on Scalable Information Systems, Vol. 10, No. 5, pp. 1–11. https://doi.org/10.4108/eetsis.3586

Anggrawan, A., Hairani, H. and Satria, C. (2023) ‘Improving SVM classification performance on unbalanced student graduation time data using SMOTE’, International Journal of Information and Education Technology, Vol. 13, No. 2, pp. 289–295. https://doi.org/10.18178/ijiet.2023.13.2.1806

Arumugam, S., Vinodhini, G. and Chandrasekaran, R. M. (2018) ‘Predicting students’ academic performance in the university using meta decision tree classifiers’, Journal of Computer Science, Vol. 14, No. 5, pp. 654–662. https://doi.org/10.3844/jcssp.2018.654.662

Baas, J., Schotten, M., Plume, A., Côté, G. and Karimi, R. (2020) ‘Scopus as a curated, high-quality bibliometric data source for academic research in quantitative science studies’, Quantitative Science Studies, Vol. 1, No. 1, pp. 377–386. https://doi.org/10.1162/qss_a_00019

Barramuño, M., Meza-Narváez, C. and Gálvez-García, G. (2022) ‘Prediction of student attrition risk using machine learning’, Journal of Applied Research in Higher Education, Vol. 14, No. 3, pp. 974–986. https://doi.org/10.1108/JARHE-02-2021-0073

Bedregal-Alpaca, N., Cornejo-Aparicio, V., Zárate-Valderrama, J. and Yanque-Churo, P. (2020) ‘Classification models for determining types of academic risk and predicting dropout in university students’, International Journal of Advanced Computer Science and Applications, Vol. 11, No. 1, pp. 266–272. https://doi.org/10.14569/IJACSA.2020.0110133

Budiman, F., Saputro, I. A., Purwanto, P. and Andono, P. N. (2022) ‘Optimization of classification results by minimizing class imbalance on decision tree algorithm’, in: International Seminar on Machine Learning, Optimization, and Data Science (ISMODE 2021), pp. 6–11. https://doi.org/10.1109/ISMODE53584.2022.9743062

Cañete-Sifuentes, L., Robles, V., Menasalvas, E. and Monroy, R. (2023) ‘Comparing automated machine learning against an off-the-shelf pattern-based classifier in a class imbalance problem: Predicting university dropout’, IEEE Access, Vol. 11, pp. 139147–139156. https://doi.org/10.1109/ACCESS.2023.3336596

Cannistrà, M., Masci, C., Ieva, F., Agasisti, T. and Paganoni, A. M. (2022) ‘Early-predicting dropout of university students: An application of innovative multilevel machine learning and statistical techniques’, Studies in Higher Education, Vol. 47, No. 9, pp. 1935–1956. https://doi.org/10.1080/03075079.2021.2018415

Canto, N. G., De Oliveira, M. A. and De Mattos Veroneze, G. (2022) ‘Supervised learning applied to graduation forecast of industrial engineering students’, European Journal of Educational Research, Vol. 11, No. 1, pp. 325–337. https://doi.org/10.12973/eu-jer.11.1.325

Cho, C. H., Yu, Y. W. and Kim, H. G. (2023) ‘A study on dropout prediction for university students using machine learning’, Applied Sciences, Vol. 13, No. 21, p. 12004. https://doi.org/10.3390/app132112004

Csalódi, R. and Abonyi, J. (2021) ‘Integrated survival analysis and frequent pattern mining for course failure-based prediction of student dropout’, Mathematics, Vol. 9, No. 5, p. 463. https://doi.org/10.3390/math9050463

Cuizon, J. C. (2021) ‘Ensemble predictive model for academic churn risk using plurality voting’, Mindanao Journal of Science and Technology, Vol. 19, No. 1, pp. 224–235. https://doi.org/10.61310/mndjsteect.1028.21

Darenoh, N. V., Bachtiar, F. A. and Perdana, R. S. (2024) ‘Prediction of on-time student graduation with deep learning method’, Journal of ICT Research and Applications, Vol. 18, No. 1, pp. 1–20. https://doi.org/10.5614/itbj.ict.res.appl.2023.18.1.1

Daza, A., Guerra, C., Cervera, N. and Burgos, E. (2022) ‘Predicting academic performance through data mining: A systematic literature review’, TEM Journal, Vol. 11, No. 2, pp. 939–949. https://doi.org/10.18421/TEM112-57

Delen, D., Davazdahemami, B. and Rasouli Dezfouli, E. (2024) ‘Predicting and mitigating freshmen student attrition: A local-explainable machine learning framework’, Information Systems Frontiers, Vol. 26, No. 2, pp. 641–662. https://doi.org/10.1007/s10796-023-10397-3

Deleña, R. D., Dia, N. J., Sacayan, R. R., Sieras, J. C., Khalid, S. A., Macatotong, A. H. T. and Gulam, S. B. (2025) ‘Predicting student retention: A comparative study of machine learning approach utilizing sociodemographic and academic factors’, Systems and Soft Computing, Vol. 7, p. 200352. https://doi.org/10.1016/j.sasc.2025.200352

de la Cruz Huayanay, A., Bazán, J. L. and Russo, C. M. (2024) ‘Performance of evaluation metrics for classification in imbalanced data’, Computational Statistics, Vol. 39, No. 3, pp. 1447–1473. https://doi.org/10.1007/s00180-024-01539-5

Delogu, M., Lagravinese, R., Paolini, D. and Resce, G. (2024) ‘Predicting dropout from higher education: Evidence from Italy’, Economic Modelling, Vol. 130, p. 106583. https://doi.org/10.1016/j.econmod.2023.106583

Febro, J. D. (2019) ‘Utilizing feature selection in identifying predicting factors of student retention’, International Journal of Advanced Computer Science and Applications, Vol. 10, No. 9, pp. 269–274. https://doi.org/10.14569/IJACSA.2019.0100934

Fernandez-Garcia, A. J., Preciado, J. C., Melchor, F., Rodriguez-Echeverria, R., Conejero, J. M. and Sanchez-Figueroa, F. (2021) ‘A real-life machine learning experience for predicting university dropout at different stages using academic data’, IEEE Access, Vol. 9, pp. 133076–133090. https://doi.org/10.1109/ACCESS.2021.3115851

Fontana, L., Masci, C., Ieva, F. and Paganoni, A. M. (2021) ‘Performing learning analytics via generalised mixed-effects trees’, Data, Vol. 6, No. 7, p. 74. https://doi.org/10.3390/data6070074

Freitas, F. A. D. S., Vasconcelos, F. F. X., Peixoto, S. A., Hassan, M. M., Ali Akber Dewan, M., de Albuquerque, V. H. C. and Rebouças Filho, P. P. (2020) ‘IoT system for school dropout prediction using machine learning techniques based on socioeconomic data’, Electronics, Vol. 9, No. 10, p. 1613. https://doi.org/10.3390/electronics9101613

Gonzalez-Nucamendi, A., Noguez, J., Neri, L., Robledo-Rella, V. and García-Castelán, R. M. G. (2023) ‘Predictive analytics study to determine undergraduate students at risk of dropout’, Frontiers in Education, Vol. 8, p. 1244686. https://doi.org/10.3389/feduc.2023.1244686

Goran, R., Jovanovic, L., Bacanin, N., Stanković, M. S., Simic, V., Antonijevic, M. and Zivkovic, M. (2024) ‘Identifying and understanding student dropouts using metaheuristic optimized classifiers and explainable artificial intelligence techniques’, IEEE Access, Vol. 12, pp. 122377–122400. https://doi.org/10.1109/ACCESS.2024.3446653

Gutierrez-Pachas, D. A., Garcia-Zanabria, G., Cuadros-Vargas, E., Camara-Chavez, G. and Gomez-Nieto, E. (2023) ‘Supporting decision-making process on higher education dropout by analyzing academic, socioeconomic, and equity factors through machine learning and survival analysis methods in the Latin American context’, Education Sciences, Vol. 13, No. 2, p. 154. https://doi.org/10.3390/educsci13020154

Haerani, E., Syafria, F., Lestari, F., Novriyanto, N. and Marzuki, I. (2023) ‘Classification academic data using machine learning for decision making process’, Journal of Applied Engineering and Technological Science (JAETS), Vol. 4, No. 2, pp. 955–968. https://doi.org/10.37385/jaets.v4i2.1983

Hammoodi, M. S. and Al-Azawei, A. (2022) ‘Using socio-demographic information in predicting students’ degree completion based on a dynamic model’, International Journal of Intelligent Engineering and Systems, Vol. 15, No. 2, pp. 107–115. https://doi.org/10.22266/ijies2022.0430.11

Hammoudi Halat, D., Abdel-Salam, A.-S. G., Bensaid, A., Soltani, A., Alsarraj, L., Dalli, R. and Malki, A. (2023) ‘Use of machine learning to assess factors affecting progression, retention, and graduation in first-year health professions students in Qatar: A longitudinal study’, BMC Medical Education, Vol. 23, No. 1, p. 909. https://doi.org/10.1186/s12909-023-04887-w

Helbach, J., Pieper, D., Mathes, T., Rombey, T., Zeeb, H., Allers, K. and Hoffmann, F. (2022) ‘Restrictions and their reporting in systematic reviews of effectiveness: An observational study’, BMC Medical Research Methodology, Vol. 22, No. 1, p. 230. https://doi.org/10.1186/s12874-022-01710-w

Herianto, H., Kurniawan, B., Hartomi, Z. H., Irawan, Y. and Anam, M. K. (2024) ‘Machine learning algorithm optimization using stacking technique for graduation prediction’, Journal of Applied Data Sciences, Vol. 5, No. 3, pp. 1272–1285. https://doi.org/10.47738/jads.v5i3.316

Hooper, S. E., Ragland, N. and Artemiou, E. (2025) ‘Random forest models reveal academic and financial factors outweigh demographics in predicting completion of a year-round veterinary program’, Journal of the American Veterinary Medical Association, Vol. 263, No. 2, pp. 1–9. https://doi.org/10.2460/javma.24.08.0501

Hoyos Osorio, J. K. and Daza Santacoloma, G. (2023) ‘Predictive model to identify college students with high dropout rates’, Revista Electrónica de Investigación Educativa, Vol. 25, pp. 1–10. https://doi.org/10.24320/redie.2023.25.e13.5398

Kaensar, C. and Wongnin, W. (2023) ‘Predicting new student performances and identifying important attributes of admission data using machine learning techniques with hyperparameter tuning’, Eurasia Journal of Mathematics, Science and Technology Education, Vol. 19, No. 12, p. 2369. https://doi.org/10.29333/ejmste/13863

Kim, S., Choi, E., Jun, Y.-K. and Lee, S. (2023) ‘Student dropout prediction for university with high precision and recall’, Applied Sciences, Vol. 13, No. 10, p. 6275. https://doi.org/10.3390/app13106275

Kitchenham, B. (2004) Procedures for performing systematic reviews, Keele: Keele University, pp. 1–26.

Kurniadi, D., Abdurachman, E., Warnars, H. L. H. S. and Suparta, W. (2021) ‘Predicting student performance with multi-level representation in an intelligent academic recommender system using backpropagation neural network’, ICIC Express Letters, Part B: Applications, Vol. 12, No. 10, pp. 883–890. https://doi.org/10.24507/icicelb.12.10.883

Luque, A., Carrasco, A., Martín, A. and de las Heras, A. (2019) ‘The impact of class imbalance in classification performance metrics based on the binary confusion matrix’, Pattern Recognition, Vol. 91, pp. 216–231. https://doi.org/10.1016/j.patcog.2019.02.023

Martins, M. V., Baptista, L., Machado, J. and Realinho, V. (2023) ‘Multi-class phased prediction of academic performance and dropout in higher education’, Applied Sciences, Vol. 13, No. 8, p. 4702. https://doi.org/10.3390/app13084702

Martins, M. V., Tolledo, D., Machado, J., Baptista, L. M. T. and Realinho, V. (2021) ‘Early prediction of student’s performance in higher education: A case study’, in: Rocha, Á., Adeli, H., Reis, L. P. and Costanzo, S. (eds.), Trends and Applications in Information Systems and Technologies, Cham: Springer, pp. 166–175. https://doi.org/10.1007/978-3-030-72657-7_16

Matz, S. C., Bukow, C. S., Peters, H., Deacons, C., Dinu, A. and Stachl, C. (2023) ‘Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics’, Scientific Reports, Vol. 13, No. 1, p. 5705. https://doi.org/10.1038/s41598-023-32484-w

Mongeon, P. and Paul-Hus, A. (2016) ‘The journal coverage of Web of Science and Scopus: A comparative analysis’, Scientometrics, Vol. 106, No. 1, pp. 213–228. https://doi.org/10.1007/s11192-015-1765-5

Moreira da Silva, D. E., Solteiro Pires, E. J., Reis, A., de Moura Oliveira, P. B. and Barroso, J. (2022) ‘Forecasting students dropout: A UTAD university study’, Future Internet, Vol. 14, No. 3, p. 76. https://doi.org/10.3390/fi14030076

Mouchantaf, N. and Chamoun, M. (2023) ‘Predicting student dropout with minimal information’, Iraqi Journal of Science, Vol. 64, No. 10, pp. 5265–5279. https://doi.org/10.24996/ijs.2023.64.10.33

Nagy, M. and Molontay, R. (2024) ‘Interpretable dropout prediction: Towards XAI-based personalized intervention’, International Journal of Artificial Intelligence in Education, Vol. 34, No. 2, pp. 274–300. https://doi.org/10.1007/s40593-023-00331-8

Nanglae, L., Iam-On, N., Boongoen, T., Kaewchay, K. and Mullaney, J. (2021) ‘Determining patterns of student graduation using a bi-level learning framework’, Bulletin of Electrical Engineering and Informatics, Vol. 10, No. 4, pp. 2201–2211. https://doi.org/10.11591/eei.v10i4.2502

Ndunagu, J. N., Oyewola, D. O., Garki, F. S., Onyeakazi, J. C., Ezeanya, C. U. and Ukwandu, E. (2024) ‘Deep learning for predicting attrition rate in open and distance learning (ODL) institutions’, Computers, Vol. 13, No. 9, p. 229. https://doi.org/10.3390/computers13090229

Nguyen Thi Cam, H., Sarlan, A. and Arshad, N. I. (2024) ‘A hybrid model integrating recurrent neural networks and the semi-supervised support vector machine for identification of early student dropout risk’, PeerJ Computer Science, Vol. 10, p. e2572. https://doi.org/10.7717/peerj-cs.2572

Niyogisubizo, J., Liao, L., Nziyumva, E., Murwanashyaka, E. and Nshimyumukiza, P. C. (2022) ‘Predicting student’s dropout in university classes using two-layer ensemble machine learning approach: A novel stacked generalization’, Computers and Education: Artificial Intelligence, Vol. 3, p. 100066. https://doi.org/10.1016/j.caeai.2022.100066

Nuanmeesri, S., Poomhiran, L., Chopvitayakun, S. and Kadmateekarun, P. (2022) ‘Improving dropout forecasting during the COVID-19 pandemic through feature selection and multilayer perceptron neural network’, International Journal of Information and Education Technology, Vol. 12, No. 9, pp. 851–857. https://doi.org/10.18178/ijiet.2022.12.9.1693

Okewu, E., Adewole, P., Misra, S., Maskeliunas, R. and Damasevicius, R. (2021) ‘Artificial neural networks for educational data mining in higher education: A systematic literature review’, Applied Artificial Intelligence, Vol. 35, No. 13, pp. 983–1021. https://doi.org/10.1080/08839514.2021.1922847

Okoye, K., Nganji, J. T., Escamilla, J. and Hosseini, S. (2024) ‘Machine learning model (RG-DMML) and ensemble algorithm for prediction of students’ retention and graduation in education’, Computers and Education: Artificial Intelligence, Vol. 6, p. 100205. https://doi.org/10.1016/j.caeai.2024.100205

de Oliveira, C. F., Sobral, S. R., Ferreira, M. J. and Moreira, F. (2021) ‘How does learning analytics contribute to prevent students’ dropout in higher education: A systematic literature review’, Big Data and Cognitive Computing, Vol. 5, No. 4, p. 64. https://doi.org/10.3390/bdcc5040064

Opazo, D., Moreno, S., Álvarez-Miranda, E. and Pereira, J. (2021) ‘Analysis of first-year university student dropout through machine learning models: A comparison between universities’, Mathematics, Vol. 9, No. 20, p. 2599. https://doi.org/10.3390/math9202599

Oqaidi, K., Aouhassi, S. and Mansouri, K. (2025) ‘Predicting graduation in Moroccan open-access bachelors: Early indicators and re-enrollment data’, Bulletin of Electrical Engineering and Informatics, Vol. 14, No. 1, pp. 524–532. https://doi.org/10.11591/eei.v14i1.8580

Ortigosa, A., Carro, R. M., Bravo-Agapito, J., Lizcano, D., Alcolea, J. J. and Blanco, Ó. (2019) ‘From lab to production: Lessons learnt and real-life challenges of an early student-dropout prevention system’, IEEE Transactions on Learning Technologies, Vol. 12, No. 2, pp. 264–277. https://doi.org/10.1109/TLT.2019.2911608

Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., Shamseer, L., Tetzlaff, J. M., Akl, E. A., Brennan, S. E., Chou, R., Glanville, J., Grimshaw, J. M., Hróbjartsson, A., Lalu, M. M., Li, T., Loder, E. W., Mayo-Wilson, E., McDonald, S., McGuinness, L. A., Stewart, L. A., Thomas, J., Tricco, A. C., Welch, V. A., Whiting, P. and Moher, D. (2021) ‘The PRISMA 2020 statement: An updated guideline for reporting systematic reviews’, BMJ, p. n71. https://doi.org/10.1136/bmj.n71

Palacios, C. A., Reyes-Suárez, J. A., Bearzotti, L. A., Leiva, V. and Marchant, C. (2021) ‘Knowledge discovery for higher education student retention based on data mining: Machine learning algorithms and case study in Chile’, Entropy, Vol. 23, No. 4, p. 485. https://doi.org/10.3390/e23040485

Pelima, L. R., Sukmana, Y. and Rosmansyah, Y. (2024) ‘Predicting university student graduation using academic performance and machine learning: A systematic literature review’, IEEE Access, Vol. 12, pp. 23451–23465. https://doi.org/10.1109/ACCESS.2024.3361479

Phan, M., De Caigny, A. and Coussement, K. (2023) ‘A decision support framework to incorporate textual data for early student dropout prediction in higher education’, Decision Support Systems, Vol. 168, p. 113940. https://doi.org/10.1016/j.dss.2023.113940

Quimiz-Moreira, M., Delgadillo, R., Parraga-Alava, J., Maculan, N. and Mauricio, D. (2025) ‘Factors, prediction, explainability, and simulating university dropout through machine learning: A systematic review, 2012–2024’, Computation, Vol. 13, No. 8, p. 198. https://doi.org/10.3390/computation13080198

Rabelo, A. M. and Zárate, L. E. (2025) ‘A model for predicting dropout of higher education students’, Data Science and Management, Vol. 8, No. 1, pp. 72–85. https://doi.org/10.1016/j.dsm.2024.07.001

Realinho, V., Martins, M. V., Machado, J. and Baptista, L. (2021) Predict students’ dropout and academic success [Dataset], UCI Machine Learning Repository. https://doi.org/10.24432/C5MC89

Rethlefsen, M. L., Kirtley, S., Waffenschmidt, S., Ayala, A. P., Moher, D., Page, M. J., Koffel, J. B., Blunt, H., Brigham, T., Chang, S., Clark, J., Conway, A., Couban, R., de Kock, S., Farrah, K., Fehrmann, P., Foster, M., Fowler, S. A., Glanville, J., Harris, E., Hoffecker, L., Isojarvi, J., Kaunelis, D., Ket, H., Levay, P., Lyon, J., McGowan, J., Murad, M. H., Nicholson, J., Pannabecker, V., Paynter, R., Pinotti, R., Ross-White, A., Sampson, M., Shields, T., Stevens, A., Sutton, A., Weinfurter, E., Wright, K. and Young, S. (2021) ‘PRISMA-S: An extension to the PRISMA statement for reporting literature searches in systematic reviews’, Systematic Reviews, Vol. 10, No. 1, p. 39. https://doi.org/10.1186/s13643-020-01542-z

Rodríguez-Muñiz, L. J., Bernardo, A. B., Esteban, M. and Díaz, I. (2019) ‘Dropout and transfer paths: What are the risky profiles when analyzing university persistence with machine learning techniques?’, PLoS ONE, Vol. 14, No. 6, p. e0218796. https://doi.org/10.1371/journal.pone.0218796

Rose, A. L. P. J. and Mary, A. C. (2022) ‘An early intervention technique for at-risk prediction of higher education students in cloud-based virtual learning environment using classification algorithms during COVID-19’, International Journal of Advanced Computer Science and Applications, Vol. 13, No. 1, pp. 612–621. https://doi.org/10.14569/IJACSA.2022.0130174

Roslan, N., Jamil, J. M., Shaharanee, I. N. M. and Sultan Alawi, S. J. (2024) ‘Prediction of student dropout in Malaysian’s private higher education institute using data mining application’, Journal of Advanced Research in Applied Sciences and Engineering Technology, Vol. 45, No. 2, pp. 168–176. https://doi.org/10.37934/araset.45.2.168176

Rovira, S., Puertas, E. and Igual, L. (2017) ‘Data-driven system to predict academic grades and dropout’, PLoS ONE, Vol. 12, No. 2, p. e0171207. https://doi.org/10.1371/journal.pone.0171207

Salam, A. and Zeniarja, J. (2023) ‘Classification of deep learning convolutional neural network feature extraction for student graduation prediction’, Indonesian Journal of Electrical Engineering and Computer Science, Vol. 32, No. 1, p. 335. https://doi.org/10.11591/ijeecs.v32.i1.pp335-341

Salinas-Chipana, J., Obregon-Palomino, L., Iparraguirre-Villanueva, O. and Cabanillas-Carbonell, M. (2024) ‘Machine learning models for predicting student dropout—a review’, in: Yang, X.-S., Sherratt, R. S., Dey, N. and Joshi, A. (eds.), Proceedings of Eighth International Congress on Information and Communication Technology, Singapore: Springer Nature Singapore, pp. 1003–1014. https://doi.org/10.1007/978-981-99-3043-2_83

Sandoval-Palis, I., Naranjo, D., Vidal, J. and Gilar-Corbi, R. (2020) ‘Early dropout prediction model: A case study of university leveling course students’, Sustainability, Vol. 12, No. 22, p. 9314. https://doi.org/10.3390/su12229314

Sani, N. S., Fikri, A., Ali, Z., Zakree, M. and Nadiyah, K. (2020) ‘Drop-out prediction in higher education among B40 students’, International Journal of Advanced Computer Science and Applications, Vol. 11, No. 11, pp. 550–559. https://doi.org/10.14569/IJACSA.2020.0111169

Sayed, M. (2024) ‘Student progression and dropout rates using convolutional neural network: A case study of the Arab Open University’, Journal of Advanced Computational Intelligence and Intelligent Informatics, Vol. 28, No. 3, pp. 668–678. https://doi.org/10.20965/jaciii.2024.p0668

Segura, M., Mello, J. and Hernández, A. (2022) ‘Machine learning prediction of university student dropout: Does preference play a key role?’, Mathematics, Vol. 10, No. 18, p. 3359. https://doi.org/10.3390/math10183359

Setiadi, H., Sanjaya, K., Wijayanto, A., Wardhani, D. W. and Cahyono, H. D. (2024) ‘Comparative analysis of classification algorithms using feature selection techniques to predict on-time student graduation’, Ingénierie des systèmes d’information, Vol. 29, No. 4, pp. 1365–1379. https://doi.org/10.18280/isi.290412

Setiawan, R., Nursasongko, E., Syukur, A., Budiman, F. and Kurniadi, D. (2025) ‘Handling class imbalance in student success prediction using machine learning: A comparison of SMOTE and SMOTETomek’, in: 2025 International Conference on Smart Computing, IoT and Machine Learning (SIML), pp. 1–6. https://doi.org/10.1109/SIML65326.2025.11081128

Song, Z., Sung, S.-H., Park, D.-M. and Park, B.-K. (2023) ‘All-year dropout prediction modeling and analysis for university students’, Applied Sciences, Vol. 13, No. 2, p. 1143. https://doi.org/10.3390/app13021143

Tsai, S.-C., Chen, C.-H., Shiao, Y.-T., Ciou, J.-S. and Wu, T.-N. (2020) ‘Precision education with statistical learning and deep learning: A case study in Taiwan’, International Journal of Educational Technology in Higher Education, Vol. 17, No. 1, p. 12. https://doi.org/10.1186/s41239-020-00186-2

Uliyan, D., Aljaloud, A. S., Alkhalil, A., Amer, H. S. Al, Mohamed, M. A. E. A. and Alogali, A. F. M. (2021) ‘Deep learning model to predict students retention using BLSTM and CRF’, IEEE Access, Vol. 9, pp. 135550–135558. https://doi.org/10.1109/ACCESS.2021.3117117

Vaarma, M. and Li, H. (2024) ‘Predicting student dropouts with machine learning: An empirical study in Finnish higher education’, Technology in Society, Vol. 76, p. 102474. https://doi.org/10.1016/j.techsoc.2024.102474

Vega, H., Sanez, E., De La Cruz, P., Moquillaza, S. and Pretell, J. (2022) ‘Intelligent system to predict university students dropout’, International Journal of Online and Biomedical Engineering (iJOE), Vol. 18, No. 7, pp. 27–43. https://doi.org/10.3991/ijoe.v18i07.30195

Véliz Palomino, J. C. and Ortega, A. M. (2023) ‘Dropout intentions in higher education: Systematic literature review’, Journal on Efficiency and Responsibility in Education and Science, Vol. 16, No. 2, pp. 149–158. https://doi.org/10.7160/eriesj.2023.160206

Vidal, J., Gilar-Corbi, R., Pozo-Rico, T., Castejón, J.-L. and Sánchez-Almeida, T. (2022) ‘Predictors of university attrition: Looking for an equitable and sustainable higher education’, Sustainability, Vol. 14, No. 17, p. 10994. https://doi.org/10.3390/su141710994

Villar, A. and de Andrade, C. R. V. (2024) ‘Supervised machine learning algorithms for predicting student dropout and academic success: A comparative study’, Discover Artificial Intelligence, Vol. 4, No. 1, p. 2. https://doi.org/10.1007/s44163-023-00079-z

Villegas-Ch, W., Govea, J. and Revelo-Tapia, S. (2023) ‘Improving student retention in institutions of higher education through machine learning: A sustainable approach’, Sustainability, Vol. 15, No. 19, p. 14512. https://doi.org/10.3390/su151914512

Won, H.-S., Kim, M.-J., Kim, D., Kim, H.-S. and Kim, K.-M. (2023) ‘University student dropout prediction using pretrained language models’, Applied Sciences, Vol. 13, No. 12, p. 7073. https://doi.org/10.3390/app13127073

Yaqin, A., Laksito, A. D. and Fatonah, S. (2021) ‘Evaluation of backpropagation neural network models for early prediction of student’s graduation in XYZ University’, International Journal on Advanced Science, Engineering and Information Technology, Vol. 11, No. 2, pp. 610–617. https://doi.org/10.18517/ijaseit.11.2.11152

Yaqin, A., Rahardi, M. and Abdulloh, F. F. (2022) ‘Accuracy enhancement of prediction method using SMOTE for early prediction student’s graduation in XYZ University’, International Journal of Advanced Computer Science and Applications, Vol. 13, No. 6, pp. 418–424. https://doi.org/10.14569/IJACSA.2022.0130652

Zanellati, A., Zingaro, S. P. and Gabbrielli, M. (2024) ‘Balancing performance and explainability in academic dropout prediction’, IEEE Transactions on Learning Technologies, Vol. 17, pp. 2086–2099. https://doi.org/10.1109/TLT.2024.3425959

Additional Files

Published

2026-03-31

How to Cite

Setiawan, R., Noersasongko, E., Syukur, A. ., Budiman, F. and Kurniadi, D. (2026) ’Imbalanced Multi-class Prediction of Student Drop-out and Graduation: A Systematic Literature Review’, Journal on Efficiency and Responsibility in Education and Science, vol. 19, no. 1, pp. 72–90. https://doi.org/10.7160/eriesj.2026.190106