A Systematic Approach to Predicting Students' Academic Performance

A Review of Recent Literature

Authors

Yata Mones, A.

DOI:

https://doi.org/10.7160/eriesj.2026.190101

Keywords:

Academic Performance Prediction, Systematic Literature Review, Machine Learning, Data-Driven Education, Academic Intervention

Abstract

The rapid expansion of digital learning has generated large volumes of educational data, creating new opportunities to apply machine learning (ML) and data mining techniques to predict student academic performance. This study synthesizes 58 empirical studies that used Decision Trees, Random Forests, Support Vector Machines, Logistic Regression, and Artificial Neural Networks to identify at-risk students and improve educational outcomes.

The review focuses on predictor variables, validation methods, accuracy rates, and performance metrics. Findings suggest that the most effective predictive models combine four categories of variables: demographic factors, academic indicators, digital behavioral features, and psychosocial attributes. Among the algorithms examined, Random Forest and Artificial Neural Networks demonstrated the strongest predictive performance, with reported accuracy rates of 85%–93% under both k-fold cross-validation and train-test split validation.
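The two validation schemes named above differ only in how records are partitioned into training and test sets. The following is a minimal sketch in plain Python, not the procedure of any reviewed study: the function names, the 20% hold-out fraction, and the fixed seed are illustrative assumptions.

```python
import random


def k_fold_indices(n_samples, k, seed=0):
    """Yield (train_idx, test_idx) pairs for k-fold cross-validation.

    Hypothetical helper: every record lands in exactly one test fold.
    """
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Spread the remainder so fold sizes differ by at most one record.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size


def train_test_split_indices(n_samples, test_fraction=0.2, seed=0):
    """Return one (train_idx, test_idx) hold-out split.

    The 0.2 default is an assumption, not a value taken from the review.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    n_test = max(1, round(n_samples * test_fraction))
    return indices[n_test:], indices[:n_test]


def accuracy(y_true, y_pred):
    """Share of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
```

With k-fold validation, a model would be fitted once per fold and the k accuracy values averaged; a single hold-out split yields one estimate, which tends to vary more with the choice of split on small datasets.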

Performance measures such as precision, recall, F1 score, and AUC further confirm the robustness and generalizability of these models. ML-based academic prediction systems can strengthen early warning systems, support data-driven policymaking, and enable personalized learning interventions. The study concludes that combining multidimensional predictors with explainable AI can improve equity, personalization, operational efficiency, and accountability in educational decision-making.
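As an illustration of the metrics listed above, the sketch below computes precision, recall, F1 score, and AUC for a binary at-risk label. It is a plain-Python sketch under stated assumptions: the function names are hypothetical, the positive class is encoded as 1, and AUC is computed with the pairwise ranking definition, in which ties count as half.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for a binary classification task."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    return tp, fp, fn, tn


def precision_recall_f1(y_true, y_pred, positive=1):
    """Precision, recall, and F1; each is 0.0 when its denominator is zero."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred, positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def roc_auc(y_true, scores, positive=1):
    """AUC as the chance that a random positive outscores a random negative."""
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Made-up labels for illustration only: 1 marks an at-risk student.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.3, 0.7, 0.4, 0.2, 0.1, 0.05]
precision, recall, f1 = precision_recall_f1(y_true, y_pred)
auc = roc_auc(y_true, scores)
```

Reporting these alongside accuracy matters when at-risk students form a small share of the cohort, because a model that labels nearly everyone as not at risk can still score a high accuracy while its recall stays low.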

Published

2026-03-31

How to Cite

Yata Mones, A. (2026) ‘A Systematic Approach to Predicting Students’ Academic Performance: A Review of Recent Literature’, Journal on Efficiency and Responsibility in Education and Science, vol. 19, no. 1, pp. 1–14. https://doi.org/10.7160/eriesj.2026.190101