Analyzing feature importance for a predictive undergraduate student dropout model
Computer Science and Information Systems, Tome 20 (2023) no. 1.

Voir la notice de l'article provenant de la source Computer Science and Information Systems website

Worldwide, one of the main concerns of universities is to reduce the dropout rate. Several initiatives have been taken to avoid this problem; however, it is essential to recognize at-risk students as early as possible. This article is an extension of a previous study that proposed a predictive model to identify students at risk of dropout from the beginning of their university degree. The new contribution is the analysis of the feature importance for dropout segmented by faculty, degree program, and semester in the different predictive models. In addition, we propose a dropout model based on faculty characteristics to try to infer the dropout based on faculty features. We used data of 30,576 students enrolled in a Higher Education Institution ranging from years 2000 to 2020. The findings indicate that the variables related to Grade Point Average(GPA), socioeconomic factor, and a pass rate of courses taken have a more significant impact on the model, regardless of the semester, faculty, or program. Additionally, we found a significant difference in the predictive power between Science, Technology, Engineering, and Mathematics (STEM) and humanistic programs.
Keywords: dropout model, features importance, data mining, learning analytics
@article{CSIS_2023_20_1_a12,
     author = {Alberto Jim\'enez-Macias and Pedro Manuel Moreno-Marcos and Pedro J. Mu\~noz-Merino and Margarita Ortiz-Rojas and Carlos Delgado Kloos},
     title = {Analyzing feature importance for a predictive undergraduate student dropout model},
     journal = {Computer Science and Information Systems},
     publisher = {mathdoc},
     volume = {20},
     number = {1},
     year = {2023},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2023_20_1_a12/}
}
TY  - JOUR
AU  - Alberto Jiménez-Macias
AU  - Pedro Manuel Moreno-Marcos
AU  - Pedro J. Muñoz-Merino
AU  - Margarita Ortiz-Rojas
AU  - Carlos Delgado Kloos
TI  - Analyzing feature importance for a predictive undergraduate student dropout model
JO  - Computer Science and Information Systems
PY  - 2023
VL  - 20
IS  - 1
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/CSIS_2023_20_1_a12/
ID  - CSIS_2023_20_1_a12
ER  - 
%0 Journal Article
%A Alberto Jiménez-Macias
%A Pedro Manuel Moreno-Marcos
%A Pedro J. Muñoz-Merino
%A Margarita Ortiz-Rojas
%A Carlos Delgado Kloos
%T Analyzing feature importance for a predictive undergraduate student dropout model
%J Computer Science and Information Systems
%D 2023
%V 20
%N 1
%I mathdoc
%U http://geodesic.mathdoc.fr/item/CSIS_2023_20_1_a12/
%F CSIS_2023_20_1_a12
Alberto Jiménez-Macias; Pedro Manuel Moreno-Marcos; Pedro J. Muñoz-Merino; Margarita Ortiz-Rojas; Carlos Delgado Kloos. Analyzing feature importance for a predictive undergraduate student dropout model. Computer Science and Information Systems, Tome 20 (2023) no. 1. http://geodesic.mathdoc.fr/item/CSIS_2023_20_1_a12/