Identifying the determinants of academic success: a machine learning approach in spanish higher education

dc.contributor.authorSánchez Sánchez, Ana María
dc.contributor.authorMello Román, Jorge Daniel
dc.contributor.authorSegura Maroto, Marina
dc.contributor.authorHernández Estrada, Adolfo
dc.date.accessioned2026-01-13T09:21:43Z
dc.date.available2026-01-13T09:21:43Z
dc.date.issued2024
dc.description.abstractAcademic performance plays a key role in assessing the quality and equity of a country’s educational system. Studying the aspects or factors that influence university academic performance is an important research opportunity. This article synthesizes research that employs machine learning techniques to identify the determinants of academic performance in first-year university students. A total of 8700 records from the Complutense University of Madrid corresponding to all incoming students in the academic year 2022–2023 have been analyzed, for which information was available on 28 variables related to university access, academic performance corresponding to the first year, and socioeconomic characteristics. The methodology included feature selection using Random Forest and Extreme Gradient Boosting (XGBoost) to identify the main predictors of academic performance and avoid overfitting in the models, followed by analysis with four different machine learning techniques: Linear Regression, Support Vector Regression, Random Forest, and XGBoost. The models showed similar predictive performance, also highlighting the coincidence in the predictors of academic performance both at the end of the first semester and at the end of the first academic year. Our analysis detects the influence of variables that had not appeared in the literature before, the admission option and the number of enrolled credits. This study contributes to understanding the factors that impact academic performance, providing key information for implementing educational policies aimed at achieving excellence in university education. This includes, for example, peer tutoring and mentoring where high- and low-performing students could participate.
dc.description.departmentDepto. de Economía Financiera y Actuarial y Estadística
dc.description.facultyFac. de Ciencias Económicas y Empresariales
dc.description.refereedTRUE
dc.description.statuspub
dc.identifier.citationSánchez-Sánchez, A. M., Mello-Román, J. D., Segura, M., & Hernández, A. (2024). Identifying the Determinants of Academic Success: A Machine Learning Approach in Spanish Higher Education. Systems, 12(10), 425. https://doi.org/10.3390/systems12100425
dc.identifier.doi10.3390/systems12100425
dc.identifier.issn2079-8954
dc.identifier.officialurlhttps://doi.org/10.3390/systems12100425
dc.identifier.urihttps://hdl.handle.net/20.500.14352/129994
dc.issue.number10
dc.journal.titleSystems
dc.language.isoeng
dc.page.initial425
dc.publisherMDPI
dc.rightsAttribution 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subject.keywordAcademic performance
dc.subject.keywordMachine learning
dc.subject.keywordEducational data mining
dc.subject.keywordXGBoost
dc.subject.keywordRandom forest
dc.subject.ucmEstadística aplicada
dc.subject.ucmEnseñanza universitaria
dc.subject.unesco1209 Estadística
dc.titleIdentifying the determinants of academic success: a machine learning approach in spanish higher education
dc.typejournal article
dc.type.hasVersionVoR
dc.volume.number12
dspace.entity.typePublication
relation.isAuthorOfPublication5ae8e827-1d33-4211-a0b4-da4578d5a07f
relation.isAuthorOfPublication447cb780-3038-40b7-97a3-06c0fd5f36d7
relation.isAuthorOfPublication1ae35181-080b-4abf-9dc4-56a78e074712
relation.isAuthorOfPublication.latestForDiscovery5ae8e827-1d33-4211-a0b4-da4578d5a07f

Download

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Identifying the Determinants of Academic Success.pdf
Size:
1.67 MB
Format:
Adobe Portable Document Format

Collections