Machine learning XAI for early loan default prediction

Loading...
Thumbnail Image

Full text at PDC

Publication date

2025

Advisors (or tutors)

Editors

Journal Title

Journal ISSN

Volume Title

Publisher

Springer Nature
Citations
Google Scholar

Citation

Monje, L., Carrasco, R.A. & Sánchez-Montañés, M. Machine Learning XAI for Early Loan Default Prediction. Comput Econ (2025). https://doi.org/10.1007/s10614-025-10962-9

Abstract

Abstract: Early default prediction with predictive models is of crucial importance for financial institutions, Fintech or Peer to Peer (P2P) lending platforms, as it allows them to effectively mitigate the potential risks associated with customer or debtor defaults, anticipating before this becomes a major problem. This proactive approach serves to avoid the consequent impact on provisions and, subsequently, on the institution's capital. On the other hand, advanced predictive models are often less interpretable than traditional models such as probit (Abdou & Pointon, 2011) and logistic regression (Bolton, 2009; Liu et al. 2024). Due to this lower explainability, our goal was to develop a methodology that allows building an advanced predictive model together with a linguistically interpretable explanation useful for decision making from large volumes of data. For this purpose, our case study was the loan dataset of Lending Club, the largest P2P lending platform in the world. As a result, we obtained a model based on the eXtreme Gradient Boosting (XGBoost) together with its linguistic interpretation using a surrogate model and the 2-tuple fuzzy linguistic model Monje et al., (Mathematics 10:1428, 2022). This model allows us to identify five risk categories (very low, low, medium, high and very high).

Research Projects

Organizational Units

Journal Issue

Description

Keywords

Collections