%0 Journal Article %A Gutierrez-Gallego, Alberto %A Garnica Alcázar, Antonio Óscar %A Jose M. Velasco, Jose M. %A Parra, Daniel %T Balanced Underbagged Ensemble Approach for Classifying Highly Imbalanced Datasets in the Insurance and Financial Sectors %D 2025 %U https://hdl.handle.net/20.500.14352/132943 %X Data bias is a critical challenge in machine learning applications within the financial and insurance sectors, as it can lead to misleading risk assessments and inaccurate predictive models. A prevalent source of bias in real‐world datasets is the imbalanced distribution of classes, which is particularly problematic in fraud detection, credit risk assessment, and claim prediction. Traditional approaches to handling imbalanced data often rely on undersampling or oversampling techniques. However, these methods may generate unrealistic minority class samples or fail to perform effectively when dealing with extreme class imbalances. In this paper, we propose a configurable technique based on the underbagging method, integrated with a classifier for highly imbalanced datasets. Our approach is designed to enhance the predictive accuracy of the minority class while maintaining robust performance for the majority class. We incorporate our methodology into a classification ensemble framework and evaluate its effectiveness by comparing it against 100 combinations of 10 different oversampling and undersampling techniques applied to 10 different machine learning algorithms. The evaluation is conducted on two highly imbalanced real‐world datasets: one related to auto insurance claims and another focused on credit card fraud detection. Our statistical analysis demonstrates that Balanced Underbagged Ensemble achieves superior classification performance in terms of recall for both classes, regardless of the base machine learning model used within the ensemble. Furthermore, our method finds an optimal balance between classification performance and computational efficiency. %~