This article investigates the application of machine learning techniques for predicting corporate default risk. In the credit scoring domain, the class imbalance problem is prevalent, with defaulted cases typically being much less numerous than non-defaulted ones. To address challenges posed by unbalanced datasets, various calibration methodologies, including data sampling strategies and Synthetic Minority Oversampling Technique (SMOTE), are explored to identify the most effective strategy for the specific case. Additionally, hyperparameter tuning plays a crucial role in enhancing machine learning model performance. Therefore, we employ hyperparameter optimization techniques such as Grid Search, Random Search, Bayesian Optimization, and Adaptive Particle Swarm Optimization. The primary objective is to compare these techniques and determine the optimal strategy for predicting company default events. Our evaluation incorporates standard parameters such as accuracy, balanced accuracy, specificity, sensitivity, and F1 score. Another factor to consider to discriminate the various approaches is the execution time, to be able to develop models that can be implemented in market products. The experimental results reveal which approaches result in significantly improved classification performance.

Improved Credit Scoring Model with Hyperparameter Optimization

Chiara Marciano
;
Mario Rosario Guarracino;Brian Daniel Bernhardt
2024-01-01

Abstract

This article investigates the application of machine learning techniques for predicting corporate default risk. In the credit scoring domain, the class imbalance problem is prevalent, with defaulted cases typically being much less numerous than non-defaulted ones. To address challenges posed by unbalanced datasets, various calibration methodologies, including data sampling strategies and Synthetic Minority Oversampling Technique (SMOTE), are explored to identify the most effective strategy for the specific case. Additionally, hyperparameter tuning plays a crucial role in enhancing machine learning model performance. Therefore, we employ hyperparameter optimization techniques such as Grid Search, Random Search, Bayesian Optimization, and Adaptive Particle Swarm Optimization. The primary objective is to compare these techniques and determine the optimal strategy for predicting company default events. Our evaluation incorporates standard parameters such as accuracy, balanced accuracy, specificity, sensitivity, and F1 score. Another factor to consider to discriminate the various approaches is the execution time, to be able to develop models that can be implemented in market products. The experimental results reveal which approaches result in significantly improved classification performance.
2024
9783031733642
9783031733659
File in questo prodotto:
File Dimensione Formato  
978-3-031-73365-9_23.pdf

solo utenti autorizzati

Descrizione: Article
Tipologia: Versione Editoriale (PDF)
Licenza: Copyright dell'editore
Dimensione 456.76 kB
Formato Adobe PDF
456.76 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11580/112644
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
social impact