The Main Challenges of Machine Learning for Credit Scoring: A review

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Machine learning models and new techniques have been widely researched in credit scoring. For most credit-scoring datasets, data is unbalanced since the “bad” class is usually lower in proportion than the “good” class. Also, the rejection rate is high in some fields, leading to sample bias when training the scores. This paper presents a literature review to address these concerns, bringing the most known techniques to solve them. 490 articles were initially screened in Scopus and Web of Science, of which 88 were subject to content analysis. The results show that a significant number of algorithms have been tested in different datasets. For the class imbalance problem, SMOTE (synthetic minority oversampling technique) is the most used technique, but robust machine learning techniques have also been introduced. Finally, it was noticed that there is a noticeable opportunity for combining different techniques for imbalanced data that can be explored in future research works as a research gap.
Original languageEnglish
Title of host publication2024 19th Iberian Conference on Information Systems and Technologies, CISTI'2024
Publication statusAccepted/In press - 28 Jun 2024
Event19th Iberian Conference on Information Systems and Technologies 2024 - Universidad de Salamanca, Salamanca, Spain
Duration: 25 Jun 202428 Jun 2024
Conference number: 19
https://cisti.eu/index.php?lang=pt

Conference

Conference19th Iberian Conference on Information Systems and Technologies 2024
Abbreviated titleCISTI'2024
Country/TerritorySpain
CitySalamanca
Period25/06/2428/06/24
Internet address

Keywords

  • Credit Scoring
  • Credit Risk
  • Machine Learning
  • Reject Inference
  • Unbalanced Datasets
  • Small and Medium Enterprises

Fingerprint

Dive into the research topics of 'The Main Challenges of Machine Learning for Credit Scoring: A review'. Together they form a unique fingerprint.

Cite this