Geometric SMOTE for regression

Research output: Contribution to journalReview articlepeer-review

Abstract

Learning from imbalanced data sets is known to be a challenging task. There are many proposals to tackle the challenge for classification problems, but regarding regression the solutions are few. In the context of regression, imbalanced learning means that there is a concern with the accurate prediction of the target values in a subset of the continuous target variable, considering that these values rarely occur in the data set. In this article, we extend the G-SMOTE algorithm that is used in classification to regression tasks. G-SMOTE is a pre-processing algorithm that differs from the SMOTE algorithm as it allows the generation of synthetic instances in a geometric region around the selected instances rather than in the line segment that joins the two selected instances. The performance of G-SMOTE for regression was compared against other methods, and the empirical results show that our proposal outperformed those methods.
Original languageEnglish
Article number116387
Pages (from-to)1-8
Number of pages8
JournalExpert Systems with Applications
Volume193
Issue numberMay
Early online date1 Jan 2022
DOIs
Publication statusE-pub ahead of print - 1 Jan 2022

Keywords

  • Imbalanced
  • Data-level
  • Regression

Fingerprint

Dive into the research topics of 'Geometric SMOTE for regression'. Together they form a unique fingerprint.

Cite this