Evolving Strategies in Machine Learning: A Systematic Review of Concept Drift Detection

Research output: Contribution to journalReview articlepeer-review

43 Downloads (Pure)

Abstract

In this comprehensive literature review, we rigorously adhere to the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines for our process and reporting. This review employs an innovative method integrating the advanced natural language processing model T5 (Text-to-Text Transfer Transformer) to enhance the accuracy and efficiency of screening and data extraction processes. We assess strategies for handling the concept drift in machine learning using high-impact publications from notable databases that were made accessible via the IEEE and Science Direct APIs. The chronological analysis covering the past two decades provides a historical perspective on methodological advancements, recognizing their strengths and weaknesses through citation metrics and rankings. This review aims to trace the growth and evolution of concept drift mitigation strategies and to provide a valuable resource that guides future research and deepens our understanding of this rapidly changing field. Key findings highlight the effectiveness of diverse methodologies such as drift detection methods, window-based methods, unsupervised statistical methods, and neural network techniques. However, challenges remain, particularly with imbalanced data, computational efficiency, and the application of concept drift detection to non-tabular data like images. This review aims to trace the growth and evolution of concept drift mitigation strategies and provide a valuable resource that guides future research and deepens our understanding of this rapidly changing field.
Original languageEnglish
Article number786
Pages (from-to)1-24
Number of pages24
JournalInformation (Switzerland)
Volume15
Issue number12
Early online date7 Dec 2024
DOIs
Publication statusPublished - 31 Dec 2024

Keywords

  • concept drift
  • systematic review
  • machine learning
  • types of concept drift
  • adaptive strategies
  • Science Direct API
  • IEEE API
  • streaming data
  • non-stationary environments
  • evolving data streams

Cite this