A Review of Big Data and Machine Learning Operations in Official Statistics: MLOps and Feature Store Adoption

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Integrating machine learning (ML) into the official statisticians' toolset is gaining popularity as National Statistical Offices (NSOs) strive to improve their methodologies. This trend poses new challenges and implications for incorporating innovative techniques that ensure the reliability of the official statistical production process. A comprehensive literature review was conducted using Scopus and Web of Science databases to explore the contemporary applications of data science in official statistics. A total of 178 research articles were identified, focusing on areas such as big data, machine learning, and data quality. While the literature review revealed extensive proposals on utilizing alternative data and applying machine learning techniques to support official statistics production, it also identified research gaps in the post-training steps of the machine learning process. Areas requiring further investigation include machine learning operations in a production environment, data quality assurance, and governance.
Original languageEnglish
Title of host publication2024 IEEE 48th Annual Computers, Software, and Applications Conference
Subtitle of host publicationCOMPSAC 2024
EditorsHossain Shahriar, Hiroyuki Ohsaki, Moushumi Sharmin, Dave Towey, AKM Jahangir Alam Majumder, Yoshiaki Hori, Ji-Jiang Yang, Michiharu Takemoto, Nazmus Sakib, Ryohei Banno, Sheikh Iqbal Ahamed
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Pages711-718
Number of pages8
ISBN (Print)979-8-3503-7696-8
DOIs
Publication statusPublished - Jul 2024
Event48th IEEE Annual Computers, Software, and Applications Conference - Nakanoshima Love Central, Osaka, Japan
Duration: 2 Jul 20244 Jul 2024
Conference number: 48
https://ieeecompsac.computer.org/2024/

Publication series

NameProceedings of the IEEE Annual Computer Software and Applications Conference
PublisherIEEE
ISSN (Print)2836-3795

Conference

Conference48th IEEE Annual Computers, Software, and Applications Conference
Abbreviated titleCOMPSAC 2024
Country/TerritoryJapan
CityOsaka
Period2/07/244/07/24
Internet address

Keywords

  • Feature store
  • Official statistics
  • Machine learning operations
  • Data science
  • Big data
  • Data quality

Fingerprint

Dive into the research topics of 'A Review of Big Data and Machine Learning Operations in Official Statistics: MLOps and Feature Store Adoption'. Together they form a unique fingerprint.

Cite this