Augmenting data warehousing architectures with Hadoop

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

As the volume of available data increases exponentially, traditional data warehouses struggle to transform this data into actionable knowledge. This study explores the potentialities of Hadoop as a data transformation tool in the setting of a traditional data warehouse environment. Hadoop’s distributed parallel execution model and horizontal scalability offer great capabilities when the amounts of data to be processed require the infrastructure to expand. Through a typification of the SQL statements, responsible for the data transformation processes, we were able to understand that Hadoop, and its distributed processing model, delivers outstanding performance results associated with the analytical layer, namely in the aggregation of large data sets. We demonstrate, empirically, the performance gains that can be extracted from Hadoop, in comparison to a Relational Database Management System, regarding speed, storage usage, and scalability potential, and suggest how this can be used to evolve data warehouses into the age of Big Data.

Original languageEnglish
Title of host publicationAtas da Conferencia da Associacao Portuguesa de Sistemas de Informacao 2019
Subtitle of host publicationCAPSI 2019
Publication statusPublished - 1 Oct 2019
Event19.a Conferencia da Associacao Portuguesa de Sistemas de Informacao, CAPSI 2019 - 19th Conference of the Portuguese Association for Information Systems, CAPSI 2019 - Lisboa, Portugal
Duration: 11 Oct 201912 Oct 2019

Publication series

NameAtas da Conferencia da Associacao Portuguesa de Sistemas de Informacao

Conference

Conference19.a Conferencia da Associacao Portuguesa de Sistemas de Informacao, CAPSI 2019 - 19th Conference of the Portuguese Association for Information Systems, CAPSI 2019
CountryPortugal
CityLisboa
Period11/10/1912/10/19

Keywords

  • Big data
  • Data Warehousing
  • Hadoop

Fingerprint Dive into the research topics of 'Augmenting data warehousing architectures with Hadoop'. Together they form a unique fingerprint.

  • Cite this

    Dias, H., & Henriques, R. (2019). Augmenting data warehousing architectures with Hadoop. In Atas da Conferencia da Associacao Portuguesa de Sistemas de Informacao 2019: CAPSI 2019 (Atas da Conferencia da Associacao Portuguesa de Sistemas de Informacao).