A Two-Tiered Segmentation Approach for Transaction Data Warehousing

Xiufeng Liu, Huan Huo, Nadeem Iftikhar, Per Sieverts Nielsen

Research output: Chapter in Book/Report/Conference proceedingBook chapterResearchpeer-review

44 Downloads (Pure)

Abstract

Data warehousing populates data from different source systems into a central data warehouse (DW) through extraction, transformation, and loading (ETL). Massive transaction data are routinely recorded in a variety of applications such as retail commerce, bank systems, and website management. Transaction data record the timestamp and relevant reference data needed for a particular transaction record. It is a non-trivial task for a standard ETL to process transaction data with dependencies and high velocity. This chapter presents a two-tiered segmentation approach for transaction data warehousing. The approach uses a so-called two-staging ETL method to process detailed records from operational systems, followed by a dimensional data process to populate the data store with a star or snowflake schema. The proposed approach is an all-in-one solution capable of processing fast/slowly changing data and early/late-arriving data. This chapter evaluates the proposed method, and the results have validated the effectiveness of the proposed approach for processing transaction data.
Original languageEnglish
Title of host publicationEmerging Perspectives in Big Data Warehousing
Number of pages27
Publication date2019
Chapter1
DOIs
Publication statusPublished - 2019

Cite this

Liu, Xiufeng ; Huo, Huan ; Iftikhar, Nadeem ; Nielsen, Per Sieverts. / A Two-Tiered Segmentation Approach for Transaction Data Warehousing. Emerging Perspectives in Big Data Warehousing. 2019.
@inbook{2b72521610f14022aea06fe30cfbd44d,
title = "A Two-Tiered Segmentation Approach for Transaction Data Warehousing",
abstract = "Data warehousing populates data from different source systems into a central data warehouse (DW) through extraction, transformation, and loading (ETL). Massive transaction data are routinely recorded in a variety of applications such as retail commerce, bank systems, and website management. Transaction data record the timestamp and relevant reference data needed for a particular transaction record. It is a non-trivial task for a standard ETL to process transaction data with dependencies and high velocity. This chapter presents a two-tiered segmentation approach for transaction data warehousing. The approach uses a so-called two-staging ETL method to process detailed records from operational systems, followed by a dimensional data process to populate the data store with a star or snowflake schema. The proposed approach is an all-in-one solution capable of processing fast/slowly changing data and early/late-arriving data. This chapter evaluates the proposed method, and the results have validated the effectiveness of the proposed approach for processing transaction data.",
author = "Xiufeng Liu and Huan Huo and Nadeem Iftikhar and Nielsen, {Per Sieverts}",
year = "2019",
doi = "10.4018/978-1-5225-5516-2.ch001",
language = "English",
booktitle = "Emerging Perspectives in Big Data Warehousing",

}

A Two-Tiered Segmentation Approach for Transaction Data Warehousing. / Liu, Xiufeng; Huo, Huan; Iftikhar, Nadeem ; Nielsen, Per Sieverts.

Emerging Perspectives in Big Data Warehousing. 2019.

Research output: Chapter in Book/Report/Conference proceedingBook chapterResearchpeer-review

TY - CHAP

T1 - A Two-Tiered Segmentation Approach for Transaction Data Warehousing

AU - Liu, Xiufeng

AU - Huo, Huan

AU - Iftikhar, Nadeem

AU - Nielsen, Per Sieverts

PY - 2019

Y1 - 2019

N2 - Data warehousing populates data from different source systems into a central data warehouse (DW) through extraction, transformation, and loading (ETL). Massive transaction data are routinely recorded in a variety of applications such as retail commerce, bank systems, and website management. Transaction data record the timestamp and relevant reference data needed for a particular transaction record. It is a non-trivial task for a standard ETL to process transaction data with dependencies and high velocity. This chapter presents a two-tiered segmentation approach for transaction data warehousing. The approach uses a so-called two-staging ETL method to process detailed records from operational systems, followed by a dimensional data process to populate the data store with a star or snowflake schema. The proposed approach is an all-in-one solution capable of processing fast/slowly changing data and early/late-arriving data. This chapter evaluates the proposed method, and the results have validated the effectiveness of the proposed approach for processing transaction data.

AB - Data warehousing populates data from different source systems into a central data warehouse (DW) through extraction, transformation, and loading (ETL). Massive transaction data are routinely recorded in a variety of applications such as retail commerce, bank systems, and website management. Transaction data record the timestamp and relevant reference data needed for a particular transaction record. It is a non-trivial task for a standard ETL to process transaction data with dependencies and high velocity. This chapter presents a two-tiered segmentation approach for transaction data warehousing. The approach uses a so-called two-staging ETL method to process detailed records from operational systems, followed by a dimensional data process to populate the data store with a star or snowflake schema. The proposed approach is an all-in-one solution capable of processing fast/slowly changing data and early/late-arriving data. This chapter evaluates the proposed method, and the results have validated the effectiveness of the proposed approach for processing transaction data.

U2 - 10.4018/978-1-5225-5516-2.ch001

DO - 10.4018/978-1-5225-5516-2.ch001

M3 - Book chapter

BT - Emerging Perspectives in Big Data Warehousing

ER -