A Scalable Smart Meter Data Generator Using Spark

Nadeem Iftikhar, Xiufeng Liu, Sergiu Danalachi, Finn Nordbjerg, Jens Vollesen

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

478 Downloads (Pure)

Abstract

Today, smart meters are being used worldwide. As a matter of fact smart meters produce large volumes of data. Thus, it is important for smart meter data management and analytics systems to process petabytes of data. Benchmarking and testing of these systems require scalable data, however, it can be challenging to get large data sets due to privacy and/or data protection regulations. This paper presents a scalable smart meter data generator using Spark that can generate realistic data sets. The proposed data generator is based on a supervised machine learning method that can generate data of any size by using small data sets as seed. Moreover, the generator can preserve the characteristics of data with respect to consumption patterns and user groups. This paper evaluates the proposed data generator in a cluster based environment in order to validate its effectiveness and scalability.
Original languageEnglish
Title of host publicationOTM Confederated International Conferences "On the Move to Meaningful Internet Systems"
PublisherSpringer
Publication date2017
Pages21-36
DOIs
Publication statusPublished - 2017
EventOn the Move to Meaningful Internet Systems - Rhodos, Greece
Duration: 23 Oct 201727 Oct 2017

Conference

ConferenceOn the Move to Meaningful Internet Systems
Country/TerritoryGreece
CityRhodos
Period23/10/201727/10/2017
SeriesLecture Notes in Computer Science
Volume10573
ISSN0302-9743

Fingerprint

Dive into the research topics of 'A Scalable Smart Meter Data Generator Using Spark'. Together they form a unique fingerprint.

Cite this