TY - GEN
T1 - A Scalable Smart Meter Data Generator Using Spark
AU - Iftikhar, Nadeem
AU - Liu, Xiufeng
AU - Danalachi, Sergiu
AU - Nordbjerg, Finn
AU - Vollesen, Jens
PY - 2017
Y1 - 2017
N2 - Today, smart meters are being used worldwide. As a matter of fact smart meters produce large volumes of data. Thus, it is important for smart meter data management and analytics systems to process petabytes of data. Benchmarking and testing of these systems require scalable data, however, it can be challenging to get large data sets due to privacy and/or data protection regulations. This paper presents a scalable smart meter data generator using Spark that can generate realistic data sets. The proposed data generator is based on a supervised machine learning method that can generate data of any size by using small data sets as seed. Moreover, the generator can preserve the characteristics of data with respect to consumption patterns and user groups. This paper evaluates the proposed data generator in a cluster based environment in order to validate its effectiveness and scalability.
AB - Today, smart meters are being used worldwide. As a matter of fact smart meters produce large volumes of data. Thus, it is important for smart meter data management and analytics systems to process petabytes of data. Benchmarking and testing of these systems require scalable data, however, it can be challenging to get large data sets due to privacy and/or data protection regulations. This paper presents a scalable smart meter data generator using Spark that can generate realistic data sets. The proposed data generator is based on a supervised machine learning method that can generate data of any size by using small data sets as seed. Moreover, the generator can preserve the characteristics of data with respect to consumption patterns and user groups. This paper evaluates the proposed data generator in a cluster based environment in order to validate its effectiveness and scalability.
U2 - 10.1007/978-3-319-69462-7_2
DO - 10.1007/978-3-319-69462-7_2
M3 - Article in proceedings
T3 - Lecture Notes in Computer Science
SP - 21
EP - 36
BT - OTM Confederated International Conferences "On the Move to Meaningful Internet Systems"
PB - Springer
T2 - On the Move to Meaningful Internet Systems
Y2 - 23 October 2017 through 27 October 2017
ER -