Smart Meter Data Analytics: Systems, Algorithms and Benchmarking

Xiufeng Liu, Lukasz Golab, Wojciech Golab, Ihab F. Ilyas, Shichao Jin

Research output: Contribution to journalJournal articleResearchpeer-review


Smart electricity meters have been replacing conventional meters worldwide, enabling automated collection of fine-grained (e.g., every 15 minutes or hourly) consumption data. A variety of smart meter analytics algorithms and applications have been proposed, mainly in the smart grid literature. However, the focus has been on what can be done with the data rather than how to do it efficiently. In this paper, we examine smart meter analytics from a software performance perspective. First, we design a performance benchmark that includes common smart meter analytics tasks. These include off-line feature extraction and model building as well a framework for on-line anomaly detection that we propose. Second, since obtaining real smart meter data is difficult due to privacy issues, we present an algorithm for generating large realistic data sets from a small seed of real data. Third, we implement the proposed benchmark using five representative platforms: a traditional numeric computing platform (Matlab), a relational DBMS with a built-in machine learning toolkit (PostgreSQL/MADlib), a main-memory column store (“System C”), and two distributed data processing platforms (Hive and Spark/Spark Streaming). We compare the five platforms in terms of application development effort and performance on a multicore machine as well as a cluster of 16 commodity servers.
Original languageEnglish
JournalA C M Transactions on Database Systems
Issue number1
Number of pages38
Publication statusPublished - 2016

Bibliographical note

Accepted by Journal of IEEE Transaction of Database Systems<br/>This journal


Dive into the research topics of 'Smart Meter Data Analytics: Systems, Algorithms and Benchmarking'. Together they form a unique fingerprint.

Cite this