Performance evaluation of distributed machine learning for load forecasting in smart grids

Dabeeruddin Syed, Shady S. Refaat, Haitham Abu-Rub

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

13 Citations (Scopus)

Abstract

Load forecasting in smart grids is the process of predicting the amount of electrical power needed to meet short-, medium-, and long-term demand. Accurate load forecasting helps electrical utilities manage their energy production, operations, and control. Most state-of-the-art forecasting methodologies use classical machine learning algorithms to predict the electrical load. Big data platforms and parallel distributed computing are not yet used to their full potential in the available solutions. In this paper, Apache Spark and Apache Hadoop are used as big data platforms for distributed computing to predict the load from available big data. MLlib, Spark's machine learning library, is used for distributed computing. MLlib allows testing classic regression algorithms such as linear regression, generalized linear regression, decision tree, random forest, and gradient-boosted trees, in addition to survival regression and isotonic regression. The obtained results show that Spark produces high accuracy while parallelizing the load forecasting process, with highly competitive training and test times. Actual big data are used in the load forecasting process.

Original language: English
Title of host publication: Proceedings of the 30th International Conference on Cybernetics and Informatics, K and I 2020
Editors: Jan Ciganek, Stefan Kozak, Alena Kozakova
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
ISBN (Electronic): 9781728143811
DOIs
Publication status: Published - Jan 2020
Event: 30th International Conference on Cybernetics and Informatics, K and I 2020 - Velke Karlovice, Czech Republic
Duration: 29 Jan 2020 → 1 Feb 2020

Publication series

Name: Proceedings of the 30th International Conference on Cybernetics and Informatics, K and I 2020

Conference

Conference: 30th International Conference on Cybernetics and Informatics, K and I 2020
Country/Territory: Czech Republic
City: Velke Karlovice
Period: 29/01/20 → 1/02/20

Keywords

  • Apache Spark
  • Distributed Computing
  • Distributed Machine Learning
  • Load Forecast
  • Smart Grids
