TY - JOUR
T1 - Comparison of machine learning algorithms for evaluating building energy efficiency using big data analytics
AU - Egwim, Christian Nnaemeka
AU - Alaka, Hafiz
AU - Egunjobi, Oluwapelumi
AU - Gomes, Alvaro
AU - Mporas, Iosif
N1 - © 2022, Emerald Publishing Limited. This is the accepted manuscript version of an article which has been published in final form at https://doi.org/10.1108/jedt-05-2022-0238
PY - 2022/9/26
Y1 - 2022/9/26
N2 - Purpose: This study aims to compare and evaluate the application of commonly used machine learning (ML) algorithms used to develop models for assessing energy efficiency of buildings. Design/methodology/approach: This study foremostly combined building energy efficiency ratings from several data sources and used them to create predictive models using a variety of ML methods. Secondly, to test the hypothesis of ensemble techniques, this study designed a hybrid stacking ensemble approach based on the best performing bagging and boosting ensemble methods generated from its predictive analytics. Findings: Based on performance evaluation metrics scores, the extra trees model was shown to be the best predictive model. More importantly, this study demonstrated that the cumulative result of ensemble ML algorithms is usually always better in terms of predicted accuracy than a single method. Finally, it was discovered that stacking is a superior ensemble approach for analysing building energy efficiency than bagging and boosting. Research limitations/implications: While the proposed contemporary method of analysis is assumed to be applicable in assessing energy efficiency of buildings within the sector, the unique data transformation used in this study may not, as typical of any data driven model, be transferable to the data from other regions other than the UK. Practical implications: This study aids in the initial selection of appropriate and high-performing ML algorithms for future analysis. This study also assists building managers, residents, government agencies and other stakeholders in better understanding contributing factors and making better decisions about building energy performance. Furthermore, this study will assist the general public in proactively identifying buildings with high energy demands, potentially lowering energy costs by promoting avoidance behaviour and assisting government agencies in making informed decisions about energy tariffs when this novel model is integrated into an energy monitoring system. Originality/value: This study fills a gap in the lack of a reason for selecting appropriate ML algorithms for assessing building energy efficiency. More importantly, this study demonstrated that the cumulative result of ensemble ML algorithms is usually always better in terms of predicted accuracy than a single method.
AB - Purpose: This study aims to compare and evaluate the application of commonly used machine learning (ML) algorithms used to develop models for assessing energy efficiency of buildings. Design/methodology/approach: This study foremostly combined building energy efficiency ratings from several data sources and used them to create predictive models using a variety of ML methods. Secondly, to test the hypothesis of ensemble techniques, this study designed a hybrid stacking ensemble approach based on the best performing bagging and boosting ensemble methods generated from its predictive analytics. Findings: Based on performance evaluation metrics scores, the extra trees model was shown to be the best predictive model. More importantly, this study demonstrated that the cumulative result of ensemble ML algorithms is usually always better in terms of predicted accuracy than a single method. Finally, it was discovered that stacking is a superior ensemble approach for analysing building energy efficiency than bagging and boosting. Research limitations/implications: While the proposed contemporary method of analysis is assumed to be applicable in assessing energy efficiency of buildings within the sector, the unique data transformation used in this study may not, as typical of any data driven model, be transferable to the data from other regions other than the UK. Practical implications: This study aids in the initial selection of appropriate and high-performing ML algorithms for future analysis. This study also assists building managers, residents, government agencies and other stakeholders in better understanding contributing factors and making better decisions about building energy performance. Furthermore, this study will assist the general public in proactively identifying buildings with high energy demands, potentially lowering energy costs by promoting avoidance behaviour and assisting government agencies in making informed decisions about energy tariffs when this novel model is integrated into an energy monitoring system. Originality/value: This study fills a gap in the lack of a reason for selecting appropriate ML algorithms for assessing building energy efficiency. More importantly, this study demonstrated that the cumulative result of ensemble ML algorithms is usually always better in terms of predicted accuracy than a single method.
KW - Original Article
KW - Big Data Analytics
KW - Buildings
KW - BUILDING PERFORMANCE
KW - Energy Efficiency
KW - Machine Learning
KW - Predictive Modelling
KW - Machine learning
KW - Big data analytics
KW - Energy efficiency
KW - Predictive modelling
UR - http://www.scopus.com/inward/record.url?scp=85139070843&partnerID=8YFLogxK
U2 - 10.1108/jedt-05-2022-0238
DO - 10.1108/jedt-05-2022-0238
M3 - Article
SN - 1726-0531
VL - 22
SP - 1325
EP - 1350
JO - Journal of Engineering, Design and Technology
JF - Journal of Engineering, Design and Technology
IS - 4
ER -