TY - JOUR
T1 - Deep clustering of cooperative multi-agent reinforcement learning to optimize multi chiller HVAC systems for smart buildings energy management
AU - Homod, Raad Z.
AU - Yaseen, Zaher Mundher
AU - Hussein, Ahmed Kadhim
AU - Almusaed, Amjad
AU - Alawi, Omer A.
AU - Falah, Mayadah W.
AU - Abdelrazek, Ali H.
AU - Ahmed, Waqar
AU - Eltaweel, Mahmoud
N1 - © 2022 Elsevier Ltd. All rights reserved. All rights reserved. This is the accepted manuscript version of an article which has been published in final form at https://doi.org/10.1016/j.jobe.2022.105689
PY - 2023/4/15
Y1 - 2023/4/15
N2 - Chillers are responsible for almost half of the total energy demand in buildings. Hence, the obligation of control systems of multi-chiller due to changes indoor environments is one of the most significant parts of a smart building. Such a controller is described as a nonlinear and multi-objective algorithm, and its fabrication is crucial to achieving the optimal balance between indoor thermal comfort and running a minimum number of chillers. This work proposes deep clustering of cooperative multi-agent reinforcement learning (DCCMARL) as well-suited to such system control, which supports centralized control by learning of agents. In MARL, since the learning of agents is based on discrete sets of actions and stats, this drawback significantly affects the model of agents for representing their actions with efficient performance. This drawback becomes considerably worse when increasing the number of agents, due to the increased complexity of solving MARL, which makes modeling policy very challenging. Therefore, the DCCMARL of multi-objective reinforcement learning is leveraging powerful frameworks of a hybrid clustering algorithm to deal with complexity and uncertainty, which is a critical factor that influences to the achievement of high levels of a performance action. The results showed that the ability of agents to manipulate the behavior of the smart building could improve indoor thermal conditions, as well as save energy up to 44.5% compared to conventional methods. It seems reasonable to conclude that agents' performance is influenced by what type of model structure.
AB - Chillers are responsible for almost half of the total energy demand in buildings. Hence, the obligation of control systems of multi-chiller due to changes indoor environments is one of the most significant parts of a smart building. Such a controller is described as a nonlinear and multi-objective algorithm, and its fabrication is crucial to achieving the optimal balance between indoor thermal comfort and running a minimum number of chillers. This work proposes deep clustering of cooperative multi-agent reinforcement learning (DCCMARL) as well-suited to such system control, which supports centralized control by learning of agents. In MARL, since the learning of agents is based on discrete sets of actions and stats, this drawback significantly affects the model of agents for representing their actions with efficient performance. This drawback becomes considerably worse when increasing the number of agents, due to the increased complexity of solving MARL, which makes modeling policy very challenging. Therefore, the DCCMARL of multi-objective reinforcement learning is leveraging powerful frameworks of a hybrid clustering algorithm to deal with complexity and uncertainty, which is a critical factor that influences to the achievement of high levels of a performance action. The results showed that the ability of agents to manipulate the behavior of the smart building could improve indoor thermal conditions, as well as save energy up to 44.5% compared to conventional methods. It seems reasonable to conclude that agents' performance is influenced by what type of model structure.
KW - Clustering of multi-agent reinforcement learning (MARL) policy
KW - Hybrid layer model
KW - Multi-objective reinforcement learning (MORL)
KW - Multi-unit residential buildings
KW - Optimal chiller sequencing control (OCSC)
KW - Takagi–sugeno fuzzy (TSF) identification
UR - http://www.scopus.com/inward/record.url?scp=85144449447&partnerID=8YFLogxK
U2 - 10.1016/j.jobe.2022.105689
DO - 10.1016/j.jobe.2022.105689
M3 - Article
SN - 2352-7102
VL - 65
SP - 1
EP - 29
JO - Journal of Building Engineering (JOBE)
JF - Journal of Building Engineering (JOBE)
M1 - 105689
ER -