Journal of South China University of Technology(Natural Science Edition) ›› 2025, Vol. 53 ›› Issue (1): 21-31.doi: 10.12141/j.issn.1000-565X.240105

• Energy,Power & Electrical Engineering • Previous Articles     Next Articles

Investigating an Enhanced H-AC Algorithm-Based Strategy for Energy-Saving Optimization Control in Cold Source System

ZHOU Xuan1,2,3, MO Haohua1, YAN Junwei1,2,3   

  1. 1.School of Mechanical and Automotive Engineering,South China University of Technology,Guangzhou 510640,Guangdong,China
    2.Guangzhou Institute of Modern Industrial Technology,Guangzhou 511458,Guangdong,China
    3.Guangdong Artificial Intelligence and Digital Economy Laboratory (Guangzhou),Guangzhou 511442,Guangdong,China
  • Received:2024-03-08 Online:2025-01-25 Published:2025-01-02
  • About author:周璇(1976—),女,博士,教授级高级工程师,主要从事建筑节能、数据挖掘研究。E-mail: zhouxuan@scut.edu.cn
  • Supported by:
    the Natural Science Foundation of Guangdong Province(2022A1515011128)

Abstract:

The optimization of the number of central air-conditioning cooling source units and their operating parameters is a collaborative optimization problem involving both discrete and continuous variables, which poses challenges for classical reinforcement learning algorithms. To address this problem, this paper proposed an energy-saving optimization control strategy for central air-conditioning cooling source systems based on a combination of the options-critic and actor-critic frameworks. Firstly, a hierarchical actor-critic (H-AC) algorithm was utilized to hierarchically optimize the number of units and operating parameters, with both the high-level and low-level models sharing a Q-network to evaluate state values, thereby addressing optimization challenges across multiple time scales. Secondly, the H-AC algorithm was improved in terms of agent architecture, policy, and network update mechanisms to accelerate the convergence of the agent. Finally, the proposed method was validated on the cooling source system of a research building located in a hot summer and warm winter region, using a TRNSYS simulation platform for experiments. The results demonstrate that, under conditions where the average indoor comfort time proportion is increased by 14.08, 11.23, 29.70 and 9.07 percentage points, respectively, the system energy consumption based on the improved H-AC algorithm is reduced by 32.28%, 28.55%, 28.64%, and 11.53% compared to four classical DRL algorithms. Although the system energy consumption of the improved H-AC algorithm is 0.27% higher than that of the options-critic framework, it achieves a more stable learning process and increases the average indoor comfort time proportion by 4.8%. This approach offers effective technical solutions for energy-saving optimization of central air-conditioning cold source systems in various building types, contributing to the achievement of buildings’ dual-carbon goals.

Key words: cold source system, TRNSYS simulation platform, deep hierarchical reinforcement learning, option-critic framework, co-optimization

CLC Number: