TY - JOUR
T1 - Two-Stage Optimization-Learning Framework for Uncertainty-Aware Multi-Zonal Data Center Energy Management
AU - Rafique, Abubakar
AU - Yu, Xiaojun
AU - Jawad, Muhammad
AU - Song, Qun
AU - Yuan, Zhaohui
AU - Sadiq, Muhammad Tariq
AU - Daniel, Kamran
AU - Shabbir, Noman
N1 - Publisher Copyright:
© 2026 by the authors.
PY - 2026/4
Y1 - 2026/4
N2 - This paper proposes a hybrid optimization framework that synergizes mixed-integer linear programming (MILP) and reinforcement learning (RL) to minimize operational costs and carbon emissions in multi-zonal data centers integrated with renewable generation and battery storage. The approach addresses the dual challenges of day-ahead planning under forecasted conditions and real-time adaptation to stochastic demand and renewable intermittency. A deterministic MILP model performs advance scheduling with reliability constraints derived from complementary cumulative distribution functions (CCDFs) to ensure robust renewable allocation. Three operational paradigms are evaluated: pure deterministic MILP optimization, pure RL control, and a hybrid framework where RL dynamically adjusts MILP-derived schedules via residual control to accommodate real-time uncertainties. The experimental results demonstrate that, for deterministic conditions, RL achieves performance comparable to MILP in energy utilization, peak demand, and cost. Under demand uncertainty, the hybrid and RL-based approaches reduce operational costs by up to 33% compared to uncertainty-aware MILP execution. These savings stem from significantly lower grid peak demand, achieved by prioritizing renewable energy consumption, strategically dispatching batteries, and resorting to grid power only during extreme demand spikes. The hybrid framework effectively balances optimality and computational efficiency, offering a practical solution for uncertainty-resilient energy management in data centers.
AB - This paper proposes a hybrid optimization framework that synergizes mixed-integer linear programming (MILP) and reinforcement learning (RL) to minimize operational costs and carbon emissions in multi-zonal data centers integrated with renewable generation and battery storage. The approach addresses the dual challenges of day-ahead planning under forecasted conditions and real-time adaptation to stochastic demand and renewable intermittency. A deterministic MILP model performs advance scheduling with reliability constraints derived from complementary cumulative distribution functions (CCDFs) to ensure robust renewable allocation. Three operational paradigms are evaluated: pure deterministic MILP optimization, pure RL control, and a hybrid framework where RL dynamically adjusts MILP-derived schedules via residual control to accommodate real-time uncertainties. The experimental results demonstrate that, for deterministic conditions, RL achieves performance comparable to MILP in energy utilization, peak demand, and cost. Under demand uncertainty, the hybrid and RL-based approaches reduce operational costs by up to 33% compared to uncertainty-aware MILP execution. These savings stem from significantly lower grid peak demand, achieved by prioritizing renewable energy consumption, strategically dispatching batteries, and resorting to grid power only during extreme demand spikes. The hybrid framework effectively balances optimality and computational efficiency, offering a practical solution for uncertainty-resilient energy management in data centers.
KW - data centers
KW - energy optimization
KW - energy-efficient computing
KW - power management
KW - real-time decision making
KW - reinforcement learning
UR - https://www.scopus.com/pages/publications/105035489667
U2 - 10.3390/en19071736
DO - 10.3390/en19071736
M3 - 文章
AN - SCOPUS:105035489667
SN - 1996-1073
VL - 19
JO - Energies
JF - Energies
IS - 7
M1 - 1736
ER -