TY - JOUR
T1 - Auto-Scaling Cloud Resources using LSTM and Reinforcement Learning to Guarantee Service-Level Agreements and Reduce Resource Costs
AU - Zhong, Jiang
AU - Duan, Saisai
AU - Li, Qing
N1 - Publisher Copyright:
© 2019 IOP Publishing Ltd. All rights reserved.
PY - 2019/7/12
Y1 - 2019/7/12
N2 - Auto-Scaling cloud resources aim at responding to application demands by automatically scaling the compute resources at runtime to guarantee service-level agreements (SLAs) and reduce resource costs. Existing approaches often resort to predefined sets of rules to add/remove resources depending on the application usage. However, optimal adaptation rules are difficult to devise and generalize. A proactive approach is proposed to perform auto-scaling cloud resources in response to dynamic traffic changes. This paper applies Long Short-Term Memory (LSTM) to predicting the accurate number of requests in the next time and applies Reinforcement Learning (RL) to obtaining the optimal action to scale in or scale out virtual machines. To validate the proposal, experiments under two real-world workload traces are conducted, and the results show that the approach can ensure virtual machines to work steadily and can reduce SLA violations by up to 10%-30% compared with other approaches.
AB - Auto-Scaling cloud resources aim at responding to application demands by automatically scaling the compute resources at runtime to guarantee service-level agreements (SLAs) and reduce resource costs. Existing approaches often resort to predefined sets of rules to add/remove resources depending on the application usage. However, optimal adaptation rules are difficult to devise and generalize. A proactive approach is proposed to perform auto-scaling cloud resources in response to dynamic traffic changes. This paper applies Long Short-Term Memory (LSTM) to predicting the accurate number of requests in the next time and applies Reinforcement Learning (RL) to obtaining the optimal action to scale in or scale out virtual machines. To validate the proposal, experiments under two real-world workload traces are conducted, and the results show that the approach can ensure virtual machines to work steadily and can reduce SLA violations by up to 10%-30% compared with other approaches.
UR - http://www.scopus.com/inward/record.url?scp=85070272241&partnerID=8YFLogxK
U2 - 10.1088/1742-6596/1237/2/022033
DO - 10.1088/1742-6596/1237/2/022033
M3 - 会议文章
AN - SCOPUS:85070272241
SN - 1742-6588
VL - 1237
JO - Journal of Physics: Conference Series
JF - Journal of Physics: Conference Series
IS - 2
M1 - 022033
T2 - 2019 4th International Conference on Intelligent Computing and Signal Processing, ICSP 2019
Y2 - 29 March 2019 through 31 March 2019
ER -