Auto-Scaling Cloud Resources using LSTM and Reinforcement Learning to Guarantee Service-Level Agreements and Reduce Resource Costs

Jiang Zhong; Saisai Duan; Qing Li

doi:10.1088/1742-6596/1237/2/022033

Auto-Scaling Cloud Resources using LSTM and Reinforcement Learning to Guarantee Service-Level Agreements and Reduce Resource Costs

Jiang Zhong, Saisai Duan, Qing Li

Chongqing University

科研成果: 期刊稿件 › 会议文章 › 同行评审

5 引用（Scopus）

摘要

Auto-Scaling cloud resources aim at responding to application demands by automatically scaling the compute resources at runtime to guarantee service-level agreements (SLAs) and reduce resource costs. Existing approaches often resort to predefined sets of rules to add/remove resources depending on the application usage. However, optimal adaptation rules are difficult to devise and generalize. A proactive approach is proposed to perform auto-scaling cloud resources in response to dynamic traffic changes. This paper applies Long Short-Term Memory (LSTM) to predicting the accurate number of requests in the next time and applies Reinforcement Learning (RL) to obtaining the optimal action to scale in or scale out virtual machines. To validate the proposal, experiments under two real-world workload traces are conducted, and the results show that the approach can ensure virtual machines to work steadily and can reduce SLA violations by up to 10%-30% compared with other approaches.

源语言	英语
文章编号	022033
期刊	Journal of Physics: Conference Series
卷	1237
期	2
DOI	https://doi.org/10.1088/1742-6596/1237/2/022033
出版状态	已出版 - 12 7月 2019
已对外发布	是
活动	2019 4th International Conference on Intelligent Computing and Signal Processing, ICSP 2019 - Xi'an, 中国期限: 29 3月 2019 → 31 3月 2019

访问文件

10.1088/1742-6596/1237/2/022033

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{643d790cfeac4e16adf5fdd4d04df697,

title = "Auto-Scaling Cloud Resources using LSTM and Reinforcement Learning to Guarantee Service-Level Agreements and Reduce Resource Costs",

abstract = "Auto-Scaling cloud resources aim at responding to application demands by automatically scaling the compute resources at runtime to guarantee service-level agreements (SLAs) and reduce resource costs. Existing approaches often resort to predefined sets of rules to add/remove resources depending on the application usage. However, optimal adaptation rules are difficult to devise and generalize. A proactive approach is proposed to perform auto-scaling cloud resources in response to dynamic traffic changes. This paper applies Long Short-Term Memory (LSTM) to predicting the accurate number of requests in the next time and applies Reinforcement Learning (RL) to obtaining the optimal action to scale in or scale out virtual machines. To validate the proposal, experiments under two real-world workload traces are conducted, and the results show that the approach can ensure virtual machines to work steadily and can reduce SLA violations by up to 10%-30% compared with other approaches.",

author = "Jiang Zhong and Saisai Duan and Qing Li",

note = "Publisher Copyright: {\textcopyright} 2019 IOP Publishing Ltd. All rights reserved.; 2019 4th International Conference on Intelligent Computing and Signal Processing, ICSP 2019 ; Conference date: 29-03-2019 Through 31-03-2019",

year = "2019",

month = jul,

day = "12",

doi = "10.1088/1742-6596/1237/2/022033",

language = "英语",

volume = "1237",

journal = "Journal of Physics: Conference Series",

issn = "1742-6588",

publisher = "IOP Publishing Ltd.",

number = "2",

}

TY - JOUR

T1 - Auto-Scaling Cloud Resources using LSTM and Reinforcement Learning to Guarantee Service-Level Agreements and Reduce Resource Costs

AU - Zhong, Jiang

AU - Duan, Saisai

AU - Li, Qing

PY - 2019/7/12

Y1 - 2019/7/12

N2 - Auto-Scaling cloud resources aim at responding to application demands by automatically scaling the compute resources at runtime to guarantee service-level agreements (SLAs) and reduce resource costs. Existing approaches often resort to predefined sets of rules to add/remove resources depending on the application usage. However, optimal adaptation rules are difficult to devise and generalize. A proactive approach is proposed to perform auto-scaling cloud resources in response to dynamic traffic changes. This paper applies Long Short-Term Memory (LSTM) to predicting the accurate number of requests in the next time and applies Reinforcement Learning (RL) to obtaining the optimal action to scale in or scale out virtual machines. To validate the proposal, experiments under two real-world workload traces are conducted, and the results show that the approach can ensure virtual machines to work steadily and can reduce SLA violations by up to 10%-30% compared with other approaches.

AB - Auto-Scaling cloud resources aim at responding to application demands by automatically scaling the compute resources at runtime to guarantee service-level agreements (SLAs) and reduce resource costs. Existing approaches often resort to predefined sets of rules to add/remove resources depending on the application usage. However, optimal adaptation rules are difficult to devise and generalize. A proactive approach is proposed to perform auto-scaling cloud resources in response to dynamic traffic changes. This paper applies Long Short-Term Memory (LSTM) to predicting the accurate number of requests in the next time and applies Reinforcement Learning (RL) to obtaining the optimal action to scale in or scale out virtual machines. To validate the proposal, experiments under two real-world workload traces are conducted, and the results show that the approach can ensure virtual machines to work steadily and can reduce SLA violations by up to 10%-30% compared with other approaches.

UR - http://www.scopus.com/inward/record.url?scp=85070272241&partnerID=8YFLogxK

U2 - 10.1088/1742-6596/1237/2/022033

DO - 10.1088/1742-6596/1237/2/022033

M3 - 会议文章

AN - SCOPUS:85070272241

SN - 1742-6588

VL - 1237

JO - Journal of Physics: Conference Series

JF - Journal of Physics: Conference Series

IS - 2

M1 - 022033

T2 - 2019 4th International Conference on Intelligent Computing and Signal Processing, ICSP 2019

Y2 - 29 March 2019 through 31 March 2019

ER -

Auto-Scaling Cloud Resources using LSTM and Reinforcement Learning to Guarantee Service-Level Agreements and Reduce Resource Costs

摘要

访问文件

其它文件与链接

指纹

引用此