Professional Documents
Culture Documents
Research On Cloud Computing Load Forecasting Based On LSTM-ARIMA Combined Model
Research On Cloud Computing Load Forecasting Based On LSTM-ARIMA Combined Model
Abstract—With the continuous development of cloud At present, load forecasting models can be mainly divided
computing technology, the change of cloud computing resource into two categories: single forecasting models and combined
load presents more and more complex characteristics, and forecasting models based on ensemble learning. Yan et al. [1]
efficient load prediction has become a key technology to solve the used the integrated moving average autoregression model (auto-
imbalance of cloud computing resource utilization. Aiming at the regression and moving average model, ARIMA) algorithm to
problem of low prediction performance of the current load perform software The prediction of the resource consumption of
prediction model, a combined prediction model LSAR based on the system is carried out by comparing the prediction effect of
long short-term memory network LSTM and autoregressive the ARIMA algorithm, the support vector machine and the
moving average model ARIMA is proposed by comprehensively
artificial neural network in different scenarios. Calheiros et al. [2]
considering the factors of prediction accuracy and prediction time.
Compared with the traditional load prediction models ARIMA
designed a cloud platform workload prediction module based on
and LSTM, the open data set was used for the experiment. The the ARIMA algorithm, and actively configures dynamic
experimental results show that the prediction accuracy of the resources based on the prediction data of the model, which can
cloud computing resource combination prediction model is ensure the user's application service quality with as little
significantly higher than that of other prediction models, and the resource cost as possible. Shetty et al. [3] used multi-model
real-time prediction error of resource load in the cloud fusion to predict the load of cloud computing resources. The
environment is significantly reduced. experiment divided the load data set into training set and test set,
and used exponential smoothing, ARIMA, neural network and
other prediction models to calculate the average of the training
set. Based on the square error, the prediction results obtained
Keywords—cloud computing, load forecasting, combined from the test set are divided into weights, and finally the
forecasting model, long short-term memory network (LSTM) combined prediction value is obtained. Prassanna et al. [4] used
the HoltWinters algorithm to predict the workload of the cloud
environment. The mentioned algorithm can fit the historical time
I. INTRODUCTION series trend curve by establishing a mathematical model. It is
With the advent of the cloud computing era, more and more prone to the problem that the parameters are difficult to confirm.
companies have migrated their applications to the cloud The parameters are set by their own experience, so the final
computing platform. Cloud computing uses virtualization accuracy of the model is greatly affected by the parameters. At
technology to integrate discrete resources into resource pools, the same time, it is difficult for the model to learn the complex
and realizes user-to-computing in an on-demand manner. Elastic laws inside the time series data. Zhang et al. [5] used the
requirements of resources. The essence of cloud computing is to Recursive Neural Network (RNN) to predict cloud workloads,
provide services on demand to realize automatic allocation and and verified the accuracy of the method through experiments on
switching of application resources on the cloud platform. To the Google Cloud Trace dataset. Babu et al. [6] adopted the
achieve dynamic scaling of computing power, load balancing combined forecasting model of ARIMA-ANN to perform linear
technology is required. Cloud computing platforms use load and nonlinear forecasting on time series data, respectively, and
balancing technology to increase processing capacity , which combine the final results. The experimental results show that the
strengthens data processing capabilities. Load is a key indicator mixed models have higher prediction accuracy.
required to achieve load balancing. Accurately predicting At present, there are various services running in cloud
application load can achieve peak traffic expansion and traffic computing nodes, and the resource requirements of various
shrinking of application capacity, thereby saving resource costs, services are also different.The load time series data usually has
reducing manual intervention, and improving cloud computing a combination of linear and nonlinear characteristics during the
platforms. reliability and performance. operation. Most of the existing prediction models use a single
prediction model, or use the prediction results of multiple
The work was supported by The National Natural Science Foundation of China (No.62262011) and The Natural Science Foundation of Guangxi
˄No.2021JJA170130˅
20
Authorized licensed use limited to: PES University Bengaluru. Downloaded on March 04,2024 at 17:22:28 UTC from IEEE Xplore. Restrictions apply.
3.4 Evaluation Criteria for Prediction Models
Time series data
This paper uses mean absolute error (MAE), mean absolute
percentage error (MAPE), mean square error (MSE), root mean
square error (RMASE) and coefficient of determination (R2) to
evaluate the prediction results. The calculation methods are as
follows: Formulas (9)~(13) are shown:
1 n
MAE ¦ | yi yˆi |
ni1
ARIMA load forecasting LSTM load prediction
˄9˅ model model
1 n yi yˆ i
MAPE ¦|
n i 1 yi
|
˄10˅
Predicted load value Predicted load value
1 n
MSE ¦
ni1
( yi yˆi ) 2
˄11˅
1 n
RMSE ¦ ( yi yˆi)2
ni1 ˄12˅
n
¦ ( y yˆ ) 2
CRITIC Data Fusion
i i
R2 1 i 1
n
¦ ( y y)
i 1
i
2
˄13˅
Combined predicted value
21
Authorized licensed use limited to: PES University Bengaluru. Downloaded on March 04,2024 at 17:22:28 UTC from IEEE Xplore. Restrictions apply.
IV. EXPERIMENTAL RESULTS AND ANALYSIS TABLE I. COMPARISON OF PREDICTION PERFORMANCE
濣瀅濸濷濼濶瀇濼瀂瀁澳 濘瀉濴濿瀈濴瀇濼瀂瀁澳瀆瀇濴瀁濷濴瀅濷澳
濅
4.1 Experimental data 瀀瀂濷濸濿澳 濠濔濘澳 濠濔濣濘澳 濠濦濘澳 濥濠濦濘澳 濥澳
In order to verify the validity of the cloud computing 濔濥濜濠濔澳 濅濁濆濊濄濉澳 濃濁濃濌濈濈澳 濌濁濋濌濊濄澳 濆濁濄濇濉澳 濃濁濉濃濋濋澳
resource load prediction model proposed in this paper, the public 澳
data set provided by The Grid Workloads Archive website is 濟濦濧濠澳 濄濁濉濄濋濉澳 濃濁濃濉濊濅澳 濈濁濅濃濆濄澳 濅濁濅濋濄濄澳 濃濁濊濉濇濇澳
used to verify it experimentally. Using the load time series data 澳
provided by Materna, the data set records a grid workload every 濟濦濔濥澳 濃濁濌濉濅濉澳 濃濁濃濆濋濄澳 濅濁濇濌濊濇澳 濄濁濈濋濃濇澳 濃濁濌濉濃濊澳
5 minutes.
4.2 Analysis of experimental results
In order to test and evaluate the performance of the load From the figure, all models and the original sequence are
prediction model (LSAR) proposed in this paper, the original basically consistent in trend. However, from the perspective of
data TEST is compared with the single model ARIMA and prediction accuracy, the single load prediction model ARIMA
LSTM, and it is also compared with the combined model and LSTM are obviously inferior to the combined prediction
LSTM-ARIMA(LSAR). model LSAR. Compared with the traditional load prediction
model, the load prediction model LSAR proposed in this paper
The actual workload value in November 2010 was selected has a great improvement in accuracy. At the same time, it can be
from the data set, and the weights of CPU utilization and seen from the experimental results that the LSAR model
memory utilization were set to 0.4 and 0.6 respectively by proposed in this paper has high accuracy, and the error has been
analyzing the load data, and a piece of data was obtained every further corrected on the basis of the combined prediction model,
5 minutes, totaling 18,800 pieces of data. The first 70% of the which can further ensure the accuracy of the prediction data.
data is selected as the training set, 20% of the data is used as the
single-step test set, and the remaining data is used to verify the The load prediction of cloud computing resources mainly
generalization ability of the model. The results before and after refers to the changes in the load of cloud computing resources
model load prediction are shown in Figure 2, and Table1 gives for a period of time in the future. And from the prediction results,
the prediction error indicators of each model. the model proposed in this paper can not only accurately predict
the cloud computing resource load, but also the prediction model
has a certain generalization ability.
V. CONCLUSION
22
Authorized licensed use limited to: PES University Bengaluru. Downloaded on March 04,2024 at 17:22:28 UTC from IEEE Xplore. Restrictions apply.
REFERENCES
23
Authorized licensed use limited to: PES University Bengaluru. Downloaded on March 04,2024 at 17:22:28 UTC from IEEE Xplore. Restrictions apply.