Professional Documents
Culture Documents
LSTM-based Indoor Air Temperature Prediction Framework For HVAC Systems in Smart Buildings
LSTM-based Indoor Air Temperature Prediction Framework For HVAC Systems in Smart Buildings
https://doi.org/10.1007/s00521-020-04926-3(0123456789().,-volV)(0123456789().
,- volV)
ORIGINAL ARTICLE
Mohamed Cheriet1
Received: 21 November 2019 / Accepted: 6 April 2020 / Published online: 4 May 2020
Springer-Verlag London Ltd., part of Springer Nature 2020
Abstract
Accurate indoor air temperature (IAT) predictions for heating, ventilation, and air conditioning (HVAC) systems are
challenging, especially for multi-zone building and for different HVAC system types. Moreover, the nonlinearity of the
buildings thermal dynamics makes the IAT prediction more difficult since it is affected by complex factors such as
controlled and uncontrolled points, outside weather conditions and occupancy schedule. This paper presents a long short-
term memory (LSTM) model to predict IAT for multi-zone building based on direct multi-step prediction with sequence-
to-sequence approach. Two strategies, LSTM-MISO and LSTM-MIMO, are built for multi-input single-output and multi-
input multi-output, respectively. The performance of these two strategies has been evaluated based on two case studies on
real smart buildings using variable air volume (VAV) and constant air volume (CAV) systems. For both buildings,
experimental results showed that the LSTM models outperform multilayer perceptron models by reducing the prediction
error by 50%.
123
17570 Neural Computing and Applications (2020) 32:17569–17585
data-driven approaches is to eliminate the cost and time- parameters to increase the performance of multi-step pre-
consuming task to build IAT physics-based models. diction models; (3) the validation of the proposed frame-
Besides, IAT model is easy to implement for a multi-zone work in two real cases with the industrial data.
system using artificial intelligence (AI) techniques since it This paper is structured as follows: Section 2 summa-
can deal with nonlinearity in the system [11]. rizes the prior work related to our research. Section 3
Recently, artificial neural networks (ANN) and nonlin- describes the two types of buildings used in this study.
ear autoregressive network with exogenous inputs Section 4 illustrates the detailed methodology of our pro-
(NNARX) have been extensively used to model indoor posed data-driven framework for modeling IAT for both
environments [12–14]. However, prior work generally MISO and MIMO architectures using S2S approach. Sec-
adopted a recursive prediction strategy for predicting tion 5 discusses the experimental results and compares the
multi-step ahead [14–16]. For instance, the model predicts performance of the proposed models. Finally, we conclude
time t and uses the predicted value to predict t þ 1, and key findings and present future research directions.
recursively until t þ p, where p is the time horizon. This
method accumulates prediction errors at each time step.
Furthermore, the ANN method considers each input as an 2 Related work
independent parameter. It ignores the time dependency
between sequential values. Unlike many studies which The main advantage of a data-driven approach is to reduce
used ANN with multilayer perceptrons (MLP) models or the cost and time-consumed by traditional physics-based
NNARX, this paper investigates a time series approach for techniques. Moreover, the data-driven approaches can deal
multi-step prediction model using long short-term memory with nonlinearity, incomplete or noisy data [11]. Machine-
(LSTM) which is shown to be an accurate forecasting learning algorithms have been applied to design dynamic
method for time series data [17, 18]. In this paper, two models of the HVAC system. For instance, the multi-step
prediction structures are used to model IAT in multi-zone prediction of IAT can be used in a predictive control
with multi-step prediction: the LSTM-MISO architecture approach and then leads to improving the thermal comfort
uses multi-input single-output to predict the temperature and decreasing the energy consumption of buildings [17].
for each zone; the LSTM-MIMO architecture uses multi-
input multi-output to predict IAT for all zones simultane- The key common algorithms applied in data-driven
ously with only one model. This strategy shows clear approaches that have been used extensively in the building
advantages compared to prediction models presented in sector for IAT modeling are regression trees [23], random
prior work [12, 14, 19]. The proposed LSTM framework is forests [24], nonlinear autoregressive network with
based on a direct sequence-to-sequence (S2S) multivariate exogenous inputs (NARX) [7], NNARX [14], ANN [13]
multi-step time series prediction model, instead of the and recurrent neural networks (RNN) [25]. Table 1 sum-
recursive model, which is helpful to better control the marizes state-of-the-art of data-driven approaches. Jain
HVAC system’s operation. et al. [23] combine multi-output regression trees to repre-
The key problem to be addressed in this paper is an IAT sent the system’s dynamics. Yet, the modeling accuracy
prediction model used for a CAV system cannot be applied to using single trees to constitute multi-step prediction for
a VAV system or vice versa without decreasing the perfor- zone temperature is strongly affected by over-fitting and
mance. Therefore, a general modeling process considers high variance. The authors in [24] model the temperature
different properties of both systems are required to save time by a set of linear regression models, which change after
and cost particularly when there are many HVAC systems to each time step. They model their system with regression
control inside a large-scale building. Our proposed models trees to predict temperature for multi-zone using MISO
have been tested in two different types of buildings: the first structure and extend them to a random forest model.
floor of a hotel in Montreal with five VAVs systems and a However, their model is complicated and time-consuming.
small retail store with three zones supplied by three CAVs Nowadays, the ANN model has been widely applied for
systems. A general framework with specific feature selection several type of applications in HVAC sector, such as Fault
approaches which can adapt to VAV and CAV types has Detection and Diagnostics (FDD) [20], thermal comfort
been defined. Furthermore, we evaluated the effect of tuning approximation [21] and IAT prediction [12, 13]. Du et al.
model hyperparameters in both systems to increase the [20] developed ANN-based tool to detect faults in the
accuracy of the prediction model. supply air temperature control loop in commercial building
The contributions of this paper are: (1) a data-driven with VAV systems. The authors used a combined neural
framework for modeling IAT with LSTM-MISO and networks model which includes the basic neural networks
LSTM-MIMO models based on the S2S approach; (2) and auxiliary neural networks to detect faults and then used
different settings of previous time step and input clustering approach for classification to diagnose the fault
123
Neural Computing and Applications (2020) 32:17569–17585 17571
sources. The proposed models diagnose the faults using using MISO structure are proposed by [15]. Zeng et al. [16]
context information related to the monitoring parameters developed an optimal control of multi-zones VAV system.
such as supply chilled water temperature, return chilled They elaborated a data-driven predictive model using MLP
water temperature, chilled water flow rate and chilled water to predict the environmental conditions of each zone and
valve position. The principal component analysis is carried optimize energy consumption. The IAT is predicted with
out to analyze the contributions of these parameters in the only one step ahead. Moreover, multi-step prediction is
supply temperature control loop. The occurrence of faults necessary to lead a real-time implementation in the control
is computed according to a combined relative error and its phase. Moreover, only two control parameters were used as
threshold. Castilla et al. [21] proposed a context-aware inputs in the prediction model in [15, 16]. Neither specific
neural networks model using human and environmental feature selection methodology nor model tuning approach
variables for approximating thermal comfort evaluation for was implemented. The context information like weather
HVAC systems. Their model avoids the costs involved in data, control parameters and other external factors might
calculating the classical predictive mean vote (PMV) index improve the future prediction. Liang et al. [19] designed a
in terms of the computation time and the extensive sensor framework named UrbanFM based on deep neural net-
network size required to collect the input data. Moreover, it works. UrbanFM is composed of two models, an inference
allows the use of PMV index within real-time model pre- network component and an external factor fusion compo-
dictive control framework. Attoue and al. [13] developed a nent. The inference network component generates a fine-
simple ANN-MISO model to predict indoor temperature grained flow from coarse-grained inputs by using a novel
for different forecasting time steps. They proposed a feature extraction and distributional upsampling modules.
methodology based on the selection of pertinent input The external factor fusion component handles the context
parameters from a large set of features. Their experimental information (like the day of the week, time of the day,
results show that outdoor and facade temperature data weather, other external factors) to capture near and distant
provide good forecasting results of indoor temperatures. spatio-temporal dependencies. This component plays an
Moreover, their results show that predictions were accurate important role in providing a prior knowledge and
for up to 2 h. However, the predictions have unsatisfactory improves the inference performance under sparse sam-
accuracy for more than 4-h forecast ahead. Huang et al. pling. A feature selection investigation is done to find the
[12] develop an hybrid MPC based on neural network exact number of control parameters that must be included
feedback-linearization model to predict IAT over 6 h as input in the predictive model to increase the accuracy.
ahead. The goal of this approach is to linearize the system For example, the CAV system has only basic control
using the neural network through feedback to build non- parameters like heating stages, cooling stages and fan
linear functions approximation. The type of HVAC system stages. On the other hand, the VAV systems have more
used in [12] is designed with constant-air volume (CAV). parameters like supply air temperature, damper position,
Prior studies also model IAT for a VAV system [7, 16]. etc. In the literature, IAT prediction models have been
An indoor air temperature prediction models of multi-zone proposed for either the CAV or VAV systems, but to the
123
17572 Neural Computing and Applications (2020) 32:17569–17585
best of our knowledge, no prediction model has been tested week) in their models. A few studies have investigated the
or adapted on both types of systems at the same time. In usefulness of LSTM for IAT multi-step predictions in the
addition, the development of multiple models for multi- HVAC system [17]. An LSTM prediction model with
zones is time-consuming. Yet, the size of the control MISO structure was proposed by [17] to predict IAT until
problem grows rapidly as the number of air handling units 30 min using a recursive prediction approach. Neverthe-
(AHUs) and controlled zones increases. A multi-zone less, their proposed model did not show clear advantages
modeling approach using the MLP-MIMO model was compared to the traditional prediction model like SVM and
proposed by [10] to forecast 2-h ahead temperature inside decision tree. Indeed, the use of recursive prediction can
an open space commercial building. Afroz et al. [7] predict decrease the prediction performance. This paper system-
IAT in multi-zone buildings using a different tuned model atically investigates the performance of direct multi-step
based on NARX model. The authors use MISO architecture prediction approach using LSTM-MISO and LSTM-MIMO
to predict one step ahead then MIMO architecture to pre- architectures in CAV and VAV HVAC systems.
dict multi-step ahead for the same zone. In this paper we
use MIMO architecture to predict multi-zone with multi-
steps ahead. Delcroix et al. [14] predicted the behaviors of 3 Smart building models
IAT using NNARX-MISO. They carried out their experi-
ments with the same type of CAV-building used in this 3.1 CAV-building
paper, and they compared their results with the gray-box
model and the linear autoregressive model with exogenous CAV systems supply air at a constant volume and variable
inputs (ARX). Their comparisons show the NNARX model temperature [27]. Figure 1a illustrates one of three zones of
achieves the highest performance the alternative models. retail store with a CAV system. The fresh air entering from
However, the authors assume that the future exogenous outside is mixed with the return air to produce fresher air to
inputs (control parameters, outdoor temperature, etc.) are the fan ventilation. The speed of the fan is fixed, and it is
known, which cannot be true in the real case. All [7, 10] controlled by an (ON/OFF) switch. Thus, even for part-
and [14] develop a one-step forecasting model and then use load conditions, the fans use the maximum energy that
a recursive multi-step forecasting strategy to predict the leads to wasting energy. The mixed air is conditioned by
future. Such recursive strategy has two major limitations. the heating/cooling coils and then distributed to the zones
First, it accumulates prediction errors, because prediction through different diffusers. There is no controllable ter-
values are used instead of real observations. This method minal damper in the conditioned area zone. The terminal is
degrades the performance of the model as the number of set at a fixed opening level and cannot be actively con-
future steps increases. Second, it considers each input trolled. The air returns from the zone to the roof top unit;
parameter as an independent parameter. Consequently, the then, it can be rejected outside or mixed with the fresh air.
temporal dependency among continuous variables is The IAT and humidity are monitored by sensors. There is
ignored, while the historical data collected from HVAC significant thermal coupling between the three zones and
systems are generally time series data. These limitations with the neighbor spaces. The CAV system uses five
can be tackled by adopting a technique that takes into controlled points for each zone i, to maintain the IAT at the
account the temporal relationship among input parameters comfort value, the fan ventilation stage (FVS), two heating
and using a direct prediction approach to avoid the problem stages (HS1 and HS2 ) and two cooling stages (CS1 and CS2 )
of error accumulation. A powerful solution for modeling as described in Table 2 and enumerated by 1, 2 and 3,
sequence dependency is RNN models. LSTM network is a respectively, in Fig. 1a. The fan, cooling and heating stages
RNN that overcomes the problem of training a recurrent of the CAV are controlled by the heating and cooling set
network with the architecture of learnable gates. LSTM had points versus the zone’s temperature. The total power use
been found suitable for electric consumption, prices fore- of each CAV system in monitoring time Dt is calculated
cast and also for emission factor prediction to schedule according to the stages of control parameters, and it is
appliances use in the smart house domain [18, 26]. stated as:
Recently, a LSTM-RNN has been proposed in [22] for 2 X
X Dt
predicting the photovoltaic power. Specifically, the authors Pi ðtÞ ¼ ðCSji ðtÞ Cc þ HSji ðtÞ Ch Þ
compare the prediction accuracy of five different LSTM j¼1 t¼0
ð1Þ
models. Their results show LSTM for regression with time X
Dt
f
steps and LSTM for regression using the window technique þ FVSi ðtÞ C ; 8i
to achieve the best performance. However, the authors do t¼0
123
Neural Computing and Applications (2020) 32:17569–17585 17573
Fig. 1 Schematic diagram of air handling unit in both buildings with CAV and VAV systems: (a) represents CAV-building and (b) represents
VAV-building
where Pi ðtÞ represents the total amount of power at time t kW of the cooling equipment i, Ch is the capacity in kW of
for zone i, CSji ðtÞ ¼ f0; 1g is the stage j of the cooling unit the heating equipment i, and C f is the capacity in kW of the
of zone i at time t, HSji ðtÞ ¼ f0; 1g is the stage j of the fan unit. The total amount of power Pi ðtÞ has a positive
heating unit of zone i at time t, FVSi ðtÞ ¼ f0; 1g is the influence on the prediction model as described in Table 5
stage of the fan unit of zone i at time t, C c is the capacity in and will be used as input to the model.
123
17574 Neural Computing and Applications (2020) 32:17569–17585
3.2 VAV-building dependencies of the elements in the AHU system and the
high number of control parameters as described in Table 2.
Figure 1b shows a VAV system deployed in the ground The action taken by each control parameter can change the
floor of a hotel. It has one air handling unit (AHU), AC1, IAT value. The VAV system is controlled by eleven con-
which serves a conditioned area with five zones. The main trol points, to ensure the heating/cooling demand and sat-
components of the AHU are cooling/heating coils and an isfy the thermal comfort. As described in Table 2, six
air supply fan represented by system components number control points in AC1 are represented by the set of system
12, 6 and 13 in Fig. 1b, respectively. Each zone has a components numbered from 1 to 6 and the other five
variable air volume (VAV) box and a duct heater con- control points in each zone i are numbered from 7 to 11.
trolled by the control parameters, heating/cooling stage 1 The damper position of each zone i has a direct impact on
and 2, defined by the system components number 8 and 9, energy load, calculated by Eq. 2, where Di ðtÞ represents
respectively. The AHU circulates the air in its zones by the opening level of each VAV box damper of each zone i
supplying fresh air from outside that is passed by the fresh and q is the fan demand in (kW) of the air handling unit
air damper to control how much fresh air is needed. The AC1. In other words, this equation gives the power used by
fresh air damper and exhaust air are modulated together to each opening action for each VAV box damper. This
maintain the fluidity of the pressure of the circulating air in parameter is an uncontrolled variable and will be used as
the system ducts. The fresh air is then conditioned by input to the prediction model as described in Table 3.
heating or cooling as per the zone demand by the heat- !
Di ðtÞ
ing/cooling coils in the supply duct. Since the system’s fan Pi ðtÞ ¼ PN q; 8t ð2Þ
is controlled by a binary switch, not by a variable fre- i¼1 Di ðtÞ
quency drive (VFD), the pressure in the ducts is regulated
by the bypass air damper (BAD). The BAD manages the
change in air pressure that would occur due to the different
4 Data-driven framework for modeling IAT
VAVs damper discrete opening levels in each zone Di ðtÞ
(from 0 to 100%). If a specific zone demands more cooling/
4.1 Preprocessing methodology
heating, the cooling/heating coils in that respective VAV’s
zone will operate to meet the required temperature set point
4.1.1 Data collection and feature selection
[27]. The mixing air damper (MAD) is used as a form of
energy recovery mechanism since the returning air has
The data were collected from November 1, 2018, at 12
already been conditioned when it was in the supply duct.
a.m., to March 31, 2019, at 11:59 p.m., with 10-min
The MAD is modulated if the return temperature is within
sampling intervals. Since not all the sensors are sending the
the desired range, whether it is cooling or heating season
data at the same time step, we developed preprocessing
when compared to the fresh air temperature to precondition
algorithms to sample and interpolate the data every 10 min.
the fresh air before it is exposed to the cooling/heating coils
A feature selection process is executed to obtain a set of
so that achieving the desired temperature will cost less.
principal variables using Extra Trees classifier and corre-
The VAV system can save more energy compared to
lation techniques. We apply this method to each building
CAV [28]. However, it is difficult to accurately model the
datasets. Significant features are selected as input to predict
IAT variations in VAV systems regarding the complex
the IAT, and they are summarized in Table 3. In order to
123
Neural Computing and Applications (2020) 32:17569–17585 17575
build predictive models, the selected features including 4.1.2 Model training organization for multi-step prediction
context information are categorized into three groups:
After data preprocessing steps, the dataset was split into
• Controlled features: Includes current control actions
randomly sampled training (60 % of the data), validation
that impact HVAC system operations. The set vi ðtÞ is
(20 % of the data) and testing (20 % of the data) sets from
the vector of the control variables at time t for each
five months of data. Models are developed using the
zone i as depicted in Table 2. The set of control
training and validation datasets, and the prediction results
variables depends on the type of building as shown in
are from the test dataset. The IAT can be easily predicted in
Sect. 3.
short term with only 10 min ahead prediction since they
• Uncontrolled features: The set ui ðtÞ is the vector of the
follow a slow dynamic process. However, a long-term
measurable input variables at time t for each zone
prediction is needed for optimal operational of the HVAC
i (e.g., the supply air temperature, the outdoor air
system. To create the dataset to train the multi-step pre-
temperature, the impact of damper position on temper-
diction model, we test different configurations, with a
ature value, etc.).
different number of previous time steps and future time
• Target features: The set yi ðtÞ is the vector Tiin ðtÞ of IAT
steps. In this paper, we looked at multivariate inputs for
for each zone i.
multi-step time series forecasting model. The multivariate
We add the hour of day and day of the week as inputs. The multi-step time series forecasting is a challenging task,
hour input leads to know the difference between tempera- especially in the preparation of time series data and the
ture profile during occupancy and unoccupancy time. The definition of the shape of multiple inputs and multi-step
day input leads to distinguish between business and outputs for the model. Therefore, the time series data are
weekend days. All selected features are normalized transformed into a supervised learning problem. Each
between 0 and 1 before they are used for training, to pre- dataset of each building was manipulated and re-scaled in
vent the dominant effect of particular variables. The such a way that it can feed the prediction model. The
equation in (3) is used to scale the variables into [0, 1]. multivariate time series dataset for each zone i is defined
x xmin as:
xscale ¼ ð3Þ
xmax xmin
where xscale represents the scaled variable, x defines the
variable value before scaling, xmin and xmax are the mini-
mum and maximum values of the dataset to be scaled,
respectively.
123
17576 Neural Computing and Applications (2020) 32:17569–17585
123
Neural Computing and Applications (2020) 32:17569–17585 17577
123
17578 Neural Computing and Applications (2020) 32:17569–17585
123
Neural Computing and Applications (2020) 32:17569–17585 17579
and can be bigger than MAE for outliers. MAE is a com- 5.1 Feature selection experiments
monly used metric to evaluate forecast accuracy and cor-
responds to the mean value of the sum of absolute Since we predict from 30 min to 6 h, we assume that 2 h of
differences between actual and forecast values. future time steps prediction can give a good understanding
n of what features must be used to improve the prediction
100 X
yi y^i
MAPE ¼ ð12Þ performance. Experiments have been done with three
n y
i¼1 i
datasets having no HVAC control parameters (VF1, CF1
sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
1X n and CF2), and five datasets having controls (VF2, VF3,
RMSE ¼ ðyk y^k Þ2 ð13Þ VF4, CF3 and CF4). These datasets are built from existing
n k¼1
VAV and CAV datasets by adding or removing some
1X n features. With each dataset, we evaluate the performance of
MAE ¼ jyk y^k j ð14Þ LSTM and two baseline (MLP and NNARX) models using
n k¼1
different metrics. Table 5 shows, in the case of zone in
where y and y^ define the real and predicted outputs, VAV-building, some features have positive impacts on the
respectively, and n is the total observation number. LSTM and NNARX models, but no impact on the MLP
model. The bold values indicate the minimum errors. We
notice that adding input features of control parameters for
5 Results and discussion the VAV (VF2) and for the AC1 (VF3) separately, has
negative impacts on MLP, NNARX and LSTM. Further-
The performance of multi-step IAT prediction models is more, models using dataset including input features of
evaluated on LSTM, NNARX and MLP models applied on control parameters for both AC1 and VAV (VF4) provide
real data collected from hotel and retail store in Montreal the best prediction performance for LSTM and NNARX.
with VAV and CAV system, respectively. Five zones are However, MLP gives the best performance when control
investigated for a VAV-building and three zones for a parameters are not included in the dataset (VF1), but it
CAV-building. Only winter season data are considered in achieves the lowest performance comparing to VF4. We
this study, but the same models can be applied to other notice that the missing of control features in the dataset
seasons in further studies. The choice of the number of CF1 in the case of CAV-building decreases the prediction.
previous steps for the training model is not obvious, and it Moreover, adding control variables to the model CF4 for
can affect the performance of the prediction. An example CAV-buildings increases the prediction performance of
of the effect of using a different number of previous time LSTM and NNARX models. The temperature behavior
steps to predict 2 h in zone 1–2 in VAV-building is shown depends on the time of the action taken by the control
in Table 4. It is observed that the use of 10 h to predict 2 h parameters. Unlike MLP, the LSTM model takes into
gives the best performance, bold values. More experiments account the sequence of time behavior, so including the
have been executed on different zones to find the optimal control variables as inputs in the LSTM model increases
number of previous steps for different number of prediction the performance of the long-term prediction accuracy.
steps ahead. Finally, we used last 1 h and 5 h to predict Similarly, NNARX includes the future control parameters
30 min and 1 h ahead, respectively. Furthermore, we to predict future steps, so the consideration of these data
consider ten previous hours to predict 2 h, 4 h and 6 h makes the prediction more accurate. Therefore, 13 features
ahead. In the following, we present the feature selection used in VF4 and the 9 features used in CF4 have a sig-
procedure and then we evaluate the performance of multi- nificant impact on the multi-step prediction model of IAT
step prediction models for MISO and MIMO architectures. for both single zone in VAV- and CAV-buildings.
123
17580 Neural Computing and Applications (2020) 32:17569–17585
VF1 Tiin , Pi ,Hdi , Tiout , h, d 0.8346 0.5145 0.7502 0.0851 0.0865 0.0615 0.1842 0.1142 0.1666
VF2 Tiin , Si ,Pi ,Hdi , Tiout , h, d , Di , HS1i , HS2i 0.8775 0.5385 0.6708 0.0868 0.0911 0.0527 0.1939 0.1193 0.1487
VF3 Tiin , Si ,Pi , Hdi ,Tiout , h, d , Di , MDa ,SFa , Ha 0.8465 0.5165 0.8633 0.0807 0.0821 0.0796 0.1871 0.1146 0.1921
VF4 Tiin , Si ,Pi , Hdi ,Tiout , h, d , Di , HS1i , 0.8431 0.4989 0.5694 0.0853 0.0803 0.0380 0.1861 0.1108 0.1262
HS2i ,MDa ,SFa , Ha
CAV-building MLP NNARX LSTM MLP NNARX LSTM MLP NNARX LSTM
CF1 Tiin , Hiin , Tiout , h, d 0.7857 0.5819 0.5740 0.0565 0.0608 0.0293 0.1617 0.1198 0.1181
CF2 Tiin , Hiin , Tiout , h, d, Pi 0.7743 0.5737 0.5606 0.0545 0.0595 0.0285 0.1593 0.1181 0.1155
CF3 Tiin , Hiin , Tiout , h, d, Pi , HS1i , HS2i 0.7585 0.5335 0.5505 0.0493 0.0525 0.0245 0.1561 0.1099 0.1140
CF4 Tiin , Hiin , Tiout , h, d, Pi , HS1i , HS2i , Fi 0.7375 0.4822 0.5043 0.0479 0.0476 0.0208 0.1519 0.1042 0.1
5.2 Performance evaluation of multi-step short period the improvement by applying LSTM is not
prediction models noticeable compared to baselines, but as soon as we go
further in time (2, 4 and 6 h) the improvement by LSTM
5.2.1 Single-zone prediction results gets more and more important. The MAPE and MAE
scores of NNARX and LSTM models are quite similar,
This section discusses the prediction performance of especially for 6-h prediction. In the case of CAV-buildings,
LSTM-MISO model. We used MLP, CANN, NNARX with the NNARX for 6-h prediction model gives the best MAPE
MISO structure as a baseline models. The prediction and MAE scores, as shown in Fig. 5a and b. It is because
accuracy of the all models is evaluated by comparing three NNARX uses the measured/real values of exogenous
metrics, MAPE, RMSE and MAE. The numerical results of inputs (control parameters and outdoor temperature) to
LSTM-MISO and baseline models and the different steps predict multi-step ahead [14]. NNARX performs the multi-
used are shown in details in Fig. 5. Figure 5a–c represents step prediction by feeding back the IAT prediction output
the MAPE, the MAE and the RMSE results, respectively, and the future exogenous input information as the input for
to predict 30 min, 1 h, 2 h, 4 h and 6 h for the zone in the next prediction. Indeed, we observe that RMSE of the
CAV-building. Figure 5d–f represents the MAPE, the NNARX model is higher than RMSE of LSTM and CANN
MAE and the RMSE results, respectively, to predict models because of the error propagation over time caused
30 min, 1 h, 2 h, 4 h and 6 h for the zone 2 in VAV- by the recursive prediction method used, since RMSE
building. We notice that, in the two cases of building, for a penalizes large error values. Figure 5a–c indicates that the
MLP CANN NNARX LSTM MLP CANN NNARX LSTM MLP CANN NNARX LSTM
1 0,1
0,2
RMSE (℃)
℃)
MAPE (%)
0,08
MAE (℃
0,8
0,15
0,6 0,06
0,1 0,04
0,4
0,2
0,05 0,02
0 0 0
0,5 1 2 4 6 0,5 1 2 4 6 0,5 1 2 4 6
Predicon steps (h) Predicon steps (h) Predicon steps (h)
0,12
MAE (℃)
0,2
0,8 0,1
0,15 0,08
0,6
0,1 0,06
0,4
0,04
0,2 0,05
0,02
0 0 0
0,5 1 2 4 6 0,5 1 2 4 6 0,5 1 2 4 6
Predicon steps (h) Predicon steps (h) Predicon steps (h)
123
Neural Computing and Applications (2020) 32:17569–17585 17581
error of LSTM-MISO model increases according to the deteriorates along with an increasing prediction horizon
time prediction steps in the CAV-building. However, it is compared to LSTM-MISO. In other words, the estimated
not the case for VAV-building, as shown in Fig. 5d–f values of NNARX-MISO generally deviate from measured
where the largest error occurs when the LSTM-MISO values for long-term prediction due to error accumulation
model tries to predict 4 h ahead in VAV-building. The overtime. On the other hand, Fig. 7c and d clearly shows
performance of MISO-LSTM to predict 6 h ahead is better LSTM model performance over a 6-h period is very similar
than to predict 4 h ahead in VAV-building. Figure 6 pre- to that of 30-min period. Consequently, the increasing
sents the results of 6-h ahead prediction of indoor tem- number of advanced time steps does not deteriorate the
perature for zone 2 and zone 1–2 in CAV and VAV- prediction performance of LSTM-MISO model due to the
building, respectively, using the LSTM-MISO and baseline direct prediction method.
approaches. It can be seen that LSTM-MISO and NNARX Prediction efficiency
models work reasonably well for both types of buildings We compare the prediction efficiency of LSTM and
and they are close to real measurements. However, for NNARX in terms of execution time. As depicted in
MLP and CANN models, the prediction variability follows Table 6, the execution time of NNARX is significantly
the real measures, but the predicted values move around higher than LSTM for both types of buildings. In a pre-
the real values. The high prediction error is due to the dictive control system, if the control horizon is 5 min and
neglect of time sequence dependencies. Although CANN the prediction horizon is 4 h, the execution time of IAT
model tries to capture the sequential relationship among prediction could not exceed the control horizon to be
inputs by using the external factor fusion component, it is included in the predictive control system. As shown in
less powerful than the gates mechanism used in LSTM Table 6, for 4-h prediction, the execution time of NNARX
model. Due to the proximate results of NNARX and LSTM is higher than 5 min. In this case, NNARX is not appro-
models, we study further the standard deviation of error priate for a predictive control system against LSTM should
and prediction efficiency of both models. be used because its execution time is 15 seconds only.
Standard deviation of error
Figure 7a–d shows the mean error in terms of temper- 5.2.2 Multi-zone prediction results
ature for each time step predicted on the 20% of testing
data and the standard deviation of this error for LSTM- In this section, we implement a multi-zone model which
MISO and NNARX-MISO models in CAV and VAV- predicts the IAT for all the zones in the building using all
building. We notice the error of LSTM-MISO in under control parameters vectors as input. Figure 8 shows a
0.35 C for both types of buildings. In the same time, the comparison of experimental results between LSTM-MISO
error of NNARX is over 0.45 C. In reality, the precision of models and LSTM-MIMO models. It describes the mean
temperature sensors is often 0:5 C, and this value is errors of all investigated zones of each building for single-
considered as an appropriate benchmark for the accuracy of zone and multi-zone prediction models. It shows that in the
models [33]. Although the error of NNARX is within the case of CAV-building, the MAPE for LSTM-MIMO model
error benchmark, it is less accurate than LSTM. An is better than LSTM-MISO model. Furthermore, the
important remark is the performance of NNARX-MISO MAPE, MAE and RMSE errors for all three zones
Fig. 6 Results for 6 h ahead for indoor temperature prediction in: (a) zone 2 in a CAV-building and (b) zone in a VAV-building
123
17582 Neural Computing and Applications (2020) 32:17569–17585
(a) N N A RX-M ISO z o n e 2 C AV-b u ild in g (b) N N A RX-M ISO z o n e 1-2 VAV-b u ild in g
(c) LSTM -M ISO z o n e 2 C AV-b u ild in g (d) LSTM -M ISO z o n e 1-2 VAV-b u ild in g
Fig. 7 The standard deviation of error for MISO and MIMO models for CAV- and VAV-buildings for 6-h prediction ahead: (a, c, e) CAV-
building and (b, d, f) VAV-building
123
Neural Computing and Applications (2020) 32:17569–17585 17583
1 0,2 0,09
0,9 0,18 0,08
0,8 0,16 0,07
0,7 0,14
decreased after the LSTM-MIMO model is used for 30-min MISO MIMO
and 6-h prediction. For instance, it can be observed that the Time (ms/sample) 3,5
MAPE reduces with more than 0.3% on average to 6-h 3
prediction ahead. We notice that in Fig. 7e, the mean error 2,5
123
17584 Neural Computing and Applications (2020) 32:17569–17585
effect of thermal coupling between adjacent zones in CAV- 13. Attoue N, Shahrour I, Younes R (2018) Smart building: use of the
building because of its open space area, it is found that the artificial neural network approach for indoor temperature fore-
casting. Energies 11(2):395
overall prediction accuracy increases using the MIMO 14. Delcroix B, Le Ny J, Bernier M, Azam M, Qu B, Venne J-S
model. We can conclude that LSTM-MIMO is a valuable (2020) Autoregressive neural networks with exogenous variables
method for modeling IAT in a lightweight building with for indoor temperature prediction in buildings. Build Simul.
CAV type of HVAC system. In future work, the proposed https://doi.org/10.1007/s12273-019-0597-2
15. He X, Zhang Z, Kusiak A (2014) Performance optimization of
LSTM-based data-driven framework for multi-step pre- hvac systems with computational intelligence algorithms. Energy
diction ahead of IAT will be used to design predictive Build 81:371–380
control approaches in order to decrease energy consump- 16. Zeng Y, Zhang Z, Kusiak A (2015) Predictive modeling and
tion, e.g., load-shifting control, demand-limiting control or optimization of a multi-zone hvac system with data mining and
firefly algorithms. Energy 86:393–402
optimal start—stop time control. 17. Xu C, Chen H, Wang J, Guo Y, Yuan Y (2019) Improving pre-
diction performance for indoor temperature in public buildings
Acknowledgements This work is supported by Mitacs Accelerate based on a novel deep learning method. Build Environ
program and BrainBox AI. 148:128–135
18. Riekstin AC, Langevin A, Dandres T, Gagnon G, Cheriet M
Compliance with ethical standards (2018) Time series-based ghg emissions prediction for smart
homes. IEEE Trans Sustain Comput 1:1
19. Liang Y, Ouyang K, Jing L, Ruan S, Liu Y, Zhang J, Rosenblum
Conflict of interest The authors declare that they have no conflict of DS, Zheng Y (2019) Urbanfm: inferring fine-grained urban flows,
interest. In: Proceedings of the 25th ACM SIGKDD international con-
ference on knowledge discovery and data mining. ACM, 2019,
pp 3132–3142
References 20. Du Z, Fan B, Jin X, Chi J (2014) Fault detection and diagnosis for
buildings and hvac systems using combined neural networks and
1. Costa A, Keane MM, Torrens JI, Corry E (2013) Building subtractive clustering analysis. Build Environ 73:1–11
operation and energy performance: monitoring, analysis and 21. Castilla M, Álvarez J, Ortega M, Arahal M (2013) Neural net-
optimisation toolkit. Appl Energy 101:310–316 work and polynomial approximated thermal comfort models for
2. Yang L, Yan H, Lam JC (2014) Thermal comfort and building hvac systems. Build Environ 59:107–115
energy consumption implications-a review. Appl Energy 22. Abdel-Nasser M, Mahmoud K (2019) Accurate photovoltaic
115:164–173 power forecasting models using deep lstm-rnn. Neural Comput
3. Pérez-Lombard L, Ortiz J, Pout C (2008) A review on buildings Appl 31(7):2727–2740
energy consumption information. Energy Build 40(3):394–398 23. Jain A, Smarra F, Behl M, Mangharam R (2018) Data-driven
4. Baniasadi A, Habibi D, Bass O, Masoum MAS (2018) Optimal model predictive control with regression trees-an application to
real-time residential thermal energy management for peak-load building energy management. ACM Trans Cyber-Phys Syst
shifting with experimental verification. IEEE Trans Smart Grid 2(1):4
1:1 24. Smarra F, Jain A, de Rubeis T, Ambrosini D, D’Innocenzo A,
5. Standard A (2017) Standard 55–2017 thermal environmental Mangharam R (2018) Data-driven model predictive control using
conditions for human occupancy. Ashrae, Atlanta random forests for building energy optimization and climate
6. Rojas JD, Kunusch C, Ocampo-Martinez C, Puig V (2015) control. Appl Energy 226:1252–1272
Control-oriented thermal modeling methodology for water- 25. Javed A, Larijani H, Ahmadinia A, Emmanuel R (2014) Com-
cooled pem fuel-cell-based systems. IEEE Trans Ind Electron parison of the robustness of rnn, mpc and ann controller for
62(8):5146–5154 residential heating system. In: 2014 IEEE fourth international
7. Afroz Z, Urmee T, Shafiullah G, Higgins G (2018) Real-time conference on big data and cloud computing. IEEE, 2014,
prediction model for indoor temperature in a commercial build- pp 604–611
ing. Appl Energy 231:29–53 26. Rahman A, Srikumar V, Smith AD (2018) Predicting electricity
8. Sturzenegger D, Gyalistras D, Morari M, Smith RS (2016) Model consumption for commercial and residential buildings using deep
predictive climate control of a swiss office building: implemen- recurrent neural networks. Appl Energy 212:372–385
tation, results, and cost-benefit analysis. IEEE Trans Control Syst 27. Yan Y, Luh PB, Pattipati KR (2017) Fault diagnosis of hvac air-
Technol 24(1):1–12 handling systems considering fault propagation impacts among
9. Chen X, Wang Q, Srebric J (2015) A data-driven state-space components. IEEE Trans Autom Sci Eng 14(2):705–717
model of indoor thermal sensation using occupant feedback for 28. Yao Y, Lian Z, Liu W, Hou Z, Wu M (2007) Evaluation program
low-energy buildings. Energy Build 91:187–198 for the energy-saving of variable-air-volume systems. Energy
10. Huang H, Chen L, Hu E (2015) A neural network-based multi- Build 39(5):558–568
zone modelling approach for predictive control system design in 29. Gers FA, Schraudolph NN, Schmidhuber J (2002) Learning
commercial buildings. Energy Build 97:86–97 precise timing with lstm recurrent networks. J Mach Learn Res
11. Serale G, Fiorentini M, Capozzoli A, Bernardini D, Bemporad A 3:115–143
(2018) Model predictive control (mpc) for enhancing building 30. Hochreiter S, Schmidhuber J (1997) Long short-term memory.
and hvac system energy efficiency: problem formulation, appli- Neural Comput 9(8):1735–1780
cations and opportunities. Energies 11(3):631 31. Alom MZ, Taha TM, Yakopcic C, Westberg S, Sidike P, Nasrin
12. Huang H, Chen L, Hu E (2015) A new model predictive control MS, Hasan M, Van Essen BC, Awwal AA, Asari VK (2019) A
scheme for energy and cost savings in commercial buildings: an state-of-the-art survey on deep learning theory and architectures.
airport terminal building case study. Build Environ 89:203–216 Electronics 8(3):292
123
Neural Computing and Applications (2020) 32:17569–17585 17585
32. Lipton ZC, Kale DC, Elkan C, Wetzel R (2015) Learning to Publisher’s Note Springer Nature remains neutral with regard to
diagnose with lstm recurrent neural networks. arXiv:1511.03677 jurisdictional claims in published maps and institutional affiliations.
33. Ashrae A (2002) Ashrae guideline 14: measurement of energy
and demand savings. Am Soc Heat Refrig Air-Cond Eng
35:41–63
123