LSTM-based Indoor Air Temperature Prediction Framework For HVAC Systems in Smart Buildings

Neural Computing and Applications (2020) 32:17569–17585
https://doi.org/10.1007/s00521-020-04926-3(0123456789().,-volV)(0123456789().
,- volV)
ORIGINAL ARTICLE
LSTM-based indoor air temperature prediction framework for HVAC

systems in smart buildings
Fatma Mtibaa1 • Kim-Khoa Nguyen1 • Muhammad Azam2 • Anastasios Papachristou2 • Jean-Simon Venne2 •
Mohamed Cheriet1
Received: 21 November 2019 / Accepted: 6 April 2020 / Published online: 4 May 2020
Springer-Verlag London Ltd., part of Springer Nature 2020
Abstract
Accurate indoor air temperature (IAT) predictions for heating, ventilation, and air conditioning (HVAC) systems are
challenging, especially for multi-zone building and for different HVAC system types. Moreover, the nonlinearity of the
buildings thermal dynamics makes the IAT prediction more difficult since it is affected by complex factors such as
controlled and uncontrolled points, outside weather conditions and occupancy schedule. This paper presents a long short-
term memory (LSTM) model to predict IAT for multi-zone building based on direct multi-step prediction with sequence-
to-sequence approach. Two strategies, LSTM-MISO and LSTM-MIMO, are built for multi-input single-output and multi-
input multi-output, respectively. The performance of these two strategies has been evaluated based on two case studies on
real smart buildings using variable air volume (VAV) and constant air volume (CAV) systems. For both buildings,
experimental results showed that the LSTM models outperform multilayer perceptron models by reducing the prediction
error by 50%.
Keywords HVAC LSTM Sequence-to-sequence Multi-step ahead predictions VAV CAV
1 Introduction according to ASHRAE standard 55-2017 [5]). Therefore,

accurately modeling IAT is required. The IAT prediction
The total energy used in buildings accounts for 40% of the contributes to keep IAT within a comfortable range. Hence,
global energy consumption and up to 30% of carbon the IAT prediction model helps to offer a future boundary
dioxide emissions in the world [1, 2]. In building sectors, setting and detects any contradiction between estimated
heating, cooling, ventilation and air-conditioning (HVAC) and actual conditions. The IAT prediction can be achieved
system is responsible for more than half of the energy using physics-based or data-driven methods. Physics-based
consumed [3]. The reduction of the energy consumption is models are based on Fourier’s law of heat conduction
crucial to enhance energy efficiency. However, the mini- which is discretized into the finite difference method and
mization of the energy consumption may influence on the typically expressed in an resistance–capacitance analogy
indoor comfort. Indoor air temperature (IAT) is one of the [6]. However, the IAT model varies from one zone to the
essential thermal comfort parameters [4]. It is necessary others and is nonlinear since it is affected by complex
that IAT variations stay around the upper and lower factors, for instance, controlled and uncontrolled points,
boundaries of comfort (67 and 82 F, respectively, outside weather conditions, occupancy schedule, etc. These
factors make the IAT prediction modeling using physical
approach more challenging and time-consuming [7],
& Fatma Mtibaa especially for multi-zone building and when there are dif-
fatma.mtibaa.1@ens.etsmtl.ca
ferent types of HVAC systems to control [8]. Data-driven
1
Synchromedia Laboratory - École de Technologie models are based on the data generated from a large
Supérieure, Université du Québec, 1100 Notre-Dame St W, number of sensors and thermostats already deployed. Data-
Montreal, QC H3C 1K3, Canada driven approaches, to model IAT, have been studied in
2
Smart Building Lab, BrainBox AI, 2075 Boulevard Robert- different previous studies [9, 10]. The first advantage to use
Bourassa, Montreal, QC H3A 2L1, Canada
123
17570 Neural Computing and Applications (2020) 32:17569–17585
data-driven approaches is to eliminate the cost and time- parameters to increase the performance of multi-step pre-
consuming task to build IAT physics-based models. diction models; (3) the validation of the proposed frame-
Besides, IAT model is easy to implement for a multi-zone work in two real cases with the industrial data.
system using artificial intelligence (AI) techniques since it This paper is structured as follows: Section 2 summa-
can deal with nonlinearity in the system [11]. rizes the prior work related to our research. Section 3
Recently, artificial neural networks (ANN) and nonlin- describes the two types of buildings used in this study.
ear autoregressive network with exogenous inputs Section 4 illustrates the detailed methodology of our pro-
(NNARX) have been extensively used to model indoor posed data-driven framework for modeling IAT for both
environments [12–14]. However, prior work generally MISO and MIMO architectures using S2S approach. Sec-
adopted a recursive prediction strategy for predicting tion 5 discusses the experimental results and compares the
multi-step ahead [14–16]. For instance, the model predicts performance of the proposed models. Finally, we conclude
time t and uses the predicted value to predict t þ 1, and key findings and present future research directions.
recursively until t þ p, where p is the time horizon. This
method accumulates prediction errors at each time step.
Furthermore, the ANN method considers each input as an 2 Related work
independent parameter. It ignores the time dependency
between sequential values. Unlike many studies which The main advantage of a data-driven approach is to reduce
used ANN with multilayer perceptrons (MLP) models or the cost and time-consumed by traditional physics-based
NNARX, this paper investigates a time series approach for techniques. Moreover, the data-driven approaches can deal
multi-step prediction model using long short-term memory with nonlinearity, incomplete or noisy data [11]. Machine-
(LSTM) which is shown to be an accurate forecasting learning algorithms have been applied to design dynamic
method for time series data [17, 18]. In this paper, two models of the HVAC system. For instance, the multi-step
prediction structures are used to model IAT in multi-zone prediction of IAT can be used in a predictive control
with multi-step prediction: the LSTM-MISO architecture approach and then leads to improving the thermal comfort
uses multi-input single-output to predict the temperature and decreasing the energy consumption of buildings [17].
for each zone; the LSTM-MIMO architecture uses multi-
input multi-output to predict IAT for all zones simultane- The key common algorithms applied in data-driven
ously with only one model. This strategy shows clear approaches that have been used extensively in the building
advantages compared to prediction models presented in sector for IAT modeling are regression trees [23], random
prior work [12, 14, 19]. The proposed LSTM framework is forests [24], nonlinear autoregressive network with
based on a direct sequence-to-sequence (S2S) multivariate exogenous inputs (NARX) [7], NNARX [14], ANN [13]
multi-step time series prediction model, instead of the and recurrent neural networks (RNN) [25]. Table 1 sum-
recursive model, which is helpful to better control the marizes state-of-the-art of data-driven approaches. Jain
HVAC system’s operation. et al. [23] combine multi-output regression trees to repre-
The key problem to be addressed in this paper is an IAT sent the system’s dynamics. Yet, the modeling accuracy
prediction model used for a CAV system cannot be applied to using single trees to constitute multi-step prediction for
a VAV system or vice versa without decreasing the perfor- zone temperature is strongly affected by over-fitting and
mance. Therefore, a general modeling process considers high variance. The authors in [24] model the temperature
different properties of both systems are required to save time by a set of linear regression models, which change after
and cost particularly when there are many HVAC systems to each time step. They model their system with regression
control inside a large-scale building. Our proposed models trees to predict temperature for multi-zone using MISO
have been tested in two different types of buildings: the first structure and extend them to a random forest model.
floor of a hotel in Montreal with five VAVs systems and a However, their model is complicated and time-consuming.
small retail store with three zones supplied by three CAVs Nowadays, the ANN model has been widely applied for
systems. A general framework with specific feature selection several type of applications in HVAC sector, such as Fault
approaches which can adapt to VAV and CAV types has Detection and Diagnostics (FDD) [20], thermal comfort
been defined. Furthermore, we evaluated the effect of tuning approximation [21] and IAT prediction [12, 13]. Du et al.
model hyperparameters in both systems to increase the [20] developed ANN-based tool to detect faults in the
accuracy of the prediction model. supply air temperature control loop in commercial building
The contributions of this paper are: (1) a data-driven with VAV systems. The authors used a combined neural
framework for modeling IAT with LSTM-MISO and networks model which includes the basic neural networks
LSTM-MIMO models based on the S2S approach; (2) and auxiliary neural networks to detect faults and then used
different settings of previous time step and input clustering approach for classification to diagnose the fault
123
Neural Computing and Applications (2020) 32:17569–17585 17571
Table 1 Comparison of models prediction

References Data-driven model HVAC type Prediction horizon Type of model Tuning model
[20] ANN – Clustering VAV – MISO Yes

[21] ANN – MLP Solar cooling 60 s MISO Yes
[19] UrbanFM – – – No
[10] ANN – MLP CAV 2h MISO and MIMO No
[13] ANN – MLP CAV 1 to 4 h MISO Yes
[17] RNN – LSTM CAV 5 to 30 min MISO No
[22] RNN – LSTM – 1h MISO No
[7] RNN – NARX VAV 28 days MISO and MIMO Yes
[14] RNN – NNARX CAV 1h MISO Yes
[12] RNN – NNFL CAV 6h MISO No
[15] ANN – MLP VAV 15 min MISO No
[16] ANN – MLP VAV 15 min MISO No
[23] Regression trees CAV 6h MISO No
[24] Random forests CAV 6h MISO No
sources. The proposed models diagnose the faults using using MISO structure are proposed by [15]. Zeng et al. [16]
context information related to the monitoring parameters developed an optimal control of multi-zones VAV system.
such as supply chilled water temperature, return chilled They elaborated a data-driven predictive model using MLP
water temperature, chilled water flow rate and chilled water to predict the environmental conditions of each zone and
valve position. The principal component analysis is carried optimize energy consumption. The IAT is predicted with
out to analyze the contributions of these parameters in the only one step ahead. Moreover, multi-step prediction is
supply temperature control loop. The occurrence of faults necessary to lead a real-time implementation in the control
is computed according to a combined relative error and its phase. Moreover, only two control parameters were used as
threshold. Castilla et al. [21] proposed a context-aware inputs in the prediction model in [15, 16]. Neither specific
neural networks model using human and environmental feature selection methodology nor model tuning approach
variables for approximating thermal comfort evaluation for was implemented. The context information like weather
HVAC systems. Their model avoids the costs involved in data, control parameters and other external factors might
calculating the classical predictive mean vote (PMV) index improve the future prediction. Liang et al. [19] designed a
in terms of the computation time and the extensive sensor framework named UrbanFM based on deep neural net-
network size required to collect the input data. Moreover, it works. UrbanFM is composed of two models, an inference
allows the use of PMV index within real-time model pre- network component and an external factor fusion compo-
dictive control framework. Attoue and al. [13] developed a nent. The inference network component generates a fine-
simple ANN-MISO model to predict indoor temperature grained flow from coarse-grained inputs by using a novel
for different forecasting time steps. They proposed a feature extraction and distributional upsampling modules.
methodology based on the selection of pertinent input The external factor fusion component handles the context
parameters from a large set of features. Their experimental information (like the day of the week, time of the day,
results show that outdoor and facade temperature data weather, other external factors) to capture near and distant
provide good forecasting results of indoor temperatures. spatio-temporal dependencies. This component plays an
Moreover, their results show that predictions were accurate important role in providing a prior knowledge and
for up to 2 h. However, the predictions have unsatisfactory improves the inference performance under sparse sam-
accuracy for more than 4-h forecast ahead. Huang et al. pling. A feature selection investigation is done to find the
[12] develop an hybrid MPC based on neural network exact number of control parameters that must be included
feedback-linearization model to predict IAT over 6 h as input in the predictive model to increase the accuracy.
ahead. The goal of this approach is to linearize the system For example, the CAV system has only basic control
using the neural network through feedback to build non- parameters like heating stages, cooling stages and fan
linear functions approximation. The type of HVAC system stages. On the other hand, the VAV systems have more
used in [12] is designed with constant-air volume (CAV). parameters like supply air temperature, damper position,
Prior studies also model IAT for a VAV system [7, 16]. etc. In the literature, IAT prediction models have been
An indoor air temperature prediction models of multi-zone proposed for either the CAV or VAV systems, but to the
123
best of our knowledge, no prediction model has been tested week) in their models. A few studies have investigated the
or adapted on both types of systems at the same time. In usefulness of LSTM for IAT multi-step predictions in the
addition, the development of multiple models for multi- HVAC system [17]. An LSTM prediction model with
zones is time-consuming. Yet, the size of the control MISO structure was proposed by [17] to predict IAT until
problem grows rapidly as the number of air handling units 30 min using a recursive prediction approach. Neverthe-
(AHUs) and controlled zones increases. A multi-zone less, their proposed model did not show clear advantages
modeling approach using the MLP-MIMO model was compared to the traditional prediction model like SVM and
proposed by [10] to forecast 2-h ahead temperature inside decision tree. Indeed, the use of recursive prediction can
an open space commercial building. Afroz et al. [7] predict decrease the prediction performance. This paper system-
IAT in multi-zone buildings using a different tuned model atically investigates the performance of direct multi-step
based on NARX model. The authors use MISO architecture prediction approach using LSTM-MISO and LSTM-MIMO
to predict one step ahead then MIMO architecture to pre- architectures in CAV and VAV HVAC systems.
dict multi-step ahead for the same zone. In this paper we
use MIMO architecture to predict multi-zone with multi-
steps ahead. Delcroix et al. [14] predicted the behaviors of 3 Smart building models
IAT using NNARX-MISO. They carried out their experi-
ments with the same type of CAV-building used in this 3.1 CAV-building
paper, and they compared their results with the gray-box
model and the linear autoregressive model with exogenous CAV systems supply air at a constant volume and variable
inputs (ARX). Their comparisons show the NNARX model temperature [27]. Figure 1a illustrates one of three zones of
achieves the highest performance the alternative models. retail store with a CAV system. The fresh air entering from
However, the authors assume that the future exogenous outside is mixed with the return air to produce fresher air to
inputs (control parameters, outdoor temperature, etc.) are the fan ventilation. The speed of the fan is fixed, and it is
known, which cannot be true in the real case. All [7, 10] controlled by an (ON/OFF) switch. Thus, even for part-
and [14] develop a one-step forecasting model and then use load conditions, the fans use the maximum energy that
a recursive multi-step forecasting strategy to predict the leads to wasting energy. The mixed air is conditioned by
future. Such recursive strategy has two major limitations. the heating/cooling coils and then distributed to the zones
First, it accumulates prediction errors, because prediction through different diffusers. There is no controllable ter-
values are used instead of real observations. This method minal damper in the conditioned area zone. The terminal is
degrades the performance of the model as the number of set at a fixed opening level and cannot be actively con-
future steps increases. Second, it considers each input trolled. The air returns from the zone to the roof top unit;
parameter as an independent parameter. Consequently, the then, it can be rejected outside or mixed with the fresh air.
temporal dependency among continuous variables is The IAT and humidity are monitored by sensors. There is
ignored, while the historical data collected from HVAC significant thermal coupling between the three zones and
systems are generally time series data. These limitations with the neighbor spaces. The CAV system uses five
can be tackled by adopting a technique that takes into controlled points for each zone i, to maintain the IAT at the
account the temporal relationship among input parameters comfort value, the fan ventilation stage (FVS), two heating
and using a direct prediction approach to avoid the problem stages (HS1 and HS2 ) and two cooling stages (CS1 and CS2 )
of error accumulation. A powerful solution for modeling as described in Table 2 and enumerated by 1, 2 and 3,
sequence dependency is RNN models. LSTM network is a respectively, in Fig. 1a. The fan, cooling and heating stages
RNN that overcomes the problem of training a recurrent of the CAV are controlled by the heating and cooling set
network with the architecture of learnable gates. LSTM had points versus the zone’s temperature. The total power use
been found suitable for electric consumption, prices fore- of each CAV system in monitoring time Dt is calculated
cast and also for emission factor prediction to schedule according to the stages of control parameters, and it is
appliances use in the smart house domain [18, 26]. stated as:
Recently, a LSTM-RNN has been proposed in [22] for 2 X
X Dt
predicting the photovoltaic power. Specifically, the authors Pi ðtÞ ¼ ðCSji ðtÞ Cc þ HSji ðtÞ Ch Þ
compare the prediction accuracy of five different LSTM j¼1 t¼0
ð1Þ
models. Their results show LSTM for regression with time X
Dt
f
steps and LSTM for regression using the window technique þ FVSi ðtÞ C ; 8i
to achieve the best performance. However, the authors do t¼0
not take into account the context information (such as wind

speed, outside air temperature, time of day and day of
123
Fig. 1 Schematic diagram of air handling unit in both buildings with CAV and VAV systems: (a) represents CAV-building and (b) represents
VAV-building
where Pi ðtÞ represents the total amount of power at time t kW of the cooling equipment i, Ch is the capacity in kW of
for zone i, CSji ðtÞ ¼ f0; 1g is the stage j of the cooling unit the heating equipment i, and C f is the capacity in kW of the
of zone i at time t, HSji ðtÞ ¼ f0; 1g is the stage j of the fan unit. The total amount of power Pi ðtÞ has a positive
heating unit of zone i at time t, FVSi ðtÞ ¼ f0; 1g is the influence on the prediction model as described in Table 5
stage of the fan unit of zone i at time t, C c is the capacity in and will be used as input to the model.
123
Table 2 Control points

CAV-building VAV-building Control parameters Description Unit value
parameters
X BAD Bypass air damper %
X CS1 Cooling stage 1 for AC1 b
X CS2 Cooling stage 2 for AC1 b
X MAD Mixing air damper %
X X FVS Fan ventilation stage b
X HO Heating output %
X Di Damper position opening level %
X X HS1i /CS1i Heating/cooling stage 1 for zone i b
X X HS2i /CS2i Heating/cooling stage 2 for zone i b
3.2 VAV-building dependencies of the elements in the AHU system and the
high number of control parameters as described in Table 2.
Figure 1b shows a VAV system deployed in the ground The action taken by each control parameter can change the
floor of a hotel. It has one air handling unit (AHU), AC1, IAT value. The VAV system is controlled by eleven con-
which serves a conditioned area with five zones. The main trol points, to ensure the heating/cooling demand and sat-
components of the AHU are cooling/heating coils and an isfy the thermal comfort. As described in Table 2, six
air supply fan represented by system components number control points in AC1 are represented by the set of system
12, 6 and 13 in Fig. 1b, respectively. Each zone has a components numbered from 1 to 6 and the other five
variable air volume (VAV) box and a duct heater con- control points in each zone i are numbered from 7 to 11.
trolled by the control parameters, heating/cooling stage 1 The damper position of each zone i has a direct impact on
and 2, defined by the system components number 8 and 9, energy load, calculated by Eq. 2, where Di ðtÞ represents
respectively. The AHU circulates the air in its zones by the opening level of each VAV box damper of each zone i
supplying fresh air from outside that is passed by the fresh and q is the fan demand in (kW) of the air handling unit
air damper to control how much fresh air is needed. The AC1. In other words, this equation gives the power used by
fresh air damper and exhaust air are modulated together to each opening action for each VAV box damper. This
maintain the fluidity of the pressure of the circulating air in parameter is an uncontrolled variable and will be used as
the system ducts. The fresh air is then conditioned by input to the prediction model as described in Table 3.
heating or cooling as per the zone demand by the heat- !
Di ðtÞ
ing/cooling coils in the supply duct. Since the system’s fan Pi ðtÞ ¼ PN q; 8t ð2Þ
is controlled by a binary switch, not by a variable fre- i¼1 Di ðtÞ
quency drive (VFD), the pressure in the ducts is regulated
by the bypass air damper (BAD). The BAD manages the
change in air pressure that would occur due to the different
4 Data-driven framework for modeling IAT
VAVs damper discrete opening levels in each zone Di ðtÞ
(from 0 to 100%). If a specific zone demands more cooling/
4.1 Preprocessing methodology
heating, the cooling/heating coils in that respective VAV’s
zone will operate to meet the required temperature set point
4.1.1 Data collection and feature selection
[27]. The mixing air damper (MAD) is used as a form of
energy recovery mechanism since the returning air has
The data were collected from November 1, 2018, at 12
already been conditioned when it was in the supply duct.
a.m., to March 31, 2019, at 11:59 p.m., with 10-min
The MAD is modulated if the return temperature is within
sampling intervals. Since not all the sensors are sending the
the desired range, whether it is cooling or heating season
data at the same time step, we developed preprocessing
when compared to the fresh air temperature to precondition
algorithms to sample and interpolate the data every 10 min.
the fresh air before it is exposed to the cooling/heating coils
A feature selection process is executed to obtain a set of
so that achieving the desired temperature will cost less.
principal variables using Extra Trees classifier and corre-
The VAV system can save more energy compared to
lation techniques. We apply this method to each building
CAV [28]. However, it is difficult to accurately model the
datasets. Significant features are selected as input to predict
IAT variations in VAV systems regarding the complex
the IAT, and they are summarized in Table 3. In order to
123
Table 3 Multivariate inputs variables

VAV-building CAV-building
Inputs Description Unit Inputs Description Unit
Controlled variables, vi ðtÞ Controlled variables, vi ðtÞ
vi ðtÞ The controlled vector at time t of zone i – vi ðtÞ The controlled vector at time t of zone i –
Uncontrolled variables, ui ðtÞ Uncontrolled variables, ui ðtÞ
h Hour of day Integer h Hour of day Integer

d Day of week Integer d Day of week Integer
Si Supply air temperature of each zone i C Hi Indoor humidity of the zone i %
Sac1 Supply air temperature of AC1 C Tiout Outdoor air temperature C
Hdi Heating demand of zone i CFM out Predicted outdoor air temperature C
Tiþp
Tiout Outdoor air temperature C Pi Power use of zone i kW
out
Tiþp Predicted outdoor air temperature C
Pi Impact of damper position on temperature value kW
Target variables, yi ðtÞ Target variables, yi ðtÞ
Tiin Indoor temperature of zone i C Tiin Indoor temperature of zone i C
build predictive models, the selected features including 4.1.2 Model training organization for multi-step prediction
context information are categorized into three groups:
After data preprocessing steps, the dataset was split into
• Controlled features: Includes current control actions
randomly sampled training (60 % of the data), validation
that impact HVAC system operations. The set vi ðtÞ is
(20 % of the data) and testing (20 % of the data) sets from
the vector of the control variables at time t for each
five months of data. Models are developed using the
zone i as depicted in Table 2. The set of control
training and validation datasets, and the prediction results
variables depends on the type of building as shown in
are from the test dataset. The IAT can be easily predicted in
Sect. 3.
short term with only 10 min ahead prediction since they
• Uncontrolled features: The set ui ðtÞ is the vector of the
follow a slow dynamic process. However, a long-term
measurable input variables at time t for each zone
prediction is needed for optimal operational of the HVAC
i (e.g., the supply air temperature, the outdoor air
system. To create the dataset to train the multi-step pre-
temperature, the impact of damper position on temper-
diction model, we test different configurations, with a
ature value, etc.).
different number of previous time steps and future time
• Target features: The set yi ðtÞ is the vector Tiin ðtÞ of IAT
steps. In this paper, we looked at multivariate inputs for
for each zone i.
multi-step time series forecasting model. The multivariate
We add the hour of day and day of the week as inputs. The multi-step time series forecasting is a challenging task,
hour input leads to know the difference between tempera- especially in the preparation of time series data and the
ture profile during occupancy and unoccupancy time. The definition of the shape of multiple inputs and multi-step
day input leads to distinguish between business and outputs for the model. Therefore, the time series data are
weekend days. All selected features are normalized transformed into a supervised learning problem. Each
between 0 and 1 before they are used for training, to pre- dataset of each building was manipulated and re-scaled in
vent the dominant effect of particular variables. The such a way that it can feed the prediction model. The
equation in (3) is used to scale the variables into [0, 1]. multivariate time series dataset for each zone i is defined
x xmin as:
xscale ¼ ð3Þ
xmax xmin
where xscale represents the scaled variable, x defines the
variable value before scaling, xmin and xmax are the mini-
mum and maximum values of the dataset to be scaled,
respectively.
123
2 3 4.2 Data-driven LSTM-based framework

vi ðt l þ 1Þ ui ðt l þ 1Þ Yi ðt l þ 1Þ
6 vi ðt l þ 2Þ ui ðt l þ 2Þ Yi ðt l þ 2Þ 7 for multi-step IAT prediction
6 7
6 7
6 vi ðt l þ 3Þ ui ðt l þ 3Þ Yi ðt l þ 3Þ 7
6 7 The predictive model of the IAT of each zone i has the
Xil ðtÞ ¼ 6 .. .. .. 7
6 . . . 7 form shown in Eq. 7.
6 7
6 7
4 vi ðt 1Þ ui ðt 1Þ Yi ðt 1Þ 5 Tiin ðt þ kÞ ¼ f ðTiin ðtÞ; vi ðtÞ; ui ðtÞÞ; 8i 2 ½1; N ð7Þ
vi ðtÞ ui ðtÞ Yi ðtÞ
The historical data collected from buildings have many
ð4Þ features such as IAT, relative humidity, heating demand,
Each row of the matrix Xil ðtÞ represents the input vector damper positions, heating/cooling stages. The interaction
between these features is complicated and nonlinear.
given to the network at each time step. The sliding window
Therefore, it might be difficult to build a linear or poly-
algorithm is applied to define the shape of training data sets
nomial regression, and it can provide a poor prediction
and predicted vectors. Window method leads to the use of a
accuracy. This is why nonlinear models like neural network
set of recent last time steps to predict the next time step.
models are used. To construct the predictive models we
The window length l represents the number of past time
will use two types of neural network models, LSTM and
steps used before the predicted horizon p, and it defines the
MLP, which will be discussed in details in the next sec-
length of the data over which the algorithm trains the data.
tions. We use direct-S2S forecasting models to predict
The window moves as the new data come in, sample by
different horizon for MISO and MIMO structure. This
sample, over the data used for training. In fact, the algo-
approach involves a heavier computational load than
rithm uses the recently measured data samples to replace
recursive forecasting used by [12]. These models will be
the previous data frame. Consequently, the model can be
used in future work, to construct the optimization model
retrained and updated using the newest dataset. For
for control purpose.
instance, to predict p time steps, we use a window of l size
which includes l prior input vectors. The size of the win-
4.2.1 LSTM model definition
dow is a parameter that can be tuned according to past and
future steps to predict. For example, if six prior input
Few previous steps data can lead to getting an accurate
vectors (l = 6 = 60 min) of each of three inputs time
predictive model; however, in some cases, older data can
series are used, and three time steps (p = 3 = 30 min) of
lead to recognizing general trends that recent data fail to
temperature must be predicted, then the input at a time t
show. This problem is called long-term dependencies. The
can be represented as:
2 3 characteristic of MLP and basic RNN cannot solve this
vi ðt 5Þ ui ðt 5Þ Tiin ðt 5Þ problem. On the one hand, MLP does not consider the
6 v ðt 4Þ u ðt 4Þ T in ðt 4Þ 7 dataset as a time series. Moreover, RNN fails to consider
6 i i i 7
6 7 recent with distant past data in the learning phase. To deal
6 v i ðt 3Þ u i ðt 3Þ T in
ðt 3Þ 7
6 6
Xi ðtÞ ¼ 6 i 7 ð5Þ with long-term dependencies in time series data, we use the
in 7
6 vi ðt 2Þ ui ðt 2Þ Ti ðt 2Þ 7 LSTM model introduced by Schmidhuber et al. [29].
6 7
4 vi ðt 1Þ ui ðt 1Þ Tiin ðt 1Þ 5
LSTM is one of many variations of the RNN. The ability of
vi ðtÞ ui ðtÞ Tiin ðtÞ LSTM to reduce the vanishing and exploding gradient
problems efficiently makes such an approach more appro-
And the output will be:
2 in 3 priate for contexts having a long-term dependency prob-
Ti ðt þ 1Þ lem. Since the HVAC data are sequential, and future
6 7
Yi3 ðtÞ ¼ 4 Tiin ðt þ 2Þ 5 ð6Þ outputs depend not only on the current values of inputs but
Tiin ðt þ 3Þ also on the previous information, hence a model based on
LSTM is a good choice to predict IAT in the HVAC sys-
The input data are two-dimensional matrix, with six time tem. The main advantage of LSTM is the use of gates [30]
steps of samples including controlled, uncontrolled and to manage its own memory by choosing to update or not
target features. And, the output data are two-dimensional the information goes through the cell. In fact, LSTM net-
matrix. For each sample of Xi6 ðtÞ, three time steps Yi3 ðtÞ work is able to learn long-term dependencies from an input
will be predicted for each zone i. sequence thanks to its internal memory cells. An example
of LSTM cell is described in Fig. 2.
The information will be added or removed according to
the cell state defined by three different gates. The input
123
consideration of the temporal evolution of the features for

each time step [32].
Two direct-S2S models are developed for MISO and
MIMO architectures. The direct-S2S for LSTM-MISO
architecture uses the input matrix Xil ðtÞ with l past time
steps to predict the output vectors yi ðt þ 1Þ; yi ðt þ
2Þ; :::; yi ðt þ pÞ with p time steps ahead, as shown in Fig. 3.
For the MISO architecture, one model is developed for
each zone i. However, in the case of direct-S2S for the
LSTM-MIMO architecture, one model is developed for all
the zones, as shown in Fig. 4. It includes as input all n
zones feature vectors, xi ; xiþ1 ; :::; xn with all l past time
steps of each feature vector. All controlled variables of all
Fig. 2 Diagram for LSTM cell [31]
n zones are included in the input vectors. Moreover, this
architecture has n output prediction vectors, yi ; yiþ1 ; :::; yn
gate it is responsible for the process of controlling the input
with p future steps each. The two time steps, l and p, have
activation and adding new information to the cell state. The
variable lengths. The main advantage of direct-S2S archi-
forget gate ft is responsible for deciding whether the
tecture is that it allows tuning l to have the best prediction
memory cells require to remember or forget its former
performance of p steps. As discussed in Sect. 5, we set a
status. The output gate ot controls the output activation and
time step ahead to 2 h and we tune l to 5, 10 and 12 h and
determines whether the information from the current cell
then choose the l giving the best prediction performance.
states needs to be sent or not to the next layer. The equa-
tions for the input gate it , forget gate ft and output gate ot
can be expressed by:
4.3 Data-driven predictive models deployment
it ¼ rðWi ½ht1 ; xt þ bi Þ ð8Þ 4.3.1 Baseline models deployment

ft ¼ r Wf ht1 ; xt þ bf ð9Þ
The proposed models are compared with relevant past
ot ¼ rðWo ½ht1 ; xt þ bo Þ ð10Þ studies’ models as baselines, and they are as follows:
where Wi , Wf and Wo are the weight matrices, bi , bf and bo • MLP [10]: MLP is a class of feedforward artificial
are the biases for the input gate, forget gate and output neural network. MLP connects multiple layers in a
gate, respectively; ht1 is the previous hidden state, and xt directed graph. In this paper, the MLP model imple-
is the input at current time step. The r operator is the mented includes three layers. The first layer is fed by a
logistic sigmoid function, and it is used as an activation set of input features presented in Table 3. The second
function in the hidden layer which ranges from 0 to 1 and it layer is a hidden layer, containing 200 neurons. The
is described by (11). third layer is the output layer with p linear neurons. A
direct MLP-MISO and MLP-MIMO models are imple-
rðxÞ ¼ ð1 þ ex Þ1 ð11Þ mented for VAV and CAV system cases.
• NNARX [14]: this network is implemented using a
4.2.2 LSTM-based direct sequence to sequence (S2S) recursive prediction method. The exogenous inputs
prediction architecture contain the controlled variables vi ðtÞ and uncontrolled
variables ui ðtÞ. We used the forecasted weather data
LSTM is useful for different types of applications, which
need various architectures according to the studied prob-
lem. In this paper, the problem can be formulated as, given
a time series of observations as input, predict a sequence of
observations as output for a range of future time steps. In
order to further improve the flexibility of the temperature
forecasting methodology, an LSTM-based direct sequence
to sequence (direct-S2S) architecture is investigated to
solve the defined problem. This method was used in the
video classification problem, which takes video frames as
input and tries to label each frame of the video with the Fig. 3 LSTM-based direct-S2S for MISO architecture
123
we design the LSTM model with four fully connected

layers used repeat vector and time-distributed layers to
form the LSTM model for MISO and MIMO architectures.
For the case of LSTM-MISO model, the vector matrix x(t)
represents the input for the first LSTM layer, which con-
sists of 200 LSTM cells. The repeat vector repeats one time
for each time step p the incoming 1D inputs vector from the
previous LSTM layer to create 2D matrix output in order to
include multiple future time steps in the model. For
instance, if the shape of the input was (32), and we want to
predict three times ahead, so the output shape of the repeat-
vector layer would be (3, 32). The output 2D matrix from
the repeat vector layer passes as the input to the LSTM
layer. At each moment, the output of the LSTM layer is
connected to the fully connected dense layer. Then, time-
distributed layer applies a specific layer such as dense layer
to every sample it receives as an input to get the final
output ytþ1 with p time steps ahead. The exact architecture
for direct-S2S LSTM-MISO model is as follows:
(1) Input (xi ðt lÞ; xi ðt l þ 1Þ; :::; xi ðtÞ)
Fig. 4 LSTM-based direct-S2S for MIMO architecture (2) LSTM layer (N = 200, activation function=ReLU)
(3) Repeat vector layer (N=1)
available online and the real observations for vi ðtÞ. The (4) LSTM layer (N = 200, activation function=ReLU)
number of hidden layers and neurons is set to 2 and 200, (5) Time-distributed layer (N = 100, activation
respectively. The activation function is ReLU and the function=ReLU)
solver is Adam. The NNARX-MISO architecture is (6) Fully connected (N = 1, activation function=linear)
implemented for VAV and CAV system use cases. (7) Output (yi ðt þ 1Þ; yi ðt þ 2Þ; :::; yi ðt þ pÞ)
• CANN [19]: The context-aware neural network model
(CANN) is based on the model proposed in [19], named The main differences between MISO and MIMO deploy-
UrbanFM. CANN uses two components: (1) the exter- ment architecture are the input and output data. The exact
nal factor fusion component to integrate context architecture for direct-S2S LSTM-MIMO model is as
information and (2) the inference network component follows:
to include the IAT target feature. The inputs of the first (1) Input (xi ; xiþ1 ; :::; xn with all l past time steps of each
components are the controlled variables vi ðtÞ and vector)
uncontrolled variables ui ðtÞ. The embedding external (2) LSTM layer (N = 200, activation function=ReLU)
factors vector concatenate the continuous features like (3) Repeat vector layer (N = p)
outdoor temperature Tiout ðtÞ, and categorical features (4) LSTM layer (N = 200, activation function=ReLU)
include the hour of day h, the day of the week d, the on/ (5) Time-distributed layer (N = 100, activation
off controlled variables vi ðtÞ. This vector is fed into the function=ReLU)
feature extraction module of the first component to (6) Fully connected (N = p, activation function=linear)
design the complicated interaction. The inputs of the (7) Output (yi ; yiþ1 ; :::; yn with p future steps each)
second components concatenate the context features
extracted from the first component and the target
information (IAT). Then the output of the first and 4.3.3 Modeling error
second component is concatenated to fit a convolution
layer and predict the future steps of IAT. To evaluate the performance of proposed predictive mod-
els, three metrics in (12), (13) and (14) are used to eval-
uate the models: the mean absolute percentage error
4.3.2 LSTM models deployment (MAPE), the root-mean-square error (RMSE) and the mean
absolute error (MAE). The MAPE measures the size of the
To consider a direct-S2S architecture, LSTM model might error in percentage terms, and a lower result indicates
transform the input sequence into the correct output better performance. A high MAPE score indicates a high
sequence representation in the learning phase. Therefore, error range. The RMSE penalizes more larger error values
123
and can be bigger than MAE for outliers. MAE is a com- 5.1 Feature selection experiments
monly used metric to evaluate forecast accuracy and cor-
responds to the mean value of the sum of absolute Since we predict from 30 min to 6 h, we assume that 2 h of
differences between actual and forecast values. future time steps prediction can give a good understanding
n of what features must be used to improve the prediction
100 X
yi yî
MAPE ¼ ð12Þ performance. Experiments have been done with three
n y
i¼1 i
datasets having no HVAC control parameters (VF1, CF1
sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
1X n and CF2), and five datasets having controls (VF2, VF3,
RMSE ¼ ðyk y^k Þ2 ð13Þ VF4, CF3 and CF4). These datasets are built from existing
n k¼1
VAV and CAV datasets by adding or removing some
1X n features. With each dataset, we evaluate the performance of
MAE ¼ jyk y^k j ð14Þ LSTM and two baseline (MLP and NNARX) models using
n k¼1
different metrics. Table 5 shows, in the case of zone in
where y and y^ define the real and predicted outputs, VAV-building, some features have positive impacts on the
respectively, and n is the total observation number. LSTM and NNARX models, but no impact on the MLP
model. The bold values indicate the minimum errors. We
notice that adding input features of control parameters for
5 Results and discussion the VAV (VF2) and for the AC1 (VF3) separately, has
negative impacts on MLP, NNARX and LSTM. Further-
The performance of multi-step IAT prediction models is more, models using dataset including input features of
evaluated on LSTM, NNARX and MLP models applied on control parameters for both AC1 and VAV (VF4) provide
real data collected from hotel and retail store in Montreal the best prediction performance for LSTM and NNARX.
with VAV and CAV system, respectively. Five zones are However, MLP gives the best performance when control
investigated for a VAV-building and three zones for a parameters are not included in the dataset (VF1), but it
CAV-building. Only winter season data are considered in achieves the lowest performance comparing to VF4. We
this study, but the same models can be applied to other notice that the missing of control features in the dataset
seasons in further studies. The choice of the number of CF1 in the case of CAV-building decreases the prediction.
previous steps for the training model is not obvious, and it Moreover, adding control variables to the model CF4 for
can affect the performance of the prediction. An example CAV-buildings increases the prediction performance of
of the effect of using a different number of previous time LSTM and NNARX models. The temperature behavior
steps to predict 2 h in zone 1–2 in VAV-building is shown depends on the time of the action taken by the control
in Table 4. It is observed that the use of 10 h to predict 2 h parameters. Unlike MLP, the LSTM model takes into
gives the best performance, bold values. More experiments account the sequence of time behavior, so including the
have been executed on different zones to find the optimal control variables as inputs in the LSTM model increases
number of previous steps for different number of prediction the performance of the long-term prediction accuracy.
steps ahead. Finally, we used last 1 h and 5 h to predict Similarly, NNARX includes the future control parameters
30 min and 1 h ahead, respectively. Furthermore, we to predict future steps, so the consideration of these data
consider ten previous hours to predict 2 h, 4 h and 6 h makes the prediction more accurate. Therefore, 13 features
ahead. In the following, we present the feature selection used in VF4 and the 9 features used in CF4 have a sig-
procedure and then we evaluate the performance of multi- nificant impact on the multi-step prediction model of IAT
step prediction models for MISO and MIMO architectures. for both single zone in VAV- and CAV-buildings.
Table 4 Effect of the number of

Model Previous steps (h) Future steps (h) MAPE (%) RMSE (C) MAE (C)
previous steps to predict 2 h in
zone in VAV-building 1 5 2 0.4880 0.0225 0.1077
2 10 2 0.4264 0.0173 0.0941
3 12 2 1.0651 0.1187 0.2354
123
Table 5 Datasets and feature selection

Datasets Feature selection MAPE (%) RMSE (C) MAE (C)
VAV-building MLP NNARX LSTM MLP NNARX LSTM MLP NNARX LSTM
VF1 Tiin , Pi ,Hdi , Tiout , h, d 0.8346 0.5145 0.7502 0.0851 0.0865 0.0615 0.1842 0.1142 0.1666
VF2 Tiin , Si ,Pi ,Hdi , Tiout , h, d , Di , HS1i , HS2i 0.8775 0.5385 0.6708 0.0868 0.0911 0.0527 0.1939 0.1193 0.1487
VF3 Tiin , Si ,Pi , Hdi ,Tiout , h, d , Di , MDa ,SFa , Ha 0.8465 0.5165 0.8633 0.0807 0.0821 0.0796 0.1871 0.1146 0.1921
VF4 Tiin , Si ,Pi , Hdi ,Tiout , h, d , Di , HS1i , 0.8431 0.4989 0.5694 0.0853 0.0803 0.0380 0.1861 0.1108 0.1262
HS2i ,MDa ,SFa , Ha
CAV-building MLP NNARX LSTM MLP NNARX LSTM MLP NNARX LSTM
CF1 Tiin , Hiin , Tiout , h, d 0.7857 0.5819 0.5740 0.0565 0.0608 0.0293 0.1617 0.1198 0.1181
CF2 Tiin , Hiin , Tiout , h, d, Pi 0.7743 0.5737 0.5606 0.0545 0.0595 0.0285 0.1593 0.1181 0.1155
CF3 Tiin , Hiin , Tiout , h, d, Pi , HS1i , HS2i 0.7585 0.5335 0.5505 0.0493 0.0525 0.0245 0.1561 0.1099 0.1140
CF4 Tiin , Hiin , Tiout , h, d, Pi , HS1i , HS2i , Fi 0.7375 0.4822 0.5043 0.0479 0.0476 0.0208 0.1519 0.1042 0.1
5.2 Performance evaluation of multi-step short period the improvement by applying LSTM is not
prediction models noticeable compared to baselines, but as soon as we go
further in time (2, 4 and 6 h) the improvement by LSTM
5.2.1 Single-zone prediction results gets more and more important. The MAPE and MAE
scores of NNARX and LSTM models are quite similar,
This section discusses the prediction performance of especially for 6-h prediction. In the case of CAV-buildings,
LSTM-MISO model. We used MLP, CANN, NNARX with the NNARX for 6-h prediction model gives the best MAPE
MISO structure as a baseline models. The prediction and MAE scores, as shown in Fig. 5a and b. It is because
accuracy of the all models is evaluated by comparing three NNARX uses the measured/real values of exogenous
metrics, MAPE, RMSE and MAE. The numerical results of inputs (control parameters and outdoor temperature) to
LSTM-MISO and baseline models and the different steps predict multi-step ahead [14]. NNARX performs the multi-
used are shown in details in Fig. 5. Figure 5a–c represents step prediction by feeding back the IAT prediction output
the MAPE, the MAE and the RMSE results, respectively, and the future exogenous input information as the input for
to predict 30 min, 1 h, 2 h, 4 h and 6 h for the zone in the next prediction. Indeed, we observe that RMSE of the
CAV-building. Figure 5d–f represents the MAPE, the NNARX model is higher than RMSE of LSTM and CANN
MAE and the RMSE results, respectively, to predict models because of the error propagation over time caused
30 min, 1 h, 2 h, 4 h and 6 h for the zone 2 in VAV- by the recursive prediction method used, since RMSE
building. We notice that, in the two cases of building, for a penalizes large error values. Figure 5a–c indicates that the
MLP CANN NNARX LSTM MLP CANN NNARX LSTM MLP CANN NNARX LSTM
1,4 0,3 0,14
1,2 0,25 0,12
1 0,1
0,2
RMSE (℃)
℃)
MAPE (%)
0,08
MAE (℃
0,8
0,15
0,6 0,06
0,1 0,04
0,4
0,2
0,05 0,02
0 0 0
0,5 1 2 4 6 0,5 1 2 4 6 0,5 1 2 4 6
Predicon steps (h) Predicon steps (h) Predicon steps (h)
(a) M A P E (b) M A E (c)RM SE

MLP CANN NNARX LSTM MLP CANN NNARX LSTM MLP CANN NNARX LSTM
1,6 0,35 0,2

1,4 0,18
0,3
0,16
1,2 0,25 0,14
1
RMSE (℃)
MAPE (%)
0,12
MAE (℃)
0,2
0,8 0,1
0,15 0,08
0,6
0,1 0,06
0,4
0,04
0,2 0,05
0,02
0 0 0
0,5 1 2 4 6 0,5 1 2 4 6 0,5 1 2 4 6
(d) M A P E (e) M A E (f)RM SE

Fig. 5 MISO model prediction error for two buildings: (a–c) represent zone 2 in CAV-building , and (d–f) represent zone in VAV-building
123
error of LSTM-MISO model increases according to the deteriorates along with an increasing prediction horizon
time prediction steps in the CAV-building. However, it is compared to LSTM-MISO. In other words, the estimated
not the case for VAV-building, as shown in Fig. 5d–f values of NNARX-MISO generally deviate from measured
where the largest error occurs when the LSTM-MISO values for long-term prediction due to error accumulation
model tries to predict 4 h ahead in VAV-building. The overtime. On the other hand, Fig. 7c and d clearly shows
performance of MISO-LSTM to predict 6 h ahead is better LSTM model performance over a 6-h period is very similar
than to predict 4 h ahead in VAV-building. Figure 6 pre- to that of 30-min period. Consequently, the increasing
sents the results of 6-h ahead prediction of indoor tem- number of advanced time steps does not deteriorate the
perature for zone 2 and zone 1–2 in CAV and VAV- prediction performance of LSTM-MISO model due to the
building, respectively, using the LSTM-MISO and baseline direct prediction method.
approaches. It can be seen that LSTM-MISO and NNARX Prediction efficiency
models work reasonably well for both types of buildings We compare the prediction efficiency of LSTM and
and they are close to real measurements. However, for NNARX in terms of execution time. As depicted in
MLP and CANN models, the prediction variability follows Table 6, the execution time of NNARX is significantly
the real measures, but the predicted values move around higher than LSTM for both types of buildings. In a pre-
the real values. The high prediction error is due to the dictive control system, if the control horizon is 5 min and
neglect of time sequence dependencies. Although CANN the prediction horizon is 4 h, the execution time of IAT
model tries to capture the sequential relationship among prediction could not exceed the control horizon to be
inputs by using the external factor fusion component, it is included in the predictive control system. As shown in
less powerful than the gates mechanism used in LSTM Table 6, for 4-h prediction, the execution time of NNARX
model. Due to the proximate results of NNARX and LSTM is higher than 5 min. In this case, NNARX is not appro-
models, we study further the standard deviation of error priate for a predictive control system against LSTM should
and prediction efficiency of both models. be used because its execution time is 15 seconds only.
Standard deviation of error
Figure 7a–d shows the mean error in terms of temper- 5.2.2 Multi-zone prediction results
ature for each time step predicted on the 20% of testing
data and the standard deviation of this error for LSTM- In this section, we implement a multi-zone model which
MISO and NNARX-MISO models in CAV and VAV- predicts the IAT for all the zones in the building using all
building. We notice the error of LSTM-MISO in under control parameters vectors as input. Figure 8 shows a
0.35 C for both types of buildings. In the same time, the comparison of experimental results between LSTM-MISO
error of NNARX is over 0.45 C. In reality, the precision of models and LSTM-MIMO models. It describes the mean
temperature sensors is often 0:5 C, and this value is errors of all investigated zones of each building for single-
considered as an appropriate benchmark for the accuracy of zone and multi-zone prediction models. It shows that in the
models [33]. Although the error of NNARX is within the case of CAV-building, the MAPE for LSTM-MIMO model
error benchmark, it is less accurate than LSTM. An is better than LSTM-MISO model. Furthermore, the
important remark is the performance of NNARX-MISO MAPE, MAE and RMSE errors for all three zones
(a) Z o n e 2 in C AV-b u ild in g (b) Z o n e 1-2 in VAV-b u ild in g
Fig. 6 Results for 6 h ahead for indoor temperature prediction in: (a) zone 2 in a CAV-building and (b) zone in a VAV-building
123
(a) N N A RX-M ISO z o n e 2 C AV-b u ild in g (b) N N A RX-M ISO z o n e 1-2 VAV-b u ild in g
(c) LSTM -M ISO z o n e 2 C AV-b u ild in g (d) LSTM -M ISO z o n e 1-2 VAV-b u ild in g
(e) LSTM -M IM O C AV-b u ild in g (f) LSTM -M IM O VAV-b u ild in g
Fig. 7 The standard deviation of error for MISO and MIMO models for CAV- and VAV-buildings for 6-h prediction ahead: (a, c, e) CAV-
building and (b, d, f) VAV-building
Table 6 Execution time (in

CAV-building VAV-building
seconds) of NNARX and LSTM
models for CAV-building and NNARX-MSIO LSTM-MSIO NNARX-MSIO LSTM-MSIO
VAV-building
30 min 80.71 4.97 83.9 4.83
1h 182.13 10.20 186.06 8.84
2h 203.19 15.25 287.31 14.45
4h 472.29 14.88 379.22 16.79
6h 550.21 17.54 565.80 17.41
123
MISO MIMO MISO MIMO MISO MIMO
1 0,2 0,09
0,9 0,18 0,08
0,8 0,16 0,07
Mean RMSE (℃)

Mean MAPE (%)
0,7 0,14
Mean MAE (℃)

0,06
0,6 0,12 0,05
0,5 0,1
0,04
0,4 0,08
0,3 0,03
0,06
0,2 0,04 0,02
0,1 0,02 0,01
0 0 0
0,5 1 2 4 6 0,5 1 2 4 6 0,5 1 2 4 6
(a) M A P E (b) M A E (c) RM SE

MISO MIMO MISO MIMO MISO MIMO
1,6 0,8 0,4

1,4 0,7 0,35
1,2 0,6 0,3
Mean RMSE (℃)

Mean MAPE (%)
Mean MAE (℃)

1 0,5 0,25
0,8 0,4 0,2
0,6 0,3 0,15
0,4 0,2 0,1
0,2 0,1 0,05
0 0 0
0,5 1 2 4 6 0,5 1 2 4 6 0,5 1 2 4 6
(d) M A P E (e) M A E (f) RM SE

Fig. 8 Comparison between MIMO and MISO model prediction error for two buildings: (a–c) represent CAV-building and (d–f) represent VAV-
building
decreased after the LSTM-MIMO model is used for 30-min MISO MIMO
and 6-h prediction. For instance, it can be observed that the Time (ms/sample) 3,5
MAPE reduces with more than 0.3% on average to 6-h 3
prediction ahead. We notice that in Fig. 7e, the mean error 2,5
with its standard deviation in terms of temperature for each 2

1,5
time step predicted for LSTM-MIMO in the case of CAV-
1
building is better than for LSTM-MISO model (Fig. 7c)
0,5
and it does not exceed 0.2 C which is less than the
0
accuracy value of temperature sensor ( 0:5 C) [33]. 0,5 1 2 4 6
However, the error in LSTM-MIMO model for VAV- Predicon steps (h)
building case is worse than LSTM-MISO and NNARX-
Fig. 9 Prediction time execution for CAV-building
MISO. As the CAV-building is an open space, and it is a
lightweight structure, the thermal interaction among all
MISO models for 2 and 6 h ahead. As a result, executing
adjacent zones influences the temperature prediction
LSTM-MIMO model for online prediction, for 2 and 6 h
results. So, the consideration of all parameters as inputs in
ahead, is more computationally efficient than LSTM-MISO
the model and the exchanged information among zones
models.
increases the accuracy of the LSTM-MIMO model. On the
other hand, in the VAV-building with heavy-weight
building structure, the interaction among adjacent zones
does not influence the prediction performance. The size of
6 Conclusion
VAV-building zones is not all uniform, and there is no
This paper presents an LSTM-MISO and LSTM-MIMO
significant heat transfer from zone to zone. For these rea-
framework based on direct-S2S architecture to predict
sons, LSTM-MISO model performs better for the VAV-
multi-step ahead of IAT for two real-world cases on dif-
building use case.
ferent buildings. While most of prior works only investi-
Predicting a MIMO model is more expensive in terms of
gate a specific type of HVAC system, the modeling
time compared to a MISO model, as expected. For exam-
framework proposed in this study covers both VAV- and
ple, as shown in Fig. 9, it takes 1.33 ms/sample using a
CAV-buildings. The consideration of control variables as
MIMO model, while only 0.4 ms/sample using a MISO
the input increases the prediction accuracy of the LSTM
model to predict 30 min, which can be explained by the
models. The proposed framework considers a direct-S2S
large amount of input in the MIMO model. However, in
prediction instead of a recursive one, which increases the
Fig. 9 we notice that the execution time for predicting 2 h
accumulation of prediction error throughout the prediction
and 6 h using LSTM-MIMO model is comparable to the
step ahead. This study showed that LSTM-MISO model is
time using LSTM-MISO models. Moreover, as shown in
efficient for VAV-buildings. However, since there is an
Fig. 8, the prediction error for MIMO models is lower than
123
effect of thermal coupling between adjacent zones in CAV- 13. Attoue N, Shahrour I, Younes R (2018) Smart building: use of the
building because of its open space area, it is found that the artificial neural network approach for indoor temperature fore-
casting. Energies 11(2):395
overall prediction accuracy increases using the MIMO 14. Delcroix B, Le Ny J, Bernier M, Azam M, Qu B, Venne J-S
model. We can conclude that LSTM-MIMO is a valuable (2020) Autoregressive neural networks with exogenous variables
method for modeling IAT in a lightweight building with for indoor temperature prediction in buildings. Build Simul.
CAV type of HVAC system. In future work, the proposed https://doi.org/10.1007/s12273-019-0597-2
15. He X, Zhang Z, Kusiak A (2014) Performance optimization of
LSTM-based data-driven framework for multi-step pre- hvac systems with computational intelligence algorithms. Energy
diction ahead of IAT will be used to design predictive Build 81:371–380
control approaches in order to decrease energy consump- 16. Zeng Y, Zhang Z, Kusiak A (2015) Predictive modeling and
tion, e.g., load-shifting control, demand-limiting control or optimization of a multi-zone hvac system with data mining and
firefly algorithms. Energy 86:393–402
optimal start—stop time control. 17. Xu C, Chen H, Wang J, Guo Y, Yuan Y (2019) Improving pre-
diction performance for indoor temperature in public buildings
Acknowledgements This work is supported by Mitacs Accelerate based on a novel deep learning method. Build Environ
program and BrainBox AI. 148:128–135
18. Riekstin AC, Langevin A, Dandres T, Gagnon G, Cheriet M
Compliance with ethical standards (2018) Time series-based ghg emissions prediction for smart
homes. IEEE Trans Sustain Comput 1:1
19. Liang Y, Ouyang K, Jing L, Ruan S, Liu Y, Zhang J, Rosenblum
Conflict of interest The authors declare that they have no conflict of DS, Zheng Y (2019) Urbanfm: inferring fine-grained urban flows,
interest. In: Proceedings of the 25th ACM SIGKDD international con-
ference on knowledge discovery and data mining. ACM, 2019,
pp 3132–3142
References 20. Du Z, Fan B, Jin X, Chi J (2014) Fault detection and diagnosis for
buildings and hvac systems using combined neural networks and
1. Costa A, Keane MM, Torrens JI, Corry E (2013) Building subtractive clustering analysis. Build Environ 73:1–11
operation and energy performance: monitoring, analysis and 21. Castilla M, Álvarez J, Ortega M, Arahal M (2013) Neural net-
optimisation toolkit. Appl Energy 101:310–316 work and polynomial approximated thermal comfort models for
2. Yang L, Yan H, Lam JC (2014) Thermal comfort and building hvac systems. Build Environ 59:107–115
energy consumption implications-a review. Appl Energy 22. Abdel-Nasser M, Mahmoud K (2019) Accurate photovoltaic
115:164–173 power forecasting models using deep lstm-rnn. Neural Comput
3. Pérez-Lombard L, Ortiz J, Pout C (2008) A review on buildings Appl 31(7):2727–2740
energy consumption information. Energy Build 40(3):394–398 23. Jain A, Smarra F, Behl M, Mangharam R (2018) Data-driven
4. Baniasadi A, Habibi D, Bass O, Masoum MAS (2018) Optimal model predictive control with regression trees-an application to
real-time residential thermal energy management for peak-load building energy management. ACM Trans Cyber-Phys Syst
shifting with experimental verification. IEEE Trans Smart Grid 2(1):4
1:1 24. Smarra F, Jain A, de Rubeis T, Ambrosini D, D’Innocenzo A,
5. Standard A (2017) Standard 55–2017 thermal environmental Mangharam R (2018) Data-driven model predictive control using
conditions for human occupancy. Ashrae, Atlanta random forests for building energy optimization and climate
6. Rojas JD, Kunusch C, Ocampo-Martinez C, Puig V (2015) control. Appl Energy 226:1252–1272
Control-oriented thermal modeling methodology for water- 25. Javed A, Larijani H, Ahmadinia A, Emmanuel R (2014) Com-
cooled pem fuel-cell-based systems. IEEE Trans Ind Electron parison of the robustness of rnn, mpc and ann controller for
62(8):5146–5154 residential heating system. In: 2014 IEEE fourth international
7. Afroz Z, Urmee T, Shafiullah G, Higgins G (2018) Real-time conference on big data and cloud computing. IEEE, 2014,
prediction model for indoor temperature in a commercial build- pp 604–611
ing. Appl Energy 231:29–53 26. Rahman A, Srikumar V, Smith AD (2018) Predicting electricity
8. Sturzenegger D, Gyalistras D, Morari M, Smith RS (2016) Model consumption for commercial and residential buildings using deep
predictive climate control of a swiss office building: implemen- recurrent neural networks. Appl Energy 212:372–385
tation, results, and cost-benefit analysis. IEEE Trans Control Syst 27. Yan Y, Luh PB, Pattipati KR (2017) Fault diagnosis of hvac air-
Technol 24(1):1–12 handling systems considering fault propagation impacts among
9. Chen X, Wang Q, Srebric J (2015) A data-driven state-space components. IEEE Trans Autom Sci Eng 14(2):705–717
model of indoor thermal sensation using occupant feedback for 28. Yao Y, Lian Z, Liu W, Hou Z, Wu M (2007) Evaluation program
low-energy buildings. Energy Build 91:187–198 for the energy-saving of variable-air-volume systems. Energy
10. Huang H, Chen L, Hu E (2015) A neural network-based multi- Build 39(5):558–568
zone modelling approach for predictive control system design in 29. Gers FA, Schraudolph NN, Schmidhuber J (2002) Learning
commercial buildings. Energy Build 97:86–97 precise timing with lstm recurrent networks. J Mach Learn Res
11. Serale G, Fiorentini M, Capozzoli A, Bernardini D, Bemporad A 3:115–143
(2018) Model predictive control (mpc) for enhancing building 30. Hochreiter S, Schmidhuber J (1997) Long short-term memory.
and hvac system energy efficiency: problem formulation, appli- Neural Comput 9(8):1735–1780
cations and opportunities. Energies 11(3):631 31. Alom MZ, Taha TM, Yakopcic C, Westberg S, Sidike P, Nasrin
12. Huang H, Chen L, Hu E (2015) A new model predictive control MS, Hasan M, Van Essen BC, Awwal AA, Asari VK (2019) A
scheme for energy and cost savings in commercial buildings: an state-of-the-art survey on deep learning theory and architectures.
airport terminal building case study. Build Environ 89:203–216 Electronics 8(3):292
123
32. Lipton ZC, Kale DC, Elkan C, Wetzel R (2015) Learning to Publisher’s Note Springer Nature remains neutral with regard to
diagnose with lstm recurrent neural networks. arXiv:1511.03677 jurisdictional claims in published maps and institutional affiliations.
33. Ashrae A (2002) Ashrae guideline 14: measurement of energy
and demand savings. Am Soc Heat Refrig Air-Cond Eng
35:41–63
123

LSTM-based Indoor Air Temperature Prediction Framework For HVAC Systems in Smart Buildings

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

LSTM-based Indoor Air Temperature Prediction Framework For HVAC Systems in Smart Buildings

Uploaded by

Copyright:

Available Formats

Neural Computing and Applications (2020) 32:17569–17585

LSTM-based indoor air temperature prediction framework for HVAC

Keywords HVAC LSTM Sequence-to-sequence Multi-step ahead predictions VAV CAV

1 Introduction according to ASHRAE standard 55-2017 [5]). Therefore,

Table 1 Comparison of models prediction

[20] ANN – Clustering VAV – MISO Yes

not take into account the context information (such as wind

Table 2 Control points

Table 3 Multivariate inputs variables

h Hour of day Integer h Hour of day Integer

Tiin Indoor temperature of zone i C Tiin Indoor temperature of zone i C

2 3 4.2 Data-driven LSTM-based framework

consideration of the temporal evolution of the features for

it ¼ rðWi ½ht1 ; xt þ bi Þ ð8Þ 4.3.1 Baseline models deployment

we design the LSTM model with four fully connected

Table 4 Effect of the number of

Table 5 Datasets and feature selection

1,4 0,3 0,14

1,2 0,25 0,12

(a) M A P E (b) M A E (c)RM SE

1,6 0,35 0,2

(d) M A P E (e) M A E (f)RM SE

(a) Z o n e 2 in C AV-b u ild in g (b) Z o n e 1-2 in VAV-b u ild in g

(e) LSTM -M IM O C AV-b u ild in g (f) LSTM -M IM O VAV-b u ild in g

Table 6 Execution time (in

MISO MIMO MISO MIMO MISO MIMO

Mean RMSE (℃)

Mean MAE (℃)

(a) M A P E (b) M A E (c) RM SE

1,6 0,8 0,4

Mean RMSE (℃)

Mean MAE (℃)

(d) M A P E (e) M A E (f) RM SE

with its standard deviation in terms of temperature for each 2

You might also like

it ¼ rðWi ½ht1 ; xt þ bi Þ ð8Þ 4.3.1 Baseline models deployment