Abdel Basset2021

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 15

Journal of Cleaner Production 303 (2021) 127037

Contents lists available at ScienceDirect

Journal of Cleaner Production


journal homepage: www.elsevier.com/locate/jclepro

PV-Net: An innovative deep learning approach for efficient forecasting


of short-term photovoltaic energy production
Mohamed Abdel-Basset a, Hossam Hawash a, *, Ripon K. Chakrabortty b, Michael Ryan b
a
Faculty of Computers and Informatics, Zagazig University, Zagazig, Sharqiyah, 44519, Egypt
b
Capability Systems Centre, School of Engineering and IT, UNSW, Canberra, Australia

a r t i c l e i n f o a b s t r a c t

Article history: Although photovoltaic (PV) energy production offers several environmental and commercial advantages,
Received 9 January 2021 the irregular nature of PV energy can challenge the design and development of the energy management
Received in revised form systems. Precise forecasting for PV energy production is therefore of vital importance to supply con-
12 March 2021
sumers to improve trust in functionality of the energy management system. Stimulated by current de-
Accepted 6 April 2021
Available online 12 April 2021
velopments in deep learning (DL) techniques as well as the promising efficiency in energy-related
applications, this study introduces a novel DL architecture, called PV-Net, for short-term forecasting of
Handling editor: Mingzhou Jin day-ahead PV energy. In PV-Net, the gates of the gated recurrent unit (GRU) are redesigned using
convolutional layers (called Conv-GRU) to enable efficient extraction of positional and temporal char-
Keywords: acteristics in the PV power sequences. The Conv-GRU cells are stacked in bidirectional (Bi-dir) blocks to
Photovoltaic energy enable modeling temporal information in forward and backward directions. The Bi-dir block is residually
Deep learning connected to avoid information loss across layers and to facilitate gradient flow during training. A real-
Short-term forecasting world case study from Alice Springs, Australia, is employed to evaluate and compare the performance of
Convolution gated recurrent units
the proposed PV-Net against recent cutting-edge approaches. The values of four performance measures
Residual learning
demonstrate the efficiency of the proposed PV-Net in terms of prediction precision and consistency.
© 2021 Elsevier Ltd. All rights reserved.

1. Introduction energy production (Pan et al., 2020). Thus, the growing number of
PV energy sources in smart grids results in substantial variations in
Photovoltaic (PV) energy production is one of the preferred energy production and consumption, leading to regularity devia-
renewable power sources that has attracted increasing global in- tion, voltage increase, and many other issues that require some
terest in recent years (Sharadga et al., 2020; Wang et al., 2019). The means of preserving the active equilibrium between consumer’s
efficient production, communication, delivery, and consumption of needs and provider’s supplies (Shi et al., 2020). Numerous solutions
electrical energy are reliant on instantaneous management have been offered, involving energy system planning, dispatches
(Heidari and Khovalyg, 2020), such that PV energy production (Guo et al., 2021), need reply (Roth et al., 2020), battery storage
schemes are accessible on both the production and load sides (Lee (Guo et al., 2021), and backup generators. Nevertheless, many
and Kim, 2020). PV energy requires instantaneous management challenges remain unsolved.
because the solar energy source is subject to arbitrary oscillations These obstacles include the vulnerability of the PV energy
and ambiguities, imposing considerable. output to meteorological circumstances, high-level machinery ex-
Challenges on the operation of the power system (Wang et al., penses, and fluctuations in energy production (Wang et al., 2020c).
2020c). For example, during hazy weather, the solar irradiance on Prediction of PV energy production has emerged as an intelligent
PV units is variable due to the movement of cloud formations, way for tackling such obstacles and the reliable prediction of the
causing rapid variations on the PV units resulting in variations in production of PV energy has been identified as a necessary
precondition to extensive PV deployment in power systems (Shi
et al., 2020). Such prediction is difficult, however, as PV energy
* Corresponding author.
sequences are typically irregular and complex due to unpredictable
E-mail addresses: mohamedbasset@zu.edu.eg (M. Abdel-Basset), hossamreda@ meteorological circumstances. The energy rise/drop ratio of backup
zu.edu.eg (H. Hawash), r.chakrabortty@adfa.edu.au (R.K. Chakrabortty), m.ryan@ generators is constrained with the entity ramp level, causing many
adfa.edu.au (M. Ryan).

https://doi.org/10.1016/j.jclepro.2021.127037
0959-6526/© 2021 Elsevier Ltd. All rights reserved.
M. Abdel-Basset, H. Hawash, R.K. Chakrabortty et al. Journal of Cleaner Production 303 (2021) 127037

Nomenclature and units ECMWF European Centre for Medium-Range Weather


Forecasts
PV photovoltaic (PV) RGB red, green blue
LTPVF long-term PV energy forecasting GHI global horizontal irradiance
MTPVF middle-term PV energy forecasting MA moving average
STPVF short-term PV energy forecasting AR auto regressive
ML Machine Learning GA genetic algorithm
ARMA autoregressive and moving average SDA similar day analysis
ARIMA autoregressive integrated moving average ANNs artificial neural networks
SARIMA seasonal autoregressive integrated moving average LRC linear regressive correction
SVM support vector machine ACO ant colony optimization
ELM extreme learning machine DOEs design of experiments
AI artificial intelligence IDBN improved deep belief network
CNNs convolutional neural networks WPD wavelet packet decomposition
RNN Recurrent Neural Network AM attention mechanism
LSTM Long Short-Term Memory AE Autoencoder
GRU Gated Recurrent Unit QCNN quantile CNN
DERs dispersed energy resources QR quantile regression
SG smart grid ConvLSTM convolutional LSTM
SI solar irradiation BN Batch Normalization
Bi-Dir bi-directional DKASC Desert Knowledge Precinct in Central Australia
NWP numerical weather prediction MSE Mean Squared Error
GDAS Global Data Assimilation System MAE Mean Absolute Error
CFS Climate Forecast System MAPE Mean Absolute Percentage Error
GFS Global Forecast System RMSE Root Mean Square Error

issues in fulfilling the increasing demand for energy production movement and physical state of climatological circumstances, but
(Huang et al., 2020). In the case of battery resources, very large the predictive performance of these approaches depends heavily on
energy storage capacity is difficult to achieve due to production the stability of meteorological conditions. Persistence approaches
expenses and storing volume constraints. Further, the unavailabil- generally suppose a high association between current and forth-
ity of data about user consumption means that it is challenging to coming values of the temporal energy data which are computed
balance demand with supply (Bullich-Massague  et al., 2020). presuming that the surroundings stay unaltered from the timestep
Similarly, the efficacy of such solutions essentially depends on the t to time t þ Dt (Mayer and Gro  f, 2020). However, the predictive
precision of prediction in different time horizons, thereby sup- performance of these approaches heavily relies on ancient mean
porting day-ahead planning for the grid (Theocharides et al., 2020). values (Roth et al., 2020). ML approaches seek to learn and model
Precise prediction offers the ability to manage arbitrary PV energy the association between ancient PV energy productions and the
on the smart grid (Shi et al., 2020), as well as enable data provision relevant meteorological factors, which in turn mean that the pre-
for demand and to extend the amenity lifetime of standby batteries dictive performance of ML approaches is very sensitive to the
by reducing periods of charge and release (Guo et al., 2021). temporal distance and the fineness of received input.
A number of research efforts have concentrated on the efficiency Consequently, a variety of statistical approaches have been
of predicting PV energy production and several predictive ap- widely adopted in PV energy forecasting in recent years involving:
proaches have been introduced (Ahmed et al., 2020; Bullich- autoregressive and moving average (ARMA), autoregressive inte-
Massague  et al., 2020; Huang et al., 2020; Moreira et al., 2021). grated moving average (ARIMA) (Liu et al., 2012), seasonal ARIMA
These methods can be split into three categories according to the (SARIMA) (Sharadga et al., 2020), regression techniques (Müller
prediction time horizon: long-term PV energy forecasting (LTPVF), and Trutnevyte, 2020), support vector machine (SVM) (Pan et al.,
middle-term PV energy forecasting (MTPVF), and short-term PV 2020), and extreme learning machine (ELM) (Zhou et al., 2020).
energy forecasting (STPVF) (Han et al., 2019; Heidari and Khovalyg, Nevertheless, some statistical approaches still have a number of
2020). LTPVF approaches consider from a single month to twelve limitations that need to be addressed, such as the requirement of
months; MTPVF approaches focus on a span of seven to thirty days; the ARMA method for stationarity of the temporal input. Through
and STPVF approaches describe the timeframe of a single week or a the rapid advances in artificial intelligence (AI), DL approaches have
smaller amount. LTPVF assists in managing long-standing PV en- been adopted to solve the limitations of traditional ML in many
ergy production, communication, and delivery, and guarantees the research fields (Shi et al., 2017). In this context, convolutional
trustworthy function of the energy management system; MTPVF neural networks (CNNs) (Huang and Wei, 2020) have been
supports decisions relating to the mid-term forwarding of the po- employed for PV energy forecasting due to their powerful feature
wer management system and STPVF approaches support energy extraction capability. Recurrent networks such as recurrent neural
management and consequently improve the trustworthiness of the networks (RNN) (Lee and Kim, 2020), Long Short-term Memory
energy management system (Oh et al., 2020). (LSTM) (Ma et al., 2020), and Gated Recurrent Unit (GRU) (Liu et al.,
Current PV energy forecasting approaches can be categorized 2019) have also been employed to model temporal representations.
into four main groups: physical approaches, persistence ap- Compared with the traditional physical, persistence, and ML ap-
proaches, statistical or machine learning (ML) approaches, and proaches, DL models have the ability to extract and model so-
deep learning (DL) approaches (Theocharides et al., 2020) Physical phisticated deep features inherent in the time series of PV energy,
methods employ scientific calculations to define the dynamic which results in a more efficient forecaster. Thus, this study focuses

2
M. Abdel-Basset, H. Hawash, R.K. Chakrabortty et al. Journal of Cleaner Production 303 (2021) 127037

on investigating a new DL framework to predict PV energy  Traditional statistical prediction approaches have the potential
production. to become trapped in local minimum and are unable to achieve
The usage of dispersed energy resources (DERs) has been widely acceptable results, due to the erratic nature of PV energy pro-
adopted in numerous countries because of recent advances in duction, particularly during hazy and wet periods. Due to the
smart grid (SG) technology, which means that renewable energy non-uniform and unsteady characteristics of the PV energy se-
(including PV power) is likely to be the main form of power in the quences, precise and steady forecasts cannot be acquired when
coming years. Consequently, there is a growing need for instanta- the initial PV energy data is explicitly passed to the prediction
neous consistency as well as safety of PV energy systems with the approach, specifically with volatile climatology. As a result,
increase in PV energy production within the SG. As a consequence, traditional STPVF approaches are unsuited to achieve acceptable
developing an efficient predictive technique for PV energy pro- prediction performance for current energy management sys-
duction has become of great interest. Several studies concentrate tems (Dairi et al., 2020; Li et al., 2020; Wang et al., 2019).
on architectural development and the tuning of parameters of B. Novelty and contributions
prediction techniques, which usually achieve better forecasts than
conventional techniques. Nonetheless, some forecasting ap- To bridge the present research gap and overcome the above-
proaches become trapped in local minima or saturation points and mentioned challenges, this study introduces a new DL architecture,
are unable to realize acceptable performance because of the very called PV-Net, for short-term PV energy forecasting. The contri-
unstable nature of the production of PV energy, particularly during butions of this work are:
rainy and cloudy weather (Feng et al., 2020). Since the features of
PV energy sequences are nonstationary and nonuniform, precise  An innovative deep learning framework is introduced for STPVF,
and constant prediction outcomes cannot be realized when the called PV-Net. The proposed PV-Net aims to process the his-
original PV energy data is simply exploited by the forecasting torical PV time series by redesigning the gates of the GRU cell
approach, specifically with volatile meteorological conditions. using dilated convolutional operation, hence capturing posi-
Therefore, traditional approaches for PV energy prediction are tional information at the same time as modeling temporal
unable to achieve the robust forecasting performance required for characteristics of input sequences.
the design of reliable power systems. Several studies have  The Conv-GRU cells are stacked in a bi-directional (Bi-Dir)
employed DL approaches to fine-tune the efficiency of PV energy manner to process PV information in both forward and back-
prediction or pertinent aspects (e.g., solar irradiation (SI)) (Kim ward directions.
et al., 2020; Wang et al., 2020a, 2020c). However, these ap-  To improve gradient flow and prevent missing information in
proaches emphasize only the PV energy sequence itself and over- later layers, a novel residual Conv-GRU module is designed,
look the meteorological aspects which are closely associated with where the residual link is implemented using one-dimensional
PV energy production. In areas with varied atmospheric conditions, convolution.
these approaches cannot detect deviations in PV energy, principally  Fair and extensive comparative experimentations on public case
in the case of small-time intervals, which are challenging to learn studies demonstrated the practical and statistical proficiency of
from historical data only. the PV-Net outperforming the recent cutting-edge deep
learning approaches for energy prediction.
A. Challenges and research gaps  The correlation between forecasting error and various length of
input intervals and seasonal changes are discussed, which
A comprehensive study of recent literature for PV energy fore- further demonstrates the stability and flexibility of PV-Net for
casting revealed a number of critical challenges to afford reliable various chronological data in the project design.
prediction performance of the PV energy. This paper aims to C. The study planning
address the following challenges:
The remainder of this work is organized as follows: Section 2
 PV energy production is associated with three irradiances of discusses the recent literature relevant to short-term forecasting
diffuse i.e., extraterrestrial SI, direct normal irradiance, and of PV energy. Section 3 discusses the design and the building of the
horizontal irradiance: and with a variety of meteorological proposed PV-Net, parameters, and relevant training. Section 4
conditions (i.e., humidity, temperature, etc.). Thus, integrating presents the experiments, results, analysis, and the discussion.
PV energy collection with current renewable energy systems is a Section 5 introduces the key limitations of this study and, finally,
considerable challenge that necessitates a new approach that is the conclusions of this study and future directions of work are
able to recognize the complicated data characteristics (Lee and introduced in Section 6.
Kim, 2020; Shi et al., 2020).
 Assimilating PV energy systems to fulfil the growing electrical 2. Related work
needs without utilizing energy backup storage is technologically
infeasible since such systems influence grid constancy. Specif- This section outlines the most recent and pertinent research,
ically, alterations in climatological conditions cause ambiguity classifying the PV forecasting approaches into four main categories:
in PV production (Lee and Kim, 2020) such as triggering irreg- physical approaches, persistent approaches, ML approaches, and DL
ular breakthroughs, voltage rises, opposite energy streams, and approaches.
diversities in voltage waveforms and frequency deformation.
This volatile production impacts the dependability, steadiness, A. Physical approaches
and planning of the energy system. Consistent forecasting for
the PV energy production will considerably decrease this un- Physical approaches include orography, impediment, tempera-
certainty, improve solidity, and enhance commercial sustain- ture, surface roughness, pressure, obstructions, and disruption of
ability. Thus, developing an efficient STPVF has become a critical weather for future forecasts (Mayer and Gro  f, 2020). These ap-
research field in recent years (Dairi et al., 2020; Heidari and proaches are mostly more trustworthy when used for long-term
Khovalyg, 2020). prediction (Han et al., 2019) and can be divided into two primary
subcategories according to the scale: mesoscale approaches and
3
M. Abdel-Basset, H. Hawash, R.K. Chakrabortty et al. Journal of Cleaner Production 303 (2021) 127037

global-scale approaches. Meso-scale approaches consider the at- instantaneously by disintegrating the prediction GHI material into
mospheric characteristics in a limited region such as provinces, the calculation of sharp-sky GHI and undertaking the sharp-sky
states, territories, or continents (Lee and Kim, 2020); while global- index (Wang et al., 2020b). Nevertheless, the sharp-sky index is
scale approaches consider the weather features at a universal scale. unable to cope with the deviation of the solar climax gradient
For example, the numerical weather prediction (NWP) method owing to variations of meteorological states such as a storm, rain-
integrates atmospheric calculations and meteorological conditions fall, and cloud within the prediction time horizon (Wang et al.,
for forecasting. For these global NWP methods, around fifteen 2020c).
climate services accessible for operative data acquisition including
the Global Data Assimilation System (GDAS), Climate Forecast C. Statistical or machine learning approaches
System (CFS), Global Forecast System (GFS), etc. they are dominated
by governments or funding corporations like the European Centre Statistical approaches utilize the chronological sequences and
for Medium-Range Weather Forecasts (ECMWF). These NWP instantaneous engendered time-series. They take a small number
techniques predict weather conditions for over fifteen days in the of input samples and demonstrate improved STPVF performance
future (Ahmed et al., 2020; Huang et al., 2020), then employ compared with conventional NWP techniques. These ML ap-
concurrently a group of mathematical calculations for tangible proaches utilize genuine statistical calculations to model the rep-
conditions and the dynamic attributes of the atmosphere (Roth resentations and interrelationships inherent in the PV time series.
et al., 2020). Normally, the essential procedure has its own moving average
Reliable PV energy forecasts can be achieved by these ap- (MA), and, curvature tailoring, and auto-regressive (AR) algorithms
proaches when the meteorological conditions remain stable, (Liu et al., 2012). Such algorithms reduce forecast errors by
however erroneous forecasts result in cases of rapid fluctuations in approximating the discrepancy between the realistically quantified
meteorological conditions. In (Wang et al., 2020b), the authors historical value and the forecasts of PV production. Thus, fore-
introduced a novel surface irradiance mapping technique for casting performance hinges on the characteristics and components
minute-level PV energy prediction, where the mapping correlation of the received time series. As an example, the authors of (Zhou
between SI and sky image is modeled by extracting the et al., 2020) adopted the ELM for modeling input characteristics, a
redegreeneblue (RGB) units and the pixel-wise information in the genetic algorithm (GA) for optimizing the relevant parameters and
sky images. In (Mayer and Gro  f, 2020), the authors contrasted the a customized similar day analysis (SDA) to forecast hourly PV power
accomplishment of nine beam and diffuse separation methods for production by estimating the similarity between various days
physical PV energy forecasting based on the computation of nu- depending on five meteorological components. The authors of
merical weather data. The study included ERBS(Erbs et al., 1982), (Theocharides et al., 2020) presented an ML framework for pre-
SKARTVEIT (Skartveit and Olseth, 1987), DISC(Bird and Hulstrom, dicting day-ahead PV energy production by including artificial
1981), DIRINT (Ineichen et al., 1992), DIRINDEX (Perez et al., neural networks (ANNs) to model input sequences, K-means to
2002), BRL (Ridley et al., 2010), ENGERER (Engerer, 2015), STARKE cluster the weather information, and a linear regressive correction
(Starke et al., 2018), ABREU (Abreu et al., 2019), where the methods (LRC) algorithm to optimize the generated results. In (Pan et al.,
were selected based on simplicity, efficiency, and novelty and were 2020), the authors employed an SVM for ultra-short-term PV po-
chosen as the best from previous studies. However, these methods wer prediction and adopted ant colony optimization (ACO) for
do not deliver the best results in computing energy forecasts as finding the optimal SVM’s parameters. In (Moreira et al., 2021), the
they are only validated for a single pleasant climatic area and one authors introduced a new method for week-ahead PV generation
NWP forecast supplier. The testing of the methods of various NWP- forecasting using an ensemble of ANNs where the design of ex-
based irradiance predictions suffers from the lack of consistent periments (DOE) technique was employed for input factors, and
generation of data of functioning PV factories together with proven cluster analysis was conducted to select the best networks. The
model constraints. In (Yin et al., 2020), the authors proposed to authors of (Zhang et al., 2020b) introduced a hybrid framework that
reduce the forecasting error caused by NWP by scrutinizing the integrates improved variational mode decomposition (IVMD) to
distribution of attributes of prediction error, then using immediate decompose PV data into normal and abnormal elements, ARIMA to
and chronological data to reduce the error. Despite a large number predict normal elements, and improved deep belief network (IDBN)
of studies investigating physics-based PV forecasting, they have to forecast the abnormal elements. In (Sun et al., 2020), the authors
multiple shortcomings: 1) they lack robustness against ambiguous introduced a probabilistic framework that employs Bayesian theory
and irregular conditions; 2) they lack adjustive and self-sufficient to generate weather scenarios to be passed to an ML-based multi-
functionality; 3) they fail to provide reliable real-time decision model solar energy prediction approach.
making (Wang et al., 2020b); and 4) they are unable to model Despite the wide adoption of ML approaches, they still have a
complex features and random characteristics. Consequently, the number of critical shortcomings including 1) they rely heavily on
physics-based approaches are not generally applicable for efficient the quality of employed feature engineering; 2) they cannot effi-
PV energy forecasting. ciently deal with complex features scenarios with high irregularity;
and 3) they exhibit unstable predictive power meaning that they
B. Persistent approaches are inappropriate as a component for reliable decision making
necessary for any power system. As a result, ML approaches are
Persistence approaches are commonly used for very short-term often combined with other techniques to offer more precise
and short-term prediction as they have low computational burden, forecasting.
short wait times, and acceptable performance. These approaches
are based on the idea that the current day is similar to the next day, D. Deep learning approaches
which means that the weather conditions (i.e., SI) for a 24 h ahead
is likely to stay comparable to the earlier day (Zhang et al., 2020a). Recently, DL approaches have been shown to have a competitive
As an example, when the climate of the current day is sunny with forecasting performance overcoming the main limitations of con-

35 Celsius then the prediction will be performed by assuming that ventional ML approaches and alleviating the reliance on complex

the temperature of the next day is also sunny with 35 Celsius. feature engineering. In view of this, the authors of (Wang et al.,
Consequently, it can calculate global horizontal irradiance (GHI) 2020c) investigated the potential of DL for solar power prediction
4
M. Abdel-Basset, H. Hawash, R.K. Chakrabortty et al. Journal of Cleaner Production 303 (2021) 127037

and presented a taxonomy for categorizing the present AI methods reflect the normal behavior of PV data. Some values could be
according to their discrepancies and similarities. In (Li et al., 2020), missing value due to system failures or similar conditions, so these
the authors introduced a new DL framework that integrates missed values are replaced with the mean value of the time win-
wavelet packet decomposition (WPD) and LSTM networks to pre- dow. The data features can be normalized to allow them to be
dict an hour-ahead PV energy using a 5-min window. In (Heidari compared more usefully. Since the CNNs and LSTMs have obliga-
and Khovalyg, 2020), the authors decomposed the input time- tions for the structure of the input data, it is essential to convert the
series and then employed LSTM augmented with attention mech- PV time-series into a suitable format. After the data is reformatted,
anism (AM) to model PV power forecasts. Similarly, the authors of they should be split into training, validation, and test sets in the
(Zhang et al., 2020c) combined LSTM and Autoencoder (AE) to es- proportions 60%, 20%, and 20% respectively. Moreover, the hyper-
timate uncertainties in input series, while modeling complex parameters of the proposed PV-Net are also initialized before
weather conditions for efficient day-ahead PV energy forecasting model training starts.
using a 15-min window. Further, the authors of (Dairi et al., 2020)
employed Variational AutoEncoder (VAE) for short-term PV fore- B. Spatial-temporal Conv-GRU
casting owing to the ability of the VAE to model non-linear time-
series statistics in both single-step-ahead and multi-step-ahead Recent DL literature has demonstrated the efficiency of recur-
predictions. In the same way, IDBN has been used to model the rent networks in modeling dependencies in sequential data.
irregular patterns in PV energy time series (Zhang et al., 2020b). In Among them, LSTM has been widely employed due to its ability to
(Wang et al., 2020a), the authors introduced a day-ahead PV power handle long-term dependencies without vanishing gradient.
prediction framework using LSTM followed by a modification However, energy time series usually contain positional information
technique to adjust the output LSTM network according to the time and long-term correlations, which means that the LSTM is unable
correlation standards concerning various forms of daily PV energy. to capture an important portion of input characteristics that can
In addition, the authors of (Huang and Wei, 2020) introduced a day- significantly affect prediction performance. The authors of (Shi
ahead probabilistic PV energy prediction framework, where an et al., 2015) created a new sequence to sequence the convolu-
improved quantile CNN (QCNN) was employed for deep feature tional LSTM (ConvLSTM) layer by redesigning the input and state
extraction from input PV power data, then quantile regression (QR) gates using convolutional layers aiming to leverage spatial-
was adopted to produce the PV energy probability distribution temporal interrelationships to provide more efficient modeling
from the acquired representations. Despite the success recently input sequences. Nevertheless, the large amount of training pa-
achieved by DL approaches, research in this area is still in the early rameters presented by the ConvLSTM complicates the training of
stages requiring more investigation. Thus, this study focuses on an the DL model. In an attempt to tackle this issue, the authors of (Shi
innovative DL model to improve the efficiency of PV energy et al., 2017) recently introduced the ConvGRU cell, which has been
forecasting. shown to have fewer parameters compared with the ConvLSTM.
The prediction performance of a conventional statistical The empirical findings demonstrate that the Conv-GRU has a
approach is unsatisfactory for large-size, irregular, and high- powerful spatial-temporal learning capability resulting in further
dimensional data. However, such approaches can perform better performance improvement. However, the ConvLSTM and ConvGRU
by hybridizing them with other feature engineering methods. have not been investigated for short-term forecasting of PV energy.
Currently, DL approaches have emerged as data-driven approaches Inspired by this, we propose the redesign of the input and state
to resolve the main limitations of physical, persistent, and statistical gates of conventional GRU using a dilated causal convolutions layer
approaches and alleviate the need for complex feature engineering (Oord et al., 2016) due to the demonstrated efficiency in modeling
owing to their powerful feature extraction capability that enables time-series information. Accordingly, the spatio-temporal charac-
them to learn both regular and irregular representations from PV teristics of feature maps can be effectively captured with the
energy time-series. Studies applying DL to PV energy prediction are redesigned Conv-GRU, and inner computations of each gate can be
still in the early stages and are mostly limited to RNNs or AEs. calculated with Eqs. (1)e(4).
Therefore, this paper addresses the task of PV energy forecasting
using a new DL approach called PV-Net. Zt ¼ sðWxz * Xt þ Whz * ht1 Þ (1)

3. Methodology
rt ¼ sðWxr * Xt þ Whr * ht1 Þ (2)
This section describes and discusses the details of designing the
proposed PV-Net presented in Fig. 1, in which the conventional GRU b
h t ¼ f ðWxh * Xt þ rt 1 ðWhh * ht1 ÞÞ (3)
cell is redesigned using dilated convolutions (hence the name
Conv-GRU) to empower the network to capture spatial-temporal
representations of PV data. The residual Conv-GRU is introduced ht ¼ ð1  zt Þ1 b
h t þ zt *ht1 (4)
to empower the network learning capability. The stacked Bi-Dir
In this paper, we use s, *, and 1 to denote a sigmoid function,
modules are employed to enable the processing of information in
dilated casual convolution function, and Hadamard product,
both forward and backward directions.
correspondingly, where rt and Zt represent reset and update gates,
respectively. Given Xt representing a tensor as input, the unit first
A. Data preparation
calculates the outputs corresponding to the update gate and reset
gate by Eqs. (1) and (2), respectively. Based on the sigmoid func-
The real-world PV data captured at PV plants is at different time
tion, the value of zt and rt lies in the interval [0,1]. Then, the hidden
intervals i.e., minutely, hourly, daily, weekly, etc., so several pre-
temporal state h b t is calculated with Eq. (3), depending on the
paratory tasks are required before using that data to train any DL
model. First, we need to select the time and size of PV energy data current input and prior hidden state ht1 . In Eq. (4), the last hidden
required for model training and evaluating the DL model. The data state ht of the unit is computed as a linear grouping of the present
could contain abnormal samples resulted from nasty situations, so temporal hidden state h b t and the former one h . In the Conv-GRU,
t1
these samples are removed from the original data as they do not the forward dependencies are considered, however the overall
5
M. Abdel-Basset, H. Hawash, R.K. Chakrabortty et al. Journal of Cleaner Production 303 (2021) 127037

Fig. 1. Systematic diagram indicating the pipeline of PV energy forecasting using the proposed PV-Net.

dependency information should be fully exploited; hence, it could


be advantageous to take into consideration the backward de- !)
pendencies because of the proven efficiency of leveraging both X ¼ Bi  dirðXÞ (5)
forward and backward sequential patterns in improving model
performance. As a result, in our proposed architecture, we adopted
Bi-directional (Bi-Dir) blocks that utilize two Conv-GRU layers to X Conv ¼ ConvðXÞ (6)
process the incoming feature maps in both forward and backward
directions and then deciding the upcoming output based on the
 ! 
dependencies among data in both directions. Then, the spatial-
Oi ¼ Concat X ; X Conv (7)
temporal outcome of the Bi-Dir block is computed as Yt ¼
! ! ) )
tanhðWt H * H t þ Wt H *H t Þ, in which, Y2R Fl Hl Wl , tanh denotes where O represents the output after the first residual link. The same
tangent line function, and the hidden states of both layers are calculation could be repeated according to the number of Bi-Dir
! )
blocks which is determined based on sensitivity experiments dis-
represented as H t and H t . Additionally, a dropout layer (Srivastava
et al., 2014) is applied after Conv-GRU cells to avoid overfitting cussed later.
problems during training which could limit prediction perfor- At the end of the Residual Conv-GRU module, the Batch
mance. The amount of dropout rate applied is set to 0.3, which is Normalization (BN) (Ioffe and Szegedy, 2015) layer is introduced to
empirically validated in a later section. alleviate the impact of periodicity and nonstationary nature of the
PV data on the performance of the proposed PV-Net. In other
C. Residual Conv-GRU learning words, a modest, but useful DL layer that enabled mitigating the
problem of diversity of distributions of activations, which slowed
Recently, the residual learning of CNNs has demonstrated its training of each layer due to the time is taken to adjust for new
efficiency in acquiring sophisticated discriminatory deep repre- iteration distribution, and allowed regularizing the received PV
sentations from visual data i.e., image, videos (He et al., 2016). The energy sequences, while taking into consideration the PV data
most important benefit from employing the residual connection is distribution. Given Olast as the output of the last residually con-
that it enables the development of deeper DL models (i.e., hundreds nected Bi-Dir blocks, the output of the Residual Conv-GRU module
of layers) without data leakage in deep layers while preserving the can be computed according to Eq. (8).
ease of gradient flow during training. Motivated by this strong
representational learning capability, this study proposes to inves- Y ¼ Batach  NormalizeðOlast Þ (8)
tigate the advantage of this learning technique in PV-Net to enable
effective modeling of complex and non-stationary characteristics of As shown in Fig. 1 the output of the last residual Conv-GRU
PV energy data. In particular, to fit this to the studied case, a novel module represents the forecast PV energy production for a given
residual Conv-GRU is introduced as shown in Fig. 2. Where the input. Finally, the proposed PV-Net is trained to optimize the mean
input X is passed into the first Bi-Dir block to capture the forward squared error (MSE) loss during the training process, where the
!) computation of MSE loss is articulated in Eq. (9).
and backward spatial-temporal representations X , then the
outputs of Bi-Dir blocks are concatenated with their input after
being convolved with 1  1 convolutional kernel to allow the N  2
1 X
capture of more spatial information through layers. Hence, the MSE ¼ ~i
yi  y (9)
N i¼1
pipeline of residual calculation could be formulated as shown in
Eqs. (5)e(7).
~ denote
where N represents the count of training elements, y and y
the predicted and the actual load values, respectively.
6
M. Abdel-Basset, H. Hawash, R.K. Chakrabortty et al. Journal of Cleaner Production 303 (2021) 127037

Fig. 2. Systematic diagram representing the architecture of the proposed Residual Conv-GRU module.

4. Experiments and analysis active power.


It could be noted from any sub-figures that the energy during
A. Implementation setup the noontime is the highest, which is comparatively minor at
sunrise and in the mid-afternoon, having a tendency to be zero at
The design of all models is performed using the Keras 2.4.3 API nightfall. PV energy between contiguous timesteps is influenced by
installed on the TensorFlow-GPU2.2.0 as the backend, and all run in a variety of aspects and differs significantly without powerful
a Python 3.7 environment. All these software tools are installed on consistency. Based on the observations of different sub-figures, the
a Dell workstation operated with a Windows 10 64-bit system and energy discrepancy between the equal times periods in different
supplemented by an Intel (R) Xeon (R) CPU E5-2670 0@ 2.60 GHz years is also considerable and it is difficult to discover the regime
processor, a graphical processing unit of NVIDIA Quadro K2200, and because of the variability in the physical environment, which in
256 GB of memory. Through all experiments, MSE is used to opti- turn introduces high ambiguity. Fig. 4 shows the map of solar po-
mize the network parameters. The Adam optimizer is adopted due wer installations in Alice Springs, Central Australia, which are
to its well-known rapid convergence and is initialized with 0.0005 distributed over 38 locations, among them one location (i.e., 1) is
as a primary learning rate, which is split by a factor of 10 following separated into 1A and 1B groupings. The PV energy time-series
every 200 iterations. The size of the training batch is set to be 64, employed here came from sector 1B. The detailed material and
while training epochs is set to be 150. Uniform initialization is specifications of all solar power systems are publicly available.
employed to calculate the initial values of model parameters. The experimental data primarily involves wind direction (A ^  Þ;
diffuse horizontal radiation (w=m2  sr), global horizontal radia-
B. Case study tion ((w=m2  sr), weather relative humidity (%), weather tem-
perature Celsius ( C), wind speed (m=s), active power (kW), current
In this study, the PV data from 1B Desert Knowledge Precinct in phase average (A), etc. The data get prepared before fed into the
Central Australia (DKASC), Alice Springs PV system was chosen as a model by eliminating abnormal samples and substituting the
case study. The installation configuration of the DKASC PV system is missed elements, while the samples for each attribute is normal-
tabulated in Table 1. According to (DKASC, 2020), the case study ized to be scaled into the equal dimension, while a simple
samples can be easily downloaded to the workspace. The time se- augmentation technique is applied to expand the size of data (for
ries of 5-min resolution is captured throughout five years example, the sine value and cosine value are employed to augment
(2015e2019). Fig. 3, presents a visualization of the historical PV wind direction). The data is divided into two parts: 80% for training
time-series of successive days in that period. Each part of the figure and 20% for testing. The training set is divided in the same way with
represents the historical data of diverse years yet in an equivalent 80% for training and 20% for validation.
date of the year for a period of one week, whereas the horizontal
axis denotes the timestep and the vertical axis denote the relative C. Evaluation metrics

This subsection presents the evaluation measures employed to


assess the predictive efficiency of the proposed PV-Net:
Table 1
The specification of the power system of DKASC, Alice Springs (1B group).
 Mean Absolute Error (MAE): The MAE denotes the mean sig-
Array tilt/Azimuth Variable : Dual axis tracking: nificance of the forecasting errors and is computed as the
Installation completed Thu; 8 Jan 2009 average of the absolute divergence between forecast and real
Inverter size/type 4 x 6 kW; SMA SMC 6000A values, as shown in Eq. (10).
Type of tracker DEGERenergie 5000NT; dual axis
Array area 4 x 38:37 m2
N  
Panel type Trina TSM  195DC01A
Number of panels 4 x 30 1 X y i  y


MAE ¼ ~ (10)
Panel rating 195W N i¼1  i 
Array rating 23:4kW

7
M. Abdel-Basset, H. Hawash, R.K. Chakrabortty et al. Journal of Cleaner Production 303 (2021) 127037

Fig. 3. Visualization of one-week samples of our case study using 5 min resolution a) 2015 samples; b) 2016 samples; c) 2017 samples; d) 2018 samples; e) 2019 samples.

Fig. 4. Visualization of the map of DKASC power system (DKASC, 2020).

8
M. Abdel-Basset, H. Hawash, R.K. Chakrabortty et al. Journal of Cleaner Production 303 (2021) 127037

 Mean Absolute Percentage Error (MAPE): The MAPE evaluates outperforming all the competing data-driven approaches which
the mean absolute percentage of the forecasting performance of illustrates the ability of the proposed model to model seasonal/
the model as expressed in Eq. (11). periodical information in complex non-linear relationships and
irregular patterns within the received PV power data.
 
  For a comprehensive understanding of the model’s behavior,
N  
1 X  i
y  ~
y i Fig. 5 presents a comparison of the forecasting errors of PV-Net
MAPE ¼    100% (11)
N i¼1  yi  with other competing approaches using distinct intervals of input
 
sequences. It is observable that employing a PV time-series data
with an interval of half-year results in the worst forecasting per-
formance, mostly because the short length input did not promote
 Mean Square Error (MSE): The MSE measure represents the
effective training of any of the implemented DL approaches,
average squared variation between the expected and the real
resulting in greater forecasting error greater. Once the range of
values, as formulated in Eq. (9).
input sequence is increased to a length of one year, the performance
 Root Mean Square Error (RMSE): The RMSE metric represents
of the model increases and PV-Net realizes the best performance
the standard deviation corresponding to forecasting errors and
(MAE: 0.786; MPAE: 10.86%; RMSE: 0.976). Compared with GRU,
is a frequently applied measure in prediction/regression analysis
the values of RMSE, MAE, and MAPE of PV-Net were enhanced by
and is calculated with Eq. (12).
0.611, 0.087, and 3.4% correspondingly. Compared with CLSTM, the
vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi PV-Net results of RMSE, MAE, and MAPE were enhanced by 0.465,
u N  2
u1 X 0.035, and 2.1%, in that order. This is primarily because increasing
RMSE ¼ t yi  y ~i (12) the length of the input interval allows the recurrent networks to
N i¼1
progressively learn the temporal characteristics from input se-
quences, whereas the spatial information becomes comparatively
poorer. Hence, CNN is unable to effectively model important data
D. Results and comparisons
representations, and the LSTM/GRU performs better. PV-net has the
best performance because it can effectively capture both kinds of
This sub-section presents the results achieved by experimenting
information even in the case of short sequences.
with the proposed PV-Net on the test set of our case study as shown
It can be seen that increasing the length of input interval to 3
in the last row of Table 2. At the same time, we compare these
years, the prediction error decreases for most competing methods.
results with the corresponding ones achieved by recent state-of-
In particular, CNN achieved the highest RMSE of 1.563, while PV-
the-art approaches implemented under the same settings as pre-
Net achieved the lowest RMSE of 1.034 with 0.179 reductions in
sented in Table 2.
forecasting error. For MAE, the worst performance of 0.799 is
It can be noted that the CNN (Wang et al., 2019) had the highest
realized by LSTM while the best performance of 0.583 is achieved
forecasting error (MAE: 0.6528; MAPE: 8.98%; MSE: 1.1389; RMSE:
by PV-Net. For MAPE, the highest error of 9.51% is achieved by CNN
1.2971) which can be explained by the nature of convolution layers
while the best performance of 0.0873 is achieved by PV-Net. The
failing to capture the temporal representations inherent in the
CLSTM and LSTM þ AM keep performing better than the LSTM and
historical PV time series. It is also observable that the LSTM archi-
CNN. However, in contrast with CLSTM, the RMSE, MAE of PV-Net
tecture presented in (Sharadga et al., 2020) has better forecasting
rose by 0.2, and 0.069, correspondingly. Compared with CLSTM,
performance (MAE: 0.6437; MPAE: 8.32; MSE: 1.0637; RMSE:
the RMSE, MAE of PV-Net rose by 0.313, and 0.181, correspondingly.
1.1314) in comparison with the CNN (Heidari and Khovalyg, 2020).
This can be justified by the fact that the CLSTM architecture in-
This indicates the prominence of modeling the seasonal and peri-
cludes LSTM to model temporal representations, while the CNN
odical information on improving the prediction performance.
layers are introduced to capture the hidden positional representa-
Additionally, integrating the LSTM and CNN in (Wang et al., 2019)
tions of data. Thus, the forecasting performance of CLSTM is much
shows great improvement in forecasting performance across
better than that of the approaches that depend only on CNN or
different measures (MAE: 0.4215; MPAE: 7.53; MSE: 0.9538; RMSE:
LSTM.
0.9097) which explains the effectiveness of modeling both posi-
Although the competing approaches perform the worst in our
tional and historical representations in PV energy data. In the same
comparative experiments, previous studies show that their fore-
way, combining LSTM and AM in (Heidari and Khovalyg, 2020) has
casting performance is better than conventional ML approaches, so
demonstrated similar performance improvement ((MAE: 0.4824;
we did not consider comparing PV-Net with conventional ap-
MAPE: 7.10%; MSE: 0.9763; RMSE: 0.9531) as it enables the model
proaches. Nevertheless, it is notable in Fig. 5 that forecasting per-
to attend more to the most crucial features throughout the learning.
formance does not always improve with the increase of the length
Moreover, the adoption of the GRU in (Lee and Kim, 2020) shows
of input intervals and degrades in some cases. This could be
robust forecasting performance over the CNN and LSTM architec-
explained by the characteristics of the PV energy data itself. In
tures (MAE: 0.4617; MAPE: 7.31; MSE: 0.9807, RMSE: 0.9617). More
terms of time series, it does not necessarily follow that more data
outstandingly, the proposed PV-Net has the lowest prediction error
will result in better prediction. When the data increases to a
(MAE: 0.3987; MAPE: 6.61%; MSE: 0.9013; RMSE: 0.8124)

Table 2
A comparative analysis of day-ahead forecasting for different competing methods on our test set using different evaluation measures.

Methods MAE MAPE (%) MSE RMSE

CNN (Wang et al., 2019) 0.6531 8.988 1.1389 1.2971


LSTM (Sharadga et al., 2020) 0.6373 8.321 1.0637 1.1314
CLSTM (Wang et al., 2019) 0.4215 7.532 0.9538 0.9097
LSTM þ AM (Heidari and Khovalyg, 2020) 0.4824 7.106 0.9763 0.9531
GRU (Lee and Kim, 2020) 0.4617 7.312 0.9807 0.9617
*PV-Net 0.3987 6.614 0.9013 0.8124

9
M. Abdel-Basset, H. Hawash, R.K. Chakrabortty et al. Journal of Cleaner Production 303 (2021) 127037

Fig. 5. Evaluating the impact of using different input intervals on the forecasting errors of competing models a) the models’ RMSE error for different ranges of input sequence. B)
models’ error for different ranges of input sequence.; c) models’ MAPE error for different ranges of input sequence.

particular degree, the input information is further from the pre- (Jun.eAug.). For every season, Fig. 6 shows the forecasting errors of
dicted outcome, and consequently becomes less correlated with competing methods in term of RMSE, MAE, and MAPE. It is
the models’ prediction; thus, increasing the data might decrease observable that the proposed PV-Net surpasses the competing
forecasting performance. To analyze the performance of the pro- approaches for short-term PV energy prediction in all seasons,
posed PV-Net more intuitively, we present the RMSE, MAE, and attaining the lowest RMSE of 0.798 in the autumn and the highest
MAPE of competing approaches using different input intervals as RMSE of 0.829 in the summer as shown in Fig. 6 (a). PV-Net ach-
depicted in Fig. 5 (a)e(c). ieves the lowest MAE of 0.391 in the spring and the highest MAE of
Motivated by the fact that PV energy is greatly affected by sea- 0.408 in the autumn, as shown in Fig. 6 (b). PV-Net achieves the
sonal factors, this study comparatively evaluates the performance lowest MAPE of 0.056 in the spring and the highest MAPE of 0.074
of the proposed PV-Net against other competing approaches on in the winter, as shown in Fig. 6 (c). This further confirms the su-
seasonal test data. The original test set is partitioned into four sub- periority of the proposed PV-Net for seasonal changes compared
groups related to the four seasons in Australia, specifically autumn with the competing approaches.
(Mar.eMay), summer (Dec.eFeb.), spring (Sept.eNov.), and winter

10
M. Abdel-Basset, H. Hawash, R.K. Chakrabortty et al. Journal of Cleaner Production 303 (2021) 127037

Fig. 6. The results of experimenting the competing methods on different seasonal data a) the RMSE results; b) the MAE results; c) the MAPE results.

E. Ablation experiments In order to validate the effectiveness of the redesigned Conv-


GRU, we implement the proposed PV-Net using different variants
The main objective of this sub-section is to comprehensively of recurrent cells and report the corresponding RMSE results in
inspect and evaluate the effect and the contribution of different Fig. 7. It could be seen that implementing the PV-Net using the
components on the final hour-ahead load forecasting. In these ex- SimpleRNN shows the highest prediction error (RMSE: 0.8531). The
periments, we chose LSTM (Abreu et al., 2019) as a robust baseline LSTM and GRU cells improve the performance of PV-Net with
architecture for the proposed PV-Net. 0.0262 and 0.0216, respectively. Compared with LSTM, employing
the ConvLSTM cells in PV-Net improves the performance by 0.0318.
1) Impact of Conv-GRU More importantly, the proposed Conv-GRU cells shows great

11
M. Abdel-Basset, H. Hawash, R.K. Chakrabortty et al. Journal of Cleaner Production 303 (2021) 127037

Fig. 7. The ablation experiments for analyzing the impact of Conv-GRU.


Fig. 9. The ablation experiments for analyzing the impact of BN layer.

performance improvements outperforming other conventional


cells with 0.8124 RMSE which in turn validates the effectiveness of performance with 0.0365, 0.0226, and 0.0151 on RMSE, MAE, MAPE
Conv-GRU in modeling the spatial-temporal information inherent in that order. This explains the influence of the BN layer in reducing
in the PV energy data. the negative impact of the nonstationary nature of the data.

2) Impact of residual linking 4) Impact of Bi-directional layers

For experimenting with the efficacy of the introduced residual To evaluate the importance of the Bi-Dir block in the design of
link, variants of PV-Net were designed by implementing the re- PV-Net, an additional experiment was performed to evaluate the
sidual Conv-GRU module using no residual link, convolutional re- performance of PV-Net when the Conv-GRU is designed in a uni-
sidual connection (He et al., 2016), average-pooling connection, directional or bi-directional fashion and the resultant values are
max-pooling connection, and the corresponding RMSE results in presented in Fig. 10. It is notable that adopting bidirectional
Fig. 8. It is notable that implementing the PV-Net using the no re- learning reduces the forecasting error with 0.1107, 0.054, and
sidual link shows the highest prediction error (RMSE: 0.9071). 0.0231 on RMSE, MAE, and MAPE respectively. This explains the
Designing the residual link using the average-pooling or max- role of considering backward information during the learning
pooling layer improves the performance of PV-Net with 0.0359 process.
and 0.0498, respectively. The adoption convolutional residual
connection reduces the prediction error to 0.8473. More notably, 5) Impact of dropout
the proposed residual learning shows great performance im-
provements outperforming other residual techniques with RMSE of The dropout layer is often employed to avoid overfitting issues,
0.8124 This demonstrates that the designing residual link using hence determining an appropriate dropout rate necessitate exper-
convolutional empowers the network capability to model posi- imenting with the PV-Net with different rates as depicted in Fig. 11,
tional information of input PV sequence and avoids information which shows that PV-Net achieves the best RMSE with the dropout
loss between layers and alleviating the vanishing gradient problem. rate of 0.5 and achieves the optimal MAE when the dropout rate is
0.3. The lowest MAPE is attained when the 0.5 dropouts are
3) Impact of BN layer employed.

To examine the efficiency of employing the BN layers, we F. Sensitivity analysis


compare two variants of PV-Net with and without the BN layer.
Fig. 9 shows that applying the BN layer improves the prediction Given that the Residual Conv-GRU modules are stacked to

Fig. 8. The ablation experiments for analyzing the impact of Residual learning. Fig. 10. The ablation experiments for analyzing the impact of bidirectional learning.

12
M. Abdel-Basset, H. Hawash, R.K. Chakrabortty et al. Journal of Cleaner Production 303 (2021) 127037

Fig. 11. The ablation experiments for analyzing the impact of dropout rates.

capture the spatial and temporal characteristics of input sequences, Table 3


sensitivity analysis experiments were executed on the test set to The P-values calculated using the paired t-test on the test set of the two datasets
using MAE, AEP, and RMSE.
investigate the effect of the number of layers/modules on the
forecasting performance. Fig. 12 shows that the best performance Methods Test set (20%)
(MAE:39.49%, RMSE:81.03%) is realized when the number of layers MAE MAPE RMSE
of PV-Net is three, and when the number of residual Conv-GRU
PV-Net vs CNN 0.01214 0.01003 0.01081
modules per layer is set to be five according to a hyperparameter PV-Net vs LSTM 0.01131 0.01023 0.01234
grid search. This behavior explains that when an abundant number PV-Net vs CLSTM 0.01513 0.00978 0.01087
of modules is reached, the variation in prediction performance is PV-Net vs LSTM þ AM 0.01411 0.00979 0.01687
smaller. Therefore, the proposed PV-Net achieves the best perfor- PV-Net vs GRU 0.00989 0.01034 0.01576

mance based on those selected parameters.

G. Statistical analysis that all these tabulated p-values are less than 0.0.5, demonstrating
that PV-Net is considerably distinctive from the competing DL
To further demonstrate the robustness of the results achieved by approaches.
the proposed PV-Net, the statistical t-test experiment (Virtanen
et al., 2020) is performed to validate the statistical significance of 5. Limitations
the PV-Net compared with the competing data-driven approaches.
In each experiment, three metrices of MAE, MAPE, and RMSE are This section introduces the primary limitations of this study that
estimated, where the significance threshold of a ¼ 0.05. Where a p- need to be considered in future work. First, the study did not
value less than 0.05 indicates statistical significance, and a p-value consider cloud thickness and cloud motion information that might
larger than 0.05 indicates no statistical significance. Table 3 shows have a great effect on forecasting performance. Second, the study
the attained P-values from these t-test experiments for our case formulated the task of PV energy forecasting as a black-box
study. PV-Net realizes the greatest p-values of 0.03114, 0.02916, and approach such that the mathematical correlation between the
0.03746 on the MAE, MAPE, and RMSE, respectively. Meanwhile, input parameters and PV energy production is not completely
the PV-Net had maximum p-values of 0.01513, 0.01034, and disclosed. Thus, it is not obvious which input element is the most
0.01576 on the MAE, MAPE, and RMSE, respectively. It is observable important factor influencing the forecasting performance. Third,
the complexity of PV power stems from the combination of the
physical model of the battery, meteorological conditions, and
environmental elements into an efficient forecasting model. Fourth,
the proposed PV-Net was evaluated for single-step, short-term
forecastingdlonger-term and multi-step scenarios were not
considered.

6. Conclusions and future works

This paper introduces a new data-driven DL framework, called


PV-Net, to forecast short-term PV energy production. The main
target was to alleviate the demand for robust feature engineering to
generate efficient PV power forecasts. In the proposed PV-Net, the
gates of GRU are redesigned using convolutional layers to promote
improved feature extraction and fine-tuned periodical represen-
tations of PV data. The redesigned Conv-GRU cell is integrated into
residually connected bidirectional blocks to improve the network
capabilities in modeling the irregular characteristics inherent in the
received PV time series. Technical and statistical evaluation of a
public use-case demonstrates the superiority of the new PV-Net
Fig. 12. Sensitivity experiments to determine the optimal number of layers in the PV- compared with the latest cutting-edge data-driven techniques.
Net. Further, PV-Net maintains high-level proficiency even with the
13
M. Abdel-Basset, H. Hawash, R.K. Chakrabortty et al. Journal of Cleaner Production 303 (2021) 127037

inclusion of complex non-regular features and affecting factors. photovoltaic battery swapping station based on weather/traffic forecasts and
speed variable charging. Appl. Energy 264, 114708.
Consequently, the proposed PV-Net could be adopted as a reliable
Guo, C., Luo, F., Cai, Z., Dong, Z.Y., Zhang, R.J.A.E., 2021. Integrated planning of
framework of PV energy forecasting in a power management internet data centers and battery energy storage systems in smart grids. Appl.
system. Energy 281, 116093.
In our plans for future work, we aim to investigate the potential Han, S., Qiao, Y.-h., Yan, J., Liu, Y.-q., Li, L., Wang, Z.J.A.E., 2019. Mid-to-long term
wind and photovoltaic power generation prediction based on copula function
of PV-Net for mid-term and long-term forecasting with more and long short term memory network. Appl. Energy 239, 181e191.
complex features and noisy data such as highly randomized infor- He, K., Zhang, X., Ren, S., Sun, J., 2016. Deep residual learning for image recognition.
mation about cloud thickness and movement. Additionally, In: Proceedings of the IEEE conference on computer vision and pattern recog-
nition, pp. 770e778.
inspired by recent improvements in internet of things (IoT) tech- Heidari, A., Khovalyg, D.J.S.E., 2020. Short-term energy use prediction of solar-
nologies, we intend to investigate PV energy forecasting in a assisted water heating system: application case of combined attention-based
complicated IoT enabled power systems. The interpretability of the LSTM and time-series decomposition. Sol. Energy 207, 626e639.
Huang, Q., Liang, X., Yan, C., Liu, Y.J.A.E., 2020. Review of interface solar-driven
proposed PV-Net will be investigated for different categories of steam generation systems: high-efficiency strategies, applications and chal-
renewable energy sources (i.e., wind power) to enable clarifying lenges. Appl. Energy 116361.
which input is more influential for prediction performance. Huang, Q., Wei, S.J.E.C., Management, 2020. Improved quantile convolutional neural
network with two-stage training for daily-ahead probabilistic forecasting of
Furthermore, the probabilistic forecasting of PV energy has not photovoltaic power. Energy Convers. Manag. 220, 113085.
been fully investigated in the literature, which might help esti- Ineichen, P., Perez, R., Seal, R., Maxwell, E., Zalenka, A.J.A.T., 1992. Dynamic global-
mating the ambiguity data, hence cope with potential uncertain to-direct irradiance conversion models. Build. Eng. 98, 354e369.
Ioffe, S., Szegedy, C.J.a.p.a., 2015. Batch normalization: accelerating deep network
operating circumstances.
training by reducing internal covariate shift arXiv preprint arXiv.
Kim, S., Kim, D., Ryu, B.-G., Chang, Y.K.J.A.E., 2020. Design optimization of large-
Funding scale attached cultivation of Ettlia sp. to maximize biomass production based
on simulation of solar irradiation. Appl. Energy 279, 115802.
Lee, D., Kim, K.J.R.E., 2020. PV power prediction in a peak zone using recurrent
This research has no funding source. neural networks in the absence of future meteorological information. Renew.
Energy.
Li, P., Zhou, K., Lu, X., Yang, S.J.A.E., 2020. A hybrid deep learning model for short-
Ethical approval term PV power forecasting. Appl. Energy 259, 114216.
Liu, H., Tian, H.-q., Li, Y.-f.J.A.E., 2012. Comparison of two new ARIMA-ANN and
This article does not contain any studies with human partici- ARIMA-Kalman hybrid methods for wind speed prediction. Appl. Energy 98,
415e424.
pants or animals performed by any of the authors. Liu, Y., Qin, H., Zhang, Z., Pei, S., Wang, C., Yu, X., Jiang, Z., Zhou, J.J.A.E., 2019.
Ensemble spatiotemporal forecasting of solar irradiation using variational
Bayesian convolutional gate recurrent unit network. Appl. Energy 253, 113596.
CRediT authorship contribution statement
Ma, S., Zhang, Y., Lv, J., Ge, Y., Yang, H., Li, L.J.E., 2020. Big data driven predictive
production planning for energy-intensive manufacturing industries. Energy 211,
Mohamed Abdel-Basset: Investigation, Methodology, Re- 118320.
Mayer, M.J., Gro  f, G.J.A.E., 2020. Extensive comparison of physical models for
sources, Visualization, Software, Writing - original draft, Writing -
photovoltaic power forecasting. Appl. Energy 116239.
review & editing. Hossam Hawash: Investigation, Methodology, Moreira, M., Balestrassi, P., Paiva, A., Ribeiro, P., Bonatto, B.J.R., Reviews, S.E., 2021.
Resources, Visualization, Software, Writing - original draft, Writing Design of experiments using artificial neural network ensemble for photovol-
- review & editing. Ripon K. Chakrabortty: Conceptualization, taic generation forecasting. Renew. Sustain. Energy Rev. 135, 110450.
Müller, J., Trutnevyte, E.J.A.E., 2020. Spatial projections of solar PV installations at
Methodology, Writing - review & editing. Michael Ryan: Investi- subnational level: accuracy testing of regression models. Appl. Energy 265,
gation, Validation, Supervision, Writing - review & editing. 114747.
Oh, H., Lee, W.-Y., Won, J., Kim, M., Choi, Y.-Y., Han, S.-B.J.A.E., 2020. Residual-based
fault diagnosis for thermal management systems of proton exchange mem-
Declaration of competing interest brane fuel cells. Appl. Energy 277, 115568.
Oord, A.v.d., Dieleman, S., Zen, H., Simonyan, K., Vinyals, O., Graves, A.,
Kalchbrenner, N., Senior, A., Kavukcuoglu, K.J.a.p.a., 2016. Wavenet: a generative
The authors declare that they have no known competing in-
model for raw audio arXiv preprint arXiv.
terests or personal relationships that could have appeared to in- Pan, M., Li, C., Gao, R., Huang, Y., You, H., Gu, T., Qin, F., 2020. Photovoltaic power
fluence the work reported in this paper. forecasting based on a support vector machine with improved ant colony
optimization. J. Clean. Prod. 277, 123948.
Perez, R.J.P., Seals, R., Michalsky, J., Stewart, 2002. Ineichen, pp. 271e289.
References Ridley, B., Boland, J., Lauret, P.J.R.E., 2010. Modelling of diffuse solar fraction with
multiple predictors. Renew. Energy 35, 478e483.
Abreu, E.F., Canhoto, P., Costa, M.J.J.R., Reviews, S.E., 2019. Prediction of diffuse Roth, J., Martin, A., Miller, C., Jain, R.K.J.A.E., 2020. SynCity: using open data to create
horizontal irradiance using a new climate zone model. Renew. Sustain. Energy a synthetic city of hourly building energy estimates by integrating data-driven
Rev. 110, 28e42. and physics-based methods. Appl. Energy 280, 115981.
Ahmed, R., Sreeram, V., Mishra, Y., Arif, M.J.R., Reviews, S.E., 2020. A review and Sharadga, H., Hajimirza, S., Balog, R.S.J.R.E., 2020. Time series forecasting of solar
evaluation of the state-of-the-art in PV solar power forecasting: techniques and power generation for large-scale photovoltaic plants. Renew. Energy 150,
optimization. Renew. Sustain. Energy Rev. 124, 109792. 797e807.
Bird, R.E., Hulstrom, R.L., 1981. Simplified Clear Sky Model for Direct and Diffuse Shi, X., Chen, Z., Wang, H., Yeung, D.-Y., Wong, W.-K., Woo, W.-c., 2015. Convolu-
Insolation on Horizontal Surfaces. Solar Energy Research Inst., Golden, CO tional LSTM network: a machine learning approach for precipitation now-
(USA). casting. Adv. Neural Inf. Process. Syst. 28, 802e810.
Bullich-Massague , E., Cifuentes-García, F.-J., Glenny-Crende, I., Cheah-Man ~e, M., Shi, X., Gao, Z., Lausen, L., Wang, H., Yeung, D.-Y., Wong, W.-k., Woo, W.-c., 2017.
Aragüe s-Pen~ alba, M., Díaz-Gonza lez, F., Gomis-Bellmunt, O.J.A.E., 2020. Deep learning for precipitation nowcasting: a benchmark and a new model.
A review of energy storage technologies for large scale photovoltaic power Adv. Neural Inf. Process. Syst. 5617e5627.
plants. Appl. Energy 274, 115213. Shi, Z., Yao, W., Li, Z., Zeng, L., Zhao, Y., Zhang, R., Tang, Y., Wen, J.J.A.E., 2020.
Dairi, A., Harrou, F., Sun, Y., Khadraoui, S.J.A.S., 2020. Short-term forecasting of Artificial intelligence techniques for stability analysis and control in smart
photovoltaic solar power production using variational auto-encoder driven grids: methodologies, applications, challenges and future directions. Appl. En-
deep learning approach. Appl. Sci. 10, 8400. ergy 278, 115733.
DKASC, 2020. Desert Knowledge Australia Centre. 25/12/2020. Download Data. Skartveit, A., Olseth, J.A.J.S.e., 1987. A model for the diffuse fraction of hourly global
Location (e.g. Alice Springs). http://dkasolarcentre.com.au/download date radiation. Sol. Energy 38, 271e274.
accessed: 25/12/2020. Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R., 2014.
Engerer, N.J.S.E., 2015. Minute resolution estimates of the diffuse fraction of global Dropout: a simple way to prevent neural networks from overfitting. J. Mach.
irradiance for southeastern Australia. Sol. Energy 116, 215e237. Learn. Res. 15, 1929e1958.
Erbs, D., Klein, S., Duffie, J.J.S.e., 1982. Estimation of the diffuse radiation fraction for Starke, A.R., Lemos, L.F., Boland, J., Cardemil, J.M., Colle, S.J.R.E., 2018. Resolution of
hourly, daily and monthly-average global radiation. Sol. Energy 28, 293e302. the cloud enhancement problem for one-minute diffuse radiation prediction.
Feng, J., Hou, S., Yu, L., Dimov, N., Zheng, P., Wang, C.J.A.E., 2020. Optimization of Renew. Energy 125, 472e484.

14
M. Abdel-Basset, H. Hawash, R.K. Chakrabortty et al. Journal of Cleaner Production 303 (2021) 127037

Sun, M., Feng, C., Zhang, J.J.A.E., 2020. Probabilistic solar power forecasting based on Solar Power Forecasting, vol. 214, p. 112909.
weather scenario generation. Appl. Energy 266, 114823. Wang, K., Qi, X., Liu, H.J.A.E., 2019. A comparison of day-ahead photovoltaic power
Theocharides, S., Makrides, G., Livera, A., Theristis, M., Kaimakis, P., forecasting models based on deep learning neural network. Appl. Energy 251,
Georghiou, G.E.J.A.E., 2020. Day-ahead photovoltaic power production fore- 113315.
casting methodology based on machine learning and statistical post-process- Yin, W., Han, Y., Zhou, H., Ma, M., Li, L., Zhu, H.J.R.E., 2020. A novel non-iterative
ing. Appl. Energy 268, 115023. correction method for short-term photovoltaic power forecasting. Renew.
Virtanen, P., Gommers, R., Oliphant, T.E., Haberland, M., Reddy, T., Cournapeau, D., Energy.
Burovski, E., et al., 2020. SciPy 1.0: fundamental algorithms for scientific Zhang, G., Cottrill, A.L., Koman, V.B., Liu, A.T., Mahajan, S.G., Piephoff, D.E.,
computing in Python. Nat. Methods. https://doi.org/10.1038/s41592-019-0686- Strano, M.S., 2020a. Persistent, single-polarity energy harvesting from ambient
2. thermal fluctuations using a thermal resonance device with thermal diodes.
Wang, F., Xuan, Z., Zhen, Z., Li, K., Wang, T., Shi, M.J.E.C., Management, 2020a. A day- Appl. Energy 280, 115881.
ahead PV power forecasting method based on LSTM-RNN model and time Zhang, J., Tan, Z., Wei, Y., 2020b. An adaptive hybrid model for day-ahead photo-
correlation modification under partial daily pattern prediction framework. voltaic output power prediction. J. Clean. Prod. 244, 118858.
Energy Convers. Manag. 212, 112766. Zhang, Y., Qin, C., Srivastava, A.K., Jin, C., Sharma, R.K., 2020c. Data-driven day-ahead
~o, J.P.J.E.C.,
Wang, F., Xuan, Z., Zhen, Z., Li, Y., Li, K., Zhao, L., Shafie-khah, M., Catala PV estimation using autoencoder-LSTM and persistence model. IEEE Trans. Ind.
Management, 2020b. A minutely solar irradiance forecasting method based on Appl. 56, 7185e7192.
real-time sky image-irradiance mapping model. Energy Convers. Manag. 220, Zhou, Y., Zhou, N., Gong, L., Jiang, M.J.E., 2020. Prediction of Photovoltaic Power
113075. Output Based on Similar Day Analysis, Genetic Algorithm and Extreme Learning
Wang, H., Liu, Y., Zhou, B., Li, C., Cao, G., Voropai, N., Barakhtenko, E.J.E.C., Man- Machine. Energy, p. 117894.
agement, 2020c. Taxonomy Research of Artificial Intelligence for Deterministic

15

You might also like