Professional Documents
Culture Documents
Electricity Demand Forecasting Using Modern Modified Techniques With Arma and Arima in Tamilnadu Using R Programming
Electricity Demand Forecasting Using Modern Modified Techniques With Arma and Arima in Tamilnadu Using R Programming
Electricity Demand Forecasting Using Modern Modified Techniques With Arma and Arima in Tamilnadu Using R Programming
Submitted by
NANDHINIEE K 310617105059
NAVEEN B 310617105060
RAJAHAMSA R 310617105073
YASHWANTH KUMAR SP 310617105106
BACHELOR OF ENGINEERING
In
ELECTRICAL AND ELECTRONICS ENGINEERING
BONAFIDE CERTIFICATE
SIGNATURE SIGNATURE
Dr.E.KALIAPPAN,M.Tech,Ph.D., Dr.E.KALIAPPAN, M.Tech,Ph.D.,
SUPERVISOR HEAD OF THE DEPARTMENT
Professor and Head Department of Electrical and
Department of Electrical and Electronics Engineering
Electronics Engineering Easwari Engineering College
Easwari Engineering College Ramapuram, Chennai-89
Ramapuram, Chennai-89
-----------------------------------------------------------------------------------------------
Dr. R.S. KUMAR, M.Tech., Ph.D., and the management for providing us with
We are very much thankful to Head of the department and our project
guide Dr. E. KALIAPPAN, M.Tech., Ph.D., for his motivation and support for
completion of our project with perfection. In addition, we thank him for his
collecting the official data pertaining to the load demands from verified official
sources.
K. V. THILAGAR, M.E, Ph.D., for his valuable support and encouragement for
the completion of our project with perfection.
TABLE OF CONTENTS
Abstract 1
List of Figures 2
1. Introduction 4
2. AUTOREGRESSIVE MODELS 13
Average(ARIMA) Model
3. Literature Review 23
4.2 Auto-correlation 29
Electricity demand forecasts are significant for energy providers and other
marketplaces. Accurate models for electric power load forecasting are crucial
Our discussion about different time series models are supported by giving the
measures.
1
LIST OF FIGURES
2010-2015
Figure 2.3 Boxplot Across Months to Illustrate Seasonal Effects
Figure 2.4 Complete Auto-correlation function plot in Time Series
function
years:2010 -2022
Figure 3.1 Demand Forecast data of the year 2016 in Tamil Nadu
Figure 3.2 Forecast error data of the year 2016 In Tamil Nadu
Figure 3.3 Demand forecast data of the year 2017 in Tamil Nadu
Figure 3.4 Forecast error data of the year 2017 in Tamil Nadu
CHAPTER 1: INTRODUCTION
The maximum savings can be achieved when load forecasting is used to control
operations and decisions like economic dispatch/ unit commitment and fuel
allocation /on -line network analysis. The operating cost is increased due to the
over time is considered. Finally, the appropriate model is selected to predict the
energy. Results show that the proposed method is reasonable and effective.
mathematical analysis used in the forecasting model. These are presented into
two basic types, namely: quantitative and qualitative methods. In most cases,
accurately, these methods are Delphi method, Curve fitting and technological
papers, the load forecasting techniques may be grouped broadly in three major
5
1.2 Traditional Forecasting Techniques
One of the most important topics for the planners of the nation is to predict
future load demands for planning the infrastructure, development trends and
index of overall development of the country etc. In early days, these predictions
been augmented with the finding of researches for more effective forecasting in
least-squares technique.
Regression is one of the most widely used statistical techniques and it is often
conditions, day types and customer classes. This method assumes that the load
can be divided in a standard load trend and a trend linearly dependent on some
factors influencing the load. The mathematical model can be written as:
6
Where, is the normal or standard load at time t, ai is the estimated
procedure requires few parameters that can be easily calculated from historical
precisely throughout a year, one should consider seasonal load change, annual
load growth and the latest daily load change. To deal with these characteristics
The transformation function is estimated with the previous year's data points, in
order that the function converts the data points into a set of new data points with
[4, 5-7].
Multiple Regressions
Multiple Regressions is the most popular method and often used to forecast the
7
capita growth, electricity prices, economic growth etc. Multiple Regression
The data analysis program can select the Polynomial degree of influence of the
variables from 1 to 5. In most cases, linear dependency gives the best results. In
subsequent steps, it uses the maximum of the initial hourly forecast; the most
recent initial peak forecast error and exponentially smoothed errors as variables
components.
8
Exponential smoothing is one of the approaches used for load forecasting. In
this method, the first load is model based on previous data, then to use this
model to predict the future load. Here, the load at time t, y(t), is modeled using a
model order and parameters. The method uses an operator that controls one
function and the partial autocorrelation function of the resulting differenced past
load data are utilized to identify a suboptimal model of the load dynamics. A
optimal model and the subsequent parameter estimates. Consider the parameter
accurate when the errors are not Gaussian. Iterative methods are used to find
The traditional forecasting techniques have been modified so that they are able
environmental conditions. Some of the techniques that are the modified version
10
1.7 Adaptive Demand Forecasting
nature and can be used as an on-line software package in the utilities control
system. Next state vector is estimated using current prediction error and the
historical data set analysis. Switching between multiple and adaptive regression
analysis is possible in this mode. The same model as in the multiple regression
model for load prediction composed of three components (nominal load, type
load and residual load) was developed .To use Kalman’s filter nominal load is
11
modeled accordingly and the parameters of the model are adapted by the
The Time series methods appear to be among the most popular approaches that
applied to STLF. Time series methods are based on the assumption that the data
available data and then obtain the forecasted value with respect to time using
the established model. The next subsection discusses some of the time series
12
Auto-Regressive (AR) model can be used to model the load profile, if the load
as:
α i, i =1…m are unknown coefficients and the above given equation is the auto
tuned on-line using the well-known least mean square (LMS) algorithm.
ARMA model represents the current value of the time series y (t) linearly in
likelihood approach. In this method, the original time series of monthly peak
13
demands are decomposed into deterministic and stochastic load components,
the WRLS (Weighted Recursive Least Squares) algorithm can be used to update
the parameters of their adaptive ARMA model. Using minimum mean square
the stationary form has to be done first. The differencing process can do this
For a series that needs to be, differenced d times and has orders p and q for
the AR and MA components, i.e. ARIMA (p; d; q), the model is written as:
The trend component to forecast the growth in the system load, the weather
parameters to forecast the weather sensitive load component, and the ARIMA
model to produce the non-weather cyclic component of the weekly peak load.
14
2.4 Support Vector Machine based Techniques
theory (SLT), which analyzes data and recognizes patterns, used for
modified more heavily than the distant insensitive errors, they named this
method as C-ascending support vector machine. They conclude by a test that the
C-ascending support vector machines with the actually ordered sample data
To estimate the relations between input and output variables Lee & Song further
model. This method was derived by modifying the risk function of the standard
The model is proved able to enhance the accuracy, improve global convergence
emerged to deal with such models effectively and most efficiently in research
scenarios. It has been very widely in use over the last few decades. Soft
systems mimic the ability of the human mind to employ modes of reasoning that
are approximate rather than exact. The basic theme of soft computing is that
precision and certainty carry a cost and that intelligent systems should exploit,
optimization tool. The demand/ load forecasting techniques are also developed
based expert systems have been utilized for this purpose also.
space and need not assume the search space is differentiable or unit-modal, it is
The general scheme of the Genetic Algorithm process is briefly described here.
17
The integer or real valued variables to be determined in the genetic algorithm
randomly generated range in each dimension. Each parent vector then generates
with the best fitness values, to become the new parents for the next generation.
generations is reached.
ARMAX form:
time t,
- white noise at time t, and q-1 - back-shift operator and A(q), B(q), and
C(q) are parameters of the autoregressive (AR), exogenous (X), and moving
18
optimization problem, and then solved by a combination of heuristics and
optimization of GAs, several results have been obtained over the last few years.
It is well known that a fuzzy logic system with centroid de-fuzzification can
identify and approximate any unknown dynamic system (here load) on the
compact set to arbitrary accuracy. Liu observed that a fuzzy logic system has
great capability in drawing similarities from huge data. The similarities in input
data (L-i -L0) can be identified by different first order differences ( ) and
The fuzzy logic-based forecaster works in two stages: training and on-line
19
forecasting. In the training stages, the metered historical load data are used to
database and a fuzzy rule base by using first and second-order differences of the
data. After enough training, it will be linked with a controller to predict the load
change online. If a most probably matching pattern with the highest possibility
power system. A hybrid approach was proposed which can accurately forecast
on weekdays, public holidays, and days before and after public holidays.
Simulated annealing and the steepest descent method perform the search for the
optimum solution. A hybrid scheme combining fuzzy logic with both neural
networks and expert systems for load forecasting. Fuzzy load values are inputs
to the neural network, and the output is corrected by a fuzzy rule inference
mechanism. The fuzzy logic approach for next-day load forecasting offers three
advantages. These are namely the ability to (1) handle non-linear curves, (2)
forecast irrespective of day type and (3) provide accurate forecasts in hard-to-
phase STLF methodology, that also uses orthogonal least squares (OSL) in
applications because of their ability to learn. Neural networks offer the potential
network, etc. There are multiple hidden layers in the network. In each hidden
layer, there are many neurons. Inputs are multiplied by weights ωi and are
added to a threshold θ to form an inner product number called the net function.
The net function NET for example, is put through the activation function y, to
produce the unit’s final output, y (NET). The main advantage here is that most
of the forecasting methods seen in the literature do not require a load model.
type neural network. The network outputs are linear functions of the weights
that connect inputs and hidden units to output units. Therefore, linear equations
can be solved for these output weights. In each iteration through the training
data (epoch), the output weight optimization training method uses conventional
back propagation to improve hidden unit weights, then solves linear equations
for the output weights using the conjugate gradient approach. Attention was
paid to accurately model special events, such as holidays, heat waves, cold
21
snaps and other conditions that disturb the normal pattern of the load.
Expert systems are new techniques that have emerged because of advances in
has the ability to reason, explain and have its knowledge base expanded as new
what is called the knowledge base component of the expert system. This
knowledge is represented as facts and IF-THEN rules, and consists of the set of
relationships. Between the changes in the system load and changes in natural
and forced condition factors that affect the use of electricity this rule base is
used daily to generate the forecasts. Some of the rules do not change over time,
relationships between weather load and the prevailing daily load shapes have
been widely examined to develop different rules for different approaches. The
typical variables in the process are the season under consideration, day of the
22
CHAPTER 3: LITERATURE REVIEW
take the relationships of time series from different locations into consideration .
Traditional prediction methods also include random time series models such as
algorithms, etc. These modern methods are getting more popular among
since the neural networks are products of biological simulation that follow the
behavior of the human brain . Park showed good performance of this type of
concluded that ANNs were highly effective in electrical load forecasting. After
that, many time series forecasting studies were performed using various
artificial neural networks by many researchers. Lou and Dong proved that
electric load forecasting with RFNN showed much higher variability with
hourly data in Macau. Okumus and Dinler integrated ANNs and the adaptive
proved that their proposed hybrid model Energies 2019, 12, 1931 three of 30
was better than the classical methods in forecasting accuracy. Hong selected
better parameters for SVR by using the CPSO algorithm, while Che and Wang
established a hybrid model that was a combination of ARIMA and SVM, called
24
SVRARIMA. Liu et al. built a model integrating EMD, extended extreme
learning machine (ELM), Kalman filter, and PSO algorithm. Although the
hybrid model seemed better than individual classical models, the limitations of
each model due to the nature of the structure seemed inevitable. In order to
forecasting theory has been developed through the joint efforts of three
shortcomings. There are some typical studies: Zhang et al. successfully obtained
difficulty in searching for electric load data. Moreover, because of the strict
Linear regression relies too much on historical data to cope with nonlinear
analysis models will become weaker and weaker. In addition, when faced with
recognize the turning point of the data and does not perform well in long-term
forecasting. As for the autoregressive moving average model, it only gets results
addition, strong random factors of the data may lead to instability of the model,
these models meets the accuracy required by an electric load forecasting system.
There are three major problems. First, it is hard to choose the parameters of
the outcomes. Second, ANNs are inclined to fall into local minima owing to
their relatively slow self-learning convergence rate. Lastly, the number of layers
and neurons in a neural network structure has an effect on the forecasting result
26
and computing time. As to other models, SVM has a high requirement for
storage space and expert systems strongly rely on knowledge databases, while
gray forecasting models can produce decent results only under the condition of
are applied. When the optimization algorithms are combined with Energies
2019, 12, 1931 four of 30 forecasting models, more reasonable parameters will
The main contributions and novelties of our proposed model are summarized as
follows:
The presented model includes all the state-of-art statistical methods used in the
under-forecasting values and by bringing the forecast values near to the actual.
These results are better than considering individual algorithms used in the two
models viz. ARMA & ARIMA. The experimental results show that our
27
CHAPTER 4: METHODOLOGY AND COMPARISON
28
Figure 1.1 The Building stages of the ARMA models
vector machine based, soft computing based models- genetic algorithms, fuzzy
logic, neural networks and knowledge based expert systems etc. have been
applied to load forecasting. The merits and demerits of these techniques are
presented technique wise. From the works reported so far, it can be inferred that
major advantages for their effective use. There is also a clear move towards
29
hybrid methods, which combine two or more of these techniques. The research
has been shifting and replacing old approaches with newer and more efficient
ones.
model. Compared with the CDE algorithm, the IDE algorithm is more effective
developed ARMA models based on the IDE algorithm can forecast throughput
performance of acid pickling and rolling mill units with acceptable accuracy
4.2 Auto-Correlation
This compares the persistence over time of the random walk. The
non-stationary.
30
The difference between time steps of the autocorrelation can also be determined
validation.
model is stationary or not. This figure shows the stationary auto-regression plot
of the time series model. It shows that the model is random and has the
tendency to move beyond the center and back, which makes the model
nonstationary.
31
Figure 1.3 Stationary Auto-Regression of the time series model
The two models share many similarities. The AR and MA components are
average model MA (q). AR (p) makes predictions using previous values of the
dependent variable. MA (q) makes predictions using the series mean and
ARMA model is a stationary model; if your model is not stationary, then you
can achieve stationarity by taking a series of differences. The “I” in the ARIMA
the model, then it becomes simply an ARMA. A model with a dth difference to
fit and ARMA (p, q) model is called an ARIMA process of order (p, d, q). You
can select p, d, and q with a wide range of methods, including AIC, BIC, and
32
empirical autocorrelations measures of accuracy.
Mean error in detail according to the orders p and q to determine their optimal
values. We compute the difference between the values obtained through model
ARMA (p, q) and those obtained experimentally. However, beyond these values
the gain is small. Due to the constraint capabilities of the network devices, the
computational power usage. Median values are used for the prediction, and
MAPE is used for error analysis. Using MAPE has low profit loss
◼ Pro: The MAPE measure works well in load forecasting, since load values
◼ Con: MAPE values are biased with the actual electricity price values, which
class of models in the field of load curve forecasting, namely the ARMA model.
33
A methodology that considers temperature and breaks down the deterministic
and the stochastic component, the latter modelled as an ARMA process. If two
models are generally similar in terms of their error statistics and other
diagnostics, you should prefer the one that is simpler and/or easier to
understand. The simpler model is likely to be closer to the truth, and others will
> library(timeSeries)
> library(trend)
> library(tseries)
> library(unittest)
> library(urca)
> library(wmtsa)
>library(zoo)
> library(xts)
= c(2015,12))
34
#This is the start of the time series and end of the time series.
> class(de.ts)
#This tells you that the data series is in a time series forma
[1] "ts"
> plot(de.ts)
35
> abline(reg=lm(de.ts~time(de.ts)))
> cycle(de.ts)
Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec
2010 1 2 3 4 5 6 7 8 9 10 11 12
2011 1 2 3 4 5 6 7 8 9 10 11 12
2012 1 2 3 4 5 6 7 8 9 10 11 12
2013 1 2 3 4 5 6 7 8 9 10 11 12
2014 1 2 3 4 5 6 7 8 9 10 11 12
2015 1 2 3 4 5 6 7 8 9 10 11 12
36
> plot(aggregate(de.ts,FUN=mean))
#This will aggregate the cycles and display a year on year trend
Figure 2.3 Aggregated cycle data for the year on year trend for 2010-2015
37
> boxplot(de.ts~cycle(de.ts))
Figure 2.5 Complete auto-correlation function plot in time series analysis with its lagged values
>acf(de.ts)
Figure 2.6 Partial auto-correlation function plot in time series analysis with its correlation of the
residuals
38
>pacf(de.ts)
Figure 2.7 Plot between partial auto-correlation function and lag function
>log(de.ts)
Nov Dec
>diff(de.ts)
data: diff(log(de.ts))
Warning message:
> Acf(log(de.ts))
Figure 2.8 Correlogram of the observed data trend obtained with the function plot.acf()
41
>pacf(log(de.ts))
Figure 2.9 Correlogram of the observed data trend with the function plot.pacf()
> (fit <- arima(log(de.ts), c(1, 1, 1),seasonal = list(order = c(0, 1, 1), period =
12)))
period = 12))
Coefficients:
42
> Acf(ts(fit$residuals),main="ACF RESIDUALS")
Figure 2.10 Correlogram of the observed residuals trend obtained with the function of acf
>pacf(ts(fit$residuals),main=”ACF RESIDUALS”)
Figure 2.11 Correlogram of the observed residuals trend obtained with the function of pcaf
>pred
$pred
43
Jan Feb Mar Apr May
Nov Dec
$se
45
>ts.plot(de.ts,2.178^pred$pred,log=”y”,lty=c(1,3))
Figure 2.12 AARIMA model to predict the future 5 years with seasonal components in the ARIMA
formulation
Figure 2.13 Predicted ARIMA formulation graphy for the next 10 years 2010-2020
46
3.7 Comparison of Results
Figure 3.1 Demand forecast data of the year 2016 in Tamil Nadu
47
Figure 3.3 Demand forecast data of the year 2017 in Tamil Nadu
Figure 3.4 Forecast Error data of the year 2017 in Tamil Nadu
48
Figure 3.5 Forecasted data of year 2018 in Tamil Nadu
49
Figure 3.7 Forecasted data of year 2019 in Tamil Nadu
51
CHAPTER 5: CONCLUSION FUTURE SCOPE
forecast And Regression with ARMA errors have been Considered. Results
indicated that AR (2), which was the mean corrected series differenced at lag 12
and 1, emerged as the best model for forecasting the maximum demand of
forecasting the electricity and this will be looked into for future research.
the best performing models for the demand forecasting process. Various classes
of time series models, namely ARIMA, Regression with ARMA errors have
been considered.
The statistics related to the last 10 years were used to train the models and the 5
past years were used to forecast. To the best of our knowledge, this is the first
attempt to consolidate the real time data with the ARMA model algorithm, and
52
different time series methods for demand forecasting systems. After comparison
between ARMA and ARIMA model, we conclude that ARIMA model as most
FUTURE SCOPE
based on the regularities in time series related to the changes in seasons. Based
on the above work, we use a real time regional monthly electric power
consumption demand. It may be useful in forecasting the electricity and this will
53
REFERENCES
Techniques:https://ieeexplore.ieee.org/stamp/stamp.jsp?
Shri Harsha.J(2012)
Algorithm:https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8256132
forecasting and control, 4rd ed., Prentice Hall: Englewood Cliffs, NJ, 2011.
[5] Implementing “R” Programming for Time Series Analysis and Forecasting
[6] Prapanna, M., Labani S., & Saptarsi G. (2014). Study of Effectiveness of
13-29.
54
[7] Time series forecasting using improved ARIMA:
https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7529496 Soheila
[8] Forecast of hourly average wind speed with ARMA models in Navarre
(Spain)":https://www.sciencedirect.com/science/article/abs/pii/S0038092X0400
https://www.sciencedirect.com/science/article/abs/pii/S0960148108003327?via
[10] Time Series ARIMA Model for Prediction of Daily and Monthly Average
Global Solar Radiation: The Case Study of Seoul, South Korea, Mohammed H.
[11] Tarno, Subanar, D. Rosadi and Suhartono, "New procedure for determining
10.1109/ICSSBE.2012.6396643.(2012)
[12] Aston, John & Findley, David & Mcelroy, Tucker & Wills, Kellie &
Martin, Donald. (2007). New ARIMA Models for Seasonal Time Series and
[13] Bartholomew, D. & Box, G. & GM, Jenkins. (1976). Time Series Analysis:
55
Forecasting and Control. Operat. Res. Quart.. 22. 199-201. 10.2307/3008255.
[14] G. Chang Y. Zhang D. Yao and Y. Yue. Short-term traffic flow forecasting
56
57