Professional Documents
Culture Documents
Sales Forecasting Using Kernel Based Support Vector Machine Algorithm
Sales Forecasting Using Kernel Based Support Vector Machine Algorithm
ABSTRACT
Due to huge applications, machine learning techniques are researched 2. FORECASTING TECHNIQUES
more widely. This paper establishes a model of sales prediction based All Classification and prediction are two forms of data analysis
on improved support vector machines, to forecast the sales data. that can be used to extract models describing important data classes
Support vector regression provides global optimal solutions which or to predict future data trends. Such analysis can help us to provide
avoid overtraining. To check the efficiency of forecasting model with a better understanding of the large data. Classification predicts
various accuracy parameters are used, including Mean Absolute categorical (discrete, unordered) labels, while prediction models
Percentage Error (MAPE), Mean Absolute Deviation (MAD) and continuous valued functions. Classification technique is capable of
Mean Squared Error (MSE) etc. processing a wider variety of data than regression and is growing in
popularity. Classification is also called supervised learning, as the
Development of an improved Sales forecasting model was done instances are given with known labels, contrast to unsupervised
which is based on kernel based support vector machine and other learning in which labels are not known. Each instance in the dataset
optimization parameters. Traditional SVM classifier is a linear used by supervised or unsupervised learning method is represented
classifier, Kernel approach gives nonlinear classifier and maximum by a set of features or attributes which may be categorical or
margin hyperplanes which in turn fit the maximum margin continuous [1] [2].
hyperplane into a transformed feature space. Due to the non-linear Classification is the process of building the model from the training
nature, kernel based SVM used to build the forecasting model to set made up of database instances and associated class label. The
improve the forecast precision. resulting model is then used to predict the class label of the testing
instances where the values of the predictor features are known.
Testing of forecasting model was done by applying it on apparel sales Supervised classification is one of the tasks most frequently carried
data which was from 2009/01/01 to 2013/12/30. Forecasting model out by intelligent techniques. The large numbers of techniques have
was rerun and same dataset applies to various classical techniques been developed.
along with kernel based support vector machine algorithm to ensure
that the results are comparable. The obtained Results show that the
forecast efficiency of kernel based support vector machine was better 2.1 Classical Techniques
than classical approaches and traditional SVM.
2.1.1 Simple Moving Average
Keywords Simple Moving Average is the most basic forecasting method.
Simple Moving Average, Naïve Bayes, Exponential Smoothing, Procedure to do forecasting using Simple Moving Average is as
Adaptive Rate Smoothing, Weighted Moving Average, Machine follows,
Learning, SVM, MLP, Time series prediction, MAPE, MAD, MSE.
Take data from last N periods
Machine learning refers to a system that has the capability to Use average to forecast future period
automatically learn knowledge from experience and other ways.
Classification and prediction are two forms of data analysis that can
be used to extract models describing important data classes or to
predict future data trends [3]. (1)
Performance analysis of classical and machine learning algorithms
including neural network and SVM is carried out. These algorithms Where, Ft = forecast for period t
are used for classifying the actual sales data which were collected St-i = data
from apparel manufacturing firm. N = Number of time periods
Experimental setup is done using c# implemented code in visual
studio. The goal of this research is to find the best classifier which
outperforms other classifiers in all the aspects and use best algorithm 2.1.2 Weighted Moving Average
to build a forecasting model which will give better forecasting Simple moving average gives equal weight to all the data, the
results. weighted moving average can give more weight to more recent data.
(2)
(3)
If actual data point was above the last forecast, it would tend to move (4)
the future forecast up and if the data point was below the last
forecast, it would tend to move the future forecast down. Where, Yj is the output of node j, f (.) is the transfer function, wij the
connection weight between node j and node i in the lower layer and
Xij is the input signal from the node i in the lower layer to node j.
2.1.4 Adaptive Rate Smoothing
Adaptive rate smoothing modifies the exponential smoothing Artificial Neural Network(ANN) largely used in forecasting,
technique by modifying the alpha smoothing parameter for each assists multivariate analysis. Multivariate models can rely on
period's forecast by the inclusion of a Tracking Signal. This is a greater information, where not only the lagged time series being
technique shown to respond much more quickly to step changes forecast, but also other indicators (such as technical,
while retaining the ability to filter out random noise. fundamental, inter-marker etc. for financial markets), are
combined to act as predictors. In addition, ANN is more effective
2.1.5 Naïve Bayes in describing the dynamics of non stationary time series due to
A naive Bayes classifier is a simple probabilistic classifier based its unique non-parametric, non-assumable, noise-tolerant and
on applying Bayes' theorem with strong (naive) independence adaptive properties. ANNs are universal function approximators
assumptions. A more descriptive term for the underlying probability that can map any nonlinear function without a prior assumptions
model would be "independent feature model". Depending on the about the data [2].
precise nature of the probability model, naive Bayes classifiers can be
trained very efficiently in a supervised learning setting. In many
practical applications, parameter estimation for naive Bayes models
uses the method of maximum likelihood; in other words, one can 2.3 Support Vector Machines (SVM)
work with the naïve Bayes model without believing in Bayesian
probability or using any Bayesian methods [2][3]. 2.3.1 Support Vector Machines (SVM)
Support Vector Machine (SVM) is a Machine learning
techniques comes under classification method which was based on
the construction of hyperplanes in a multidimensional space Support
2.2 Neural Networks vector machine (SVM) is a useful technique for data classification,
regression and prediction. For Given a set of training examples, each
2.2.1 ANN marked as belonging to one of two categories, an SVM training
Artificial Neural Network (ANN) is a Machine learning algorithm builds a model that predicts whether a new example falls
techniques which largely used in forecasting, assists multivariate into one category or the other. SVM model is a representation of the
analysis [7]. Many research activities were done to optimize the examples as points in space, mapped so that the examples of the
usage of ANN for forecasting in the past decade. ANNs provide a separate categories are divided by a clear gap that is as wide as
great deal of promise, but it also uncertain in behavior. Till the date possible. New examples are then mapped into that same space and
researchers are still not certain about the effect of key factors on the predicted to belong to a category based on which side of the gap they
forecasting performance of ANNs. fall on.
A linear support vector machine is composed of a set of given
support vectors z and a set of weights w. The computation for the
output of a given SVM with N support vectors z 1, z2, … , zN and
weights w1, w2, … , wN is then given by,
(5)
Using kernels, the original formulation for the SVM given SVM with
support vectors z1, z2, … , zN and weights w1, w2, … , wN is then
given by,
(6)
(7)
(8)
Classical
Product ANN K-SVM
(Naive)
P1 0.2774248 0.25823926 0.2313461
P2 0.2491538 0.29823926 0.2487574
P3 0.2844138 0.2728239 0.2918824
P4 0.2774248 0.32823926 0.3182393
Fig. 3 Quarterly apparel’s sales data P5 0.2974148 0.30296282 0.2762824
P6 0.2874871 0.29782393 0.2678239
There are three category of apparels are manufactured namely Avg. 0.2788865 0.29305473 0.27238858
Menswear, Woodenware and Kids ware. We have selected Men’s
Category from product details. The Product details along with the In the Table 3, the observed value of MAD for K-SVM is lower than
category are as follows; the other Classical technique and ANN except the few cases.
Classical
Category 1 Category 2 Category 3 Product ANN K-SVM
Product (Naive)
Men’s Women’s Kids
P1 Shirts Shirts Shirts P1 119.11536 108.886481 99.886481
P2 Trousers Trousers Trousers P2 117.41536 102.64577 104.936977
P3 T-Shirts Dress T-shirts P3 118.17436 111.123645 89.123645
P4 Jeans Jeans Jeans P4 109.94336 116.239836 98.239836
Kurtas &
P5 Kurtas Dress P5 109.88536 111.465725 113.465725
Kurtis
P6 Jackets Sarees Shorts P6 118.41536 106.917582 104.78652
Avg. 115.49152 109.546506 101.739864