Professional Documents
Culture Documents
SJNanda - Spider and CollidingBodies
SJNanda - Spider and CollidingBodies
Supervised learning, you train the machine using data which is well "labeled."
It means some data is already tagged with the correct answer. It can be compared
to learning which takes place in the presence of a supervisor or a teacher.
Classification: Regression:
Classification means to group the output Regression technique predicts a single
inside a class. If the algorithm tries to output value using training data.
label input into two distinct classes, it is Example: You can use regression to predict
called binary classification. Selecting the house price from training data. The
between more than two classes is input variables will be locality, size of a
referred to as multiclass classification. house, etc.
Example: Determining whether or not
someone will be a defaulter of the loan.
Unsupervised Learning
Unsupervised learning is a machine learning technique, where you do not need to
supervise the model. The model is allowed to work on its own to discover information.
Importance :
Unsupervised machine learning finds all kind of unknown patterns in data.
Unsupervised methods help you to find features which can be useful for categorization.
It is taken place in real time, so all the input data to be analyzed and labeled in the
presence of learners.
It is easier to get unlabeled data from a computer than labeled data, which needs
manual intervention.
Clustering :
Supervised / Unsupervised Learning
Parameters Supervised machine learning Unsupervised machine learning
technique technique
Process In a supervised learning model, input and In unsupervised learning model, only
output variables will be given. input data will be given
Input Data Algorithms are trained using labeled data. Algorithms are used against data which is
not labeled
Algorithms Used Support vector machine, Neural network, Unsupervised algorithms can be divided
Linear and logistics regression, random into different categories: like Cluster
forest, and Classification trees. algorithms, K-means, Hierarchical
clustering, etc.
Computational Supervised learning is a simpler method. Unsupervised learning is computationally
Complexity complex
Use of Data Supervised learning model uses training Unsupervised learning does not use
data to learn a link between the input and output data.
the outputs.
Accuracy of Highly accurate and trustworthy method. Less accurate and trustworthy method.
Results
Real Time Learning method takes place offline. Learning method takes place in real time.
Learning
Data Clustering
Data Clustering refers to the unsupervised classification of a dataset into
groups, such that elements within a group are similar to each other.
Example : IRIS Flower Data Set
150 Observations (50 Observations for each flower)
4 attributes (Sepal Length, Sepal Width, Petal length, Petal Width)
7 7
6 6
Setosa
5
Petal Length
5
Petal Length
4 4
3 3
2 2
Versicolor
1
1
5
5
8
8 4
4 7
7
3 6
3 6
5
5
Sepal Length 2 4
Sepal Length 2 4 Sepal Width
Versinica Sepal Width
Manhattan Euclidian
K-Means Clustering
Step 3: Update the centroid of each Cluster.
Step 4 : Check whether iteration should stop or not?, as per Total distortion
Step 5: Output Yi can be the nearest centroid of the cluster containing Xi.
Data Clustering as an Optimization Problem
Nature Inspired Algorithms for Clustering
Dominant Male
Female spider
Non-Dominant
Male
Social Spider Algorithm
Vibrations received by ith dominant male spider from the nearest female spider
5.Female Moments
Exploration
6. Dominant Male Movements
Mating
6. Non-Dominant Male Movements
Exploitation
Step 7. Mating T2
T1
8. Mating for T1 set
M1
26% F1
19%
F1
19% F2
36%
Original Social Spider Algorithm
START
INIT POP
&
DATA-BASE PARA TEMP
STORAGE
FITNESS
ANALYSIS F1<F2
F2 STOP
If stopping
FINAL POP criterion
VIBRATION reached
DESIGN AFTER
GEN-1
FEMALE
UPDATE
NEW POP MATING
DOM-MALE
UPDATE
NON-DOM
MALE
UPDATE
Parallel Social Spider Algorithm
START
INIT POP
&
DATA-BASE PARA TEMP
STORAGE
FITNESS
ANALYSIS F1<F2
F2 STOP
If stopping
FINAL POP criterion
VIBRATION reached
DESIGN AFTER
GEN-1
NON-DOM
FEMALE DOM-MALE
MALE
UPDATE UPDATE
UPDATE
NEW POP
MATING
Benchmark Datasets Used for Analysis
Small Dimensional
Sr.no Data Set Observation Attributes Class Train Test
1 Iris 150 4 3 112 38
2 Balance 625 4 3 469 156
3 Wine 178 13 3 133 45
4 Breast Cancer 699 10 2 524 175
5 Thyroid 215 5 3 162 53
6 Credit 690 14 2 518 172
7 Heart 303 14 5 227 76
8 Glass 214 10 7 161 53
9 Horse 364 28 3 273 91
Sr.no Data Set SSO ABC [1] PSO [2] RGA k-mean
1 Iris 0 0 2.63 0 0
2 Balance 30.12 15.38 25.47 28.20 32.69
3 Wine 0 0 2.22 22.72 15.90
4 Breast Cancer 1.32 0 2.87 25.24 41.37
5 Thyroid 3.77 3.77 5.55 18.86 5.66
6 Credit 21.25 13.37 22.96 20.26 30.81
7 Heart 15.66 14.47 17.46 55.21 56.00
8 Glass 32.07 41.50 39.05 52.63 50.94
9 Horse 36 38.26 40.98 30.12 29.33
[1] D. Karaboga, C. Ozturk, A novel clustering approach: Artificial bee colony (abc) algorithm,
Applied soft computing 11 (1) (2011) 652–657.
[2] I. De Falco, A. Della Cioppa, E. Tarantino, Facing classification problems with particle
swarm optimization, Applied Soft Computing 7 (3) (2007) 652–658.
KDD CUP04 BIO Dataset with 10 Clusters
Computational Time Analysis
Sr.no Parameter Setting Value
1 Pop size 50
2 No of independent iteration 10
3 Mating probability 75%
P-SSO O-SSO
Sr.no Dataset Time CEP Times CEP
(sec) (sec)
1 Statlog 25.0623 7.3055 44.4226 8.455
2 Handwritten Digit 25.0435 2.7988 35.3781 3.5313
3 Dermatology 1.4142 13.3880 1.626 14.235
4 Mice 5.0680 6.3889 7.6621 13.250
Ex. Analysis on Flood Affected Area of Chennai
1) U. P. Shukla and S. J. Nanda, Parallel social spider clustering algorithm for high
dimensional datasets. Engineering Applications of Artificial Intelligence; Elsevier,
vol. 56, pp : 75-90, 2016.
2) U. P. Shukla and S. J. Nanda, A Binary Social Spider Optimization Algorithm for
Unsupervised Band Selection in Compressed Hyperspectral Images, Expert Systems
with Applications, Elsevier, vol. 97, pp: 336-356, 2018.
3) U. P. Shukla and S. J. Nanda. "Dynamic clustering with binary social spider algorithm
for streaming dataset." Soft Computing , Springer, vol. 23, no. 21 , pp : 10717-10737,
2019.
4) U. P. Shukla and S. J. Nanda. "Designing of a Risk Assessment Model for Issuing Credit
Card Using Parallel Social Spider Algorithm." Applied Artificial Intelligence, Taylor &
Francis, vol. 33, no. 3, pp: 191-207, 2019.
5) R. Gupta, S. J. Nanda, and U. P. Shukla. "Cloud detection in satellite images using
multi-objective social spider optimization." Applied Soft Computing, Elsevier, vol. 79
pp : 203-226, 2019.
6) S. Aggarwal, P. Chatterjee, R. P. Bhagat, K. K. Purbey, S. J. Nanda, A Social Spider
Optimization Algorithm with Chaotic Initialization for Robust Clustering, Procedia
Computer Science, Elsevier, 2019.
Colliding Bodies Optimization
Stationary
Body
Global Optima
Minimum Energy Level
2 2 2 2
New velocities after collision
( m1 m2 )v1 ( m2 m2 )v2 (m2 m1 )v2 (m1 m1 )v1
v
'
v
'
m1 m2 m1 m2
1 2
1
For maximization problem is replaced by fit i .
fit i
Initialize half of the most fittest bodies as stationary bodies and others are moving bodies
Moving CBs : mi m K vi
i
vi' 2 K K
; i 1, 2,..., K
mi m K 2 2
i
2
Colliding Bodies Optimization (CBO)
• Step 6 : Update the Positions of Colliding Bodies :
K
Stationary CBs : i
new
x x rand * v
i
'
i
i 1,2 , ...,
2
K K
Moving CBs : x new
i xi K / 2 rand * v '
i
i 1, 2 , ..., K
2 2
Where rand is a random number uniformly distributed in the range (-1,1).
iter
1
itermax
Step 8 : Termination Criteria
The Steps 2-8 are repeated until a termination criterion such as maximum number of
iteration or get a sufficiently good fitness value is achieved.
Selected Journal Publications
Sl. Authors Title Name of Journal Vol. Page Year Impact
Factor
1 R. Gupta, Improved framework of IET Image Processing 14 1495- 2020 1.995
S. J. Nanda many‐objective evolutionary 4807
algorithm to handle cloud
detection problem in satellite
imagery
3 R. K. Vijay, Shared Nearest Neighborhood IEEE Journal of 12 1619 - 2019 3.392
S. J. Nanda Intensity Based Declustering Selected Topics in (5) 1627
Model for Analysis of Spatio- Applied Earth
Temporal Seismicity Observations and
Remote Sensing
4 R. Gupta, Cloud detection in satellite Applied Soft 79 203-226 2019 4.873
S. J. Nanda, images using multi-objective Computing, Elsevier
U. P. Shukla social spider optimization
5 U. P. Shukla, A Binary Social Spider Expert Systems with 97 336-356 2018 4.577
S. J. Nanda Optimization Algorithm for Applications,
Unsupervised Band Selection Elsevier
in Compressed Hyper. Images
Selected International Conference Proceedings:
Sl. Authors Title Name of Conference Place Year
1 R. Gupta, A Binary NSGA-III for Unsupervised IEEE Congress on Wellington, 2019
S. J. Nanda Band Selection in Hyper-spectral Evolutionary Computation New Zealand
Satellite Images (IEEE-CEC-2019)
2 P. Khare, Clustering Networks based on IEEE Congress on Wellington, 2019
S. J. Nanda, Physarum Optimization for Seismic Evolutionary Computation New Zealand
R. K. Vijay Catalogs Analysis (IEEE-CEC-2019)
3 D. K. Point Symmetry Distance based K- IEEE Int. Conf. on Systems, Bari, Italy 2019
Kotary, Means Algorithm for Distributed Man and Cybernetics
S. J. Nanda Clustering in Peer to Peer Networks (IEEE-SMC-2019)
4 R. K. Vijay, De-clustering of an earthquake catalog IEEE Congress on San 2017
S. J. Nanda based on ergodicity using parallel Grey Evolutionary Computation Sebastian,
Wolf Optimization (IEEE-CEC-2017) Spain
5 I. Kumawat, Multi-objective whale optimization IEEE Region 10 Conference, Penang, 2017
S. J. Nanda (IEEE-TENCON-2017) Malaysia