DA For Mathematical Modelling Theory 20BCE0985
Aditya Sarangarajan
20BCE0985
Theory Slot: E2
Mathematical Modelling for Data Science
(i) Training engine
For each input record there is a label describing the type of connection. We use this label to train the engine as follows:
• First, the number of good records and the number of bad records in the training dataset are calculated.
• Then two hash tables are created; the first one includes the frequency of each attribute for normal records, and the second one includes the frequency of each attribute for not-normal records.
• Finally, a third hash table is created. This table contains each attribute from the normal and not-normal records, and each attribute is scored using the following formula
where
• B is the frequency of that attribute in the hash table related to the not-normal file.
• G is the frequency of that attribute in the hash table related to the normal file.
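The three training steps above can be sketched in Python as follows. This is a minimal illustration, not the author's implementation: the function names, the record layout (a list of attribute values plus a label), and the convention that good records carry the label "normal" are assumptions. The scoring formula itself is not reproduced in the text, so the sketch takes it as a parameter; `placeholder_score` is a hypothetical stand-in used only to make the example run.

```python
from collections import Counter


def train_engine(records, score_fn):
    """Build the three hash tables described above.

    records  : iterable of (attributes, label) pairs (assumed layout)
    score_fn : callable score_fn(B, G); stands in for the scoring formula,
               which is not given in the text
    """
    n_good = 0
    n_bad = 0
    good_freq = Counter()  # table 1: attribute frequencies in normal records
    bad_freq = Counter()   # table 2: attribute frequencies in not-normal records

    for attributes, label in records:
        if label == "normal":  # assumption: good records are labelled "normal"
            n_good += 1
            good_freq.update(attributes)
        else:
            n_bad += 1
            bad_freq.update(attributes)

    # table 3: one score per attribute seen in either table
    scores = {
        attr: score_fn(bad_freq[attr], good_freq[attr])
        for attr in good_freq.keys() | bad_freq.keys()
    }
    return n_good, n_bad, good_freq, bad_freq, scores


def placeholder_score(b, g):
    # ASSUMPTION: illustrative only, not the formula from the source.
    # Returns the share of occurrences that come from not-normal records.
    return b / (b + g)


records = [
    (["tcp", "http", "SF"], "normal"),
    (["tcp", "private", "S0"], "neptune"),
    (["udp", "private", "SF"], "normal"),
]
n_good, n_bad, good_freq, bad_freq, scores = train_engine(records, placeholder_score)
```

Passing the scoring rule in as `score_fn` keeps the table-building logic independent of whichever formula is actually used; B and G are read from the second and first tables respectively, matching the definitions above.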
Data Collector: The NSL-KDD data set is first collected at this block.
Pre-processor: This block takes the original data from the MIT Lincoln Lab, extracts the required features, and converts the data set into a MATLAB-compatible format. In effect, it performs the data cleaning procedure.
Encoder: The attributes given in the data set are converted into the double data type to make them compatible with the ANN toolbox of MATLAB.
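The encoder stage can be illustrated with a short Python sketch. The text does not say how the conversion is done, so the scheme below (assigning each distinct symbolic value an index in order of first appearance, stored as a float) is an assumption, as are the function names and the five-column sample rows. Python's `float` is a double-precision value, which corresponds to the double data type mentioned above.

```python
def build_codebooks(rows, categorical_columns):
    """Assign a float code to every distinct value in each categorical column.

    ASSUMPTION: codes are handed out in order of first appearance;
    the source does not specify the encoding scheme.
    """
    codebooks = {col: {} for col in categorical_columns}
    for row in rows:
        for col in categorical_columns:
            book = codebooks[col]
            # len(book) is evaluated before insertion, so codes run 0.0, 1.0, ...
            book.setdefault(row[col], float(len(book)))
    return codebooks


def encode_row(row, codebooks):
    """Return the row with every attribute converted to a double (float)."""
    encoded = []
    for col, value in enumerate(row):
        if col in codebooks:
            encoded.append(codebooks[col][value])  # symbolic attribute
        else:
            encoded.append(float(value))           # already numeric
    return encoded


# Hypothetical sample rows: duration, protocol, service, flag, source bytes
rows = [
    [0, "tcp", "http", "SF", 181],
    [0, "udp", "private", "SF", 105],
    [0, "tcp", "private", "S0", 0],
]
codebooks = build_codebooks(rows, categorical_columns=[1, 2, 3])
encoded = [encode_row(row, codebooks) for row in rows]
```

In this sketch a symbolic value that was not seen while building the codebooks raises a `KeyError` during encoding, so the codebooks would need to be built over every value expected in the data.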
Neural Network Classifier: The data at the output of the encoder stage is fed into the neural network