Welcome to Scribd!

CSE-3501 Information Security Analysis and Audit ELA (L9+L10) Digital Assignment-5

Uploaded by

0% found this document useful (0 votes)

6 views9 pages

This document discusses machine learning models for malware detection. It provides pseudocode for creating a classification model using logistic regression and decision trees. Code snippets are included to import libraries, preprocess data, train models on a training set, predict results on a test set, and evaluate performance using metrics like confusion matrix and accuracy. Decision trees are shown to have higher accuracy (99.996%) than logistic regression (94.008%) on this malware detection problem.

Original Description:

Original Title

LAB DA 5

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

0% found this document useful (0 votes)

6 views9 pages

CSE-3501 Information Security Analysis and Audit ELA (L9+L10) Digital Assignment-5

Uploaded by

Yash Agarwal

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

Jump to Page

You are on page 1of 9

Search inside document

CSE-3501

Information Security Analysis and Audit

ELA(L9+L10)
DIGITAL ASSIGNMENT-5
YASH AGARWAL
(19BCE0691)

1. MACHINE LEARNING BASED MALWARE DETECTION

SYSTEM

PSEUDOCODE:
o Firstly, Import packages, functions, and classes

o Secondly, Get the data to work with and, if appropriate, transform it

o Thirdly, Create a classiﬁcation model and train (or ﬁt) it with your existing data

o Lastly, evaluate your model to see if its performance is satisfactory

The first step is to perform the feature scaling of the data set, so that if one variable is in
the range from say 10000 to 50000 while other is from say 1 to 20 than they must be
scaled around the same value. Standard scalar library does that. Confusion matrix is a
2X2 matrix with values at [0][1] and [1][0] showing the number of wrong values in the
prediction. You can go through Logistic
Regression class and change several parameters for the classiﬁer.

#FiBng Logistic Regression to dataset

from sklearn.linear_model import LogisticRegression classifier = LogisticRegression()
classifier.fit(X_train, y_train)

#Predicting the test set result

y_pred = classiﬁer.predict(X_test)

#Making the confusion matrix

from sklearn.metrics import confusion_matrix cm = confusion_matrix(y_test, y_pred)

CODE:

To import the necessary libraries.

To import the given dataset

Label encoding

Splitting the data-set into the Training set and Test set
Feature scaling
Training Model on the Training set

Predicting the test set results

Confusion Matrix and Accuracy

PSEUDOCODE FOR DECISION TREE:

❖ At the beginning, we consider the whole training set as the root.

❖ Attributes are assumed to be categorical for information gain and for gini index,
attributes are assumed to be continuous.

❖ On the basis of attribute values records are distributed recursively.

❖ We use statistical methods for ordering attributes as root or internal node.
❖ Find the best attribute and place it on the root node of the tree.
❖ Now, split the training set of the dataset into subsets. While making the subset make
sure that each subset of training dataset should have the same value for an attribute.

❖ Find leaf nodes in all branches by repeating 1 and 2 on each subset.

While implementing the decision tree we will go through the following two phases:

• Building Phase

• Preprocess the dataset.

• Split the dataset from train and test using Python sklearn package.

• Train the classiﬁer.

• Operational Phase

• Make predictions.

• Calculate the accuracy.

CODE :

To import the required libraries

To import the data-

set

Label encoding
Splitting the data-set into Training set and
Testing set

Feature Scaling

Training model on the training set

Predicting the test set results

Confusion matrix and Accuracy

OUTPUT:
6 19BCE0691.ipynb

k 19BCE0691.ipynD
, a ‹s8ceo1co.ipynb

6 19BCE0591 ipynb
Comparison of Decision tree and Logistic Regression

Decision Logistic
tree regression
Accuracy (in %) 99.996% 94.008%

Average precision recall 1.00 0.91

score (Range:
[0,1])

Coincent - Data Science With Python Assignment
Document23 pages
Coincent - Data Science With Python Assignment
Sai Nikhil Nellore
100% (2)
Data Mining - Weka 3.6.0
Document5 pages
Data Mining - Weka 3.6.0
Navee Jayakody
No ratings yet
Data Preprocessing
Document38 pages
Data Preprocessing
Pradhana Riza
No ratings yet
A Practical Guide To Support Vector Classification: I I I N L
Document15 pages
A Practical Guide To Support Vector Classification: I I I N L
rabbityeah
No ratings yet
Ass3 v1
Document4 pages
Ass3 v1
Reeya Prakash
No ratings yet
A Practical Guide To Support Vector Classification
Document16 pages
A Practical Guide To Support Vector Classification
Jônatas Oliveira Silva
No ratings yet
Export Model
Document8 pages
Export Model
hu ans
No ratings yet
Bangla Hand Written Digit Recognition
Document19 pages
Bangla Hand Written Digit Recognition
Khondoker Abu Naim
No ratings yet
ML - Practical File
Document15 pages
ML - Practical File
Jatin Mathur
No ratings yet
Data Science Chapitre 1
Document54 pages
Data Science Chapitre 1
Leonel Ska
No ratings yet
Intro To Deep Learning With TensorFlow - Introduction To TensorFlow Cheatsheet - Codecademy
Document8 pages
Intro To Deep Learning With TensorFlow - Introduction To TensorFlow Cheatsheet - Codecademy
César Julián Donnarumma
No ratings yet
Pattern
Document1 page
Pattern
ahmadkhalil
No ratings yet
6 - Steps of The Classification Algorithm in Supervised Learning
Document15 pages
6 - Steps of The Classification Algorithm in Supervised Learning
Rajendra Chadalawada
No ratings yet
Lab 1. Boston House
Document7 pages
Lab 1. Boston House
dimas bayu
No ratings yet
Chapter 6: Data Preprocessing, Parameter Selection, and Inductive Conformal Prediction
Document56 pages
Chapter 6: Data Preprocessing, Parameter Selection, and Inductive Conformal Prediction
dsgssgsg
No ratings yet
House Price Prediction Analysis PDF
Document78 pages
House Price Prediction Analysis PDF
Shashank Chowdary
No ratings yet
Capstone Project 2
Document27 pages
Capstone Project 2
pranavi p
No ratings yet
Experiment 2.2 KNN Classifier
Document7 pages
Experiment 2.2 KNN Classifier
Arslan Mansoori
No ratings yet
Deep Learning (R20A6610)
Document46 pages
Deep Learning (R20A6610)
barak
No ratings yet
St. John College of Engineering and Management, Palghar - Maharashtra
Document11 pages
St. John College of Engineering and Management, Palghar - Maharashtra
pranay
No ratings yet
20dit073 Jay Prajapati ML
Document68 pages
20dit073 Jay Prajapati ML
Jay Prajapati
No ratings yet
All Types of Cross Validation
Document9 pages
All Types of Cross Validation
Priya dharshini.G
No ratings yet
Machine Learning Program 4 (SHANKAR)
Document6 pages
Machine Learning Program 4 (SHANKAR)
21EE076 NIDHIN
No ratings yet
NoCA2019-ProxyML 2019nov29
Document24 pages
NoCA2019-ProxyML 2019nov29
Salah Uddin
No ratings yet
MACHINE LEARNING WITH PYTHON - Digit Recognition With Scikit-Learn and Mnist
Document11 pages
MACHINE LEARNING WITH PYTHON - Digit Recognition With Scikit-Learn and Mnist
alexandre
No ratings yet
Decision Tree
Document6 pages
Decision Tree
Sazeda Sultana
No ratings yet
ML Lab 11 Manual - Neural Networks (Ver4)
Document8 pages
ML Lab 11 Manual - Neural Networks (Ver4)
dodela6303
No ratings yet
Anu Document Merged
Document8 pages
Anu Document Merged
Mani Megalai
No ratings yet
PythonMalware FirstReview
Document25 pages
PythonMalware FirstReview
Meenachi Sundaram
No ratings yet
Machine Learning Program 4 (Mohan)
Document7 pages
Machine Learning Program 4 (Mohan)
21EE076 NIDHIN
No ratings yet
Image Classification
Document18 pages
Image Classification
Darshna Gupta
No ratings yet
ML0101EN Clas K Nearest Neighbors CustCat Py v1
Document11 pages
ML0101EN Clas K Nearest Neighbors CustCat Py v1
banicx
100% (1)
Scikit Learn
Document25 pages
Scikit Learn
aslamzohaib
No ratings yet
Machine Learning & Data Mining
Document4 pages
Machine Learning & Data Mining
Priyaprasad Panda
No ratings yet
Theoryassignment PDF
Document11 pages
Theoryassignment PDF
Karthik Reddy
No ratings yet
Lab 2
Document3 pages
Lab 2
ptyquyen22
No ratings yet
DM Lab Cycle 2 1
Document10 pages
DM Lab Cycle 2 1
ispclx
No ratings yet
Assignment 3 - LP1
Document13 pages
Assignment 3 - LP1
bbad070105
No ratings yet
Lab Manual-ANN
Document7 pages
Lab Manual-ANN
faizan majid
No ratings yet
A Practical Guide To Support Vector Classi Cation - Chih-Wei Hsu, Chih-Chung Chang and Chih-Jen Lin
Document12 pages
A Practical Guide To Support Vector Classi Cation - Chih-Wei Hsu, Chih-Chung Chang and Chih-Jen Lin
Vítor Mangaravite
No ratings yet
Lab 08 - Data Preprocessing
Document9 pages
Lab 08 - Data Preprocessing
rida
No ratings yet
Deep Learning Lab (Ai&ds)
Document39 pages
Deep Learning Lab (Ai&ds)
BELMER GLADSON Asst. Prof. (CSE)
No ratings yet
17 Ensemble Techniques Problem Statement
Document28 pages
17 Ensemble Techniques Problem Statement
Jadhav A.S
No ratings yet
Project 1
Document4 pages
Project 1
aqsa yousaf
No ratings yet
Batch - 7 FINAL Review (DEEP LEARNING)
Document42 pages
Batch - 7 FINAL Review (DEEP LEARNING)
John Joshua surangula
No ratings yet
ML - Expt 7
Document6 pages
ML - Expt 7
mitali.201433201
No ratings yet
10 PDF
Document12 pages
10 PDF
Aishwarya Das
No ratings yet
DMlab - FilE prINCE
Document27 pages
DMlab - FilE prINCE
Rajput Prince Singh Kachhwaha
No ratings yet
Develop A Program To Implement Data Preprocessing Using
Document19 pages
Develop A Program To Implement Data Preprocessing Using
Fucker Jamun
No ratings yet
Binary Classification Tutorial With The Keras Deep Learning Library
Document33 pages
Binary Classification Tutorial With The Keras Deep Learning Library
Shudu Tang
No ratings yet
ML 5th
Document8 pages
ML 5th
sahugungun76
No ratings yet
utf-8''C2M1 Assignment
Document24 pages
utf-8''C2M1 Assignment
Sarah Mendes
No ratings yet
DFT
Document21 pages
DFT
Muhsin Nk
100% (1)
Machine Learning With Scikit-Learn: George Boorman
Document34 pages
Machine Learning With Scikit-Learn: George Boorman
AS
No ratings yet
WekaManual 101 200
Document100 pages
WekaManual 101 200
ihsan muttaqin
No ratings yet
Lab 09
Document4 pages
Lab 09
Muhammad Huzaifa Amjad
No ratings yet
Machine Learning
Document56 pages
Machine Learning
Mani Vrs
100% (3)
Demo Class 15 and 16102022 (Pandas in Python)
Document45 pages
Demo Class 15 and 16102022 (Pandas in Python)
Oskar Nguyen
No ratings yet
Building Good Training Sets UNIT 1 PART2
Document46 pages
Building Good Training Sets UNIT 1 PART2
Aditya Sharma
No ratings yet
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: NAIVE BAYES, NEAREST NEIGHBORS and NEURAL NETWORKS: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: NAIVE BAYES, NEAREST NEIGHBORS and NEURAL NETWORKS: Examples with MATLAB
César Pérez López
No ratings yet
Curriculum Plan - Jesanne G. Aguilar
Document9 pages
Curriculum Plan - Jesanne G. Aguilar
Jesanne Aguilar
No ratings yet
CDMTU Flyer MBM 2021
Document4 pages
CDMTU Flyer MBM 2021
Shivendra
No ratings yet
Project Management Process Groups
Document10 pages
Project Management Process Groups
Queen Valle
100% (1)
FOME Team, Teamwork & Leadership
Document30 pages
FOME Team, Teamwork & Leadership
Nixy Claudia
No ratings yet
MAPEH 7 - Music and Arts 4TH Q-DLL
Document5 pages
MAPEH 7 - Music and Arts 4TH Q-DLL
JUDITH APOSTOL
No ratings yet
GeEd 3013 PPT Cha1-3
Document124 pages
GeEd 3013 PPT Cha1-3
Esuye Fame Pupil
No ratings yet
Artistic Research: A Performative Paradigm?
Document14 pages
Artistic Research: A Performative Paradigm?
Denise Bandeira
No ratings yet
Test Bank For Essentials of Marketing Research A Hands On Orientation Naresh K Malhotra
Document25 pages
Test Bank For Essentials of Marketing Research A Hands On Orientation Naresh K Malhotra
tracywrightodmyxafzin
100% (31)
CUSTOMER ANALYTICS CHAPTER 1 and 2 Reviewer
Document8 pages
CUSTOMER ANALYTICS CHAPTER 1 and 2 Reviewer
KAH' CHISMISS
No ratings yet
Sneha Gudela: Links
Document3 pages
Sneha Gudela: Links
sneha gudela
No ratings yet
Silverman CV Updated
Document40 pages
Silverman CV Updated
Ilyass Zouhairi
No ratings yet
CIDAM On Simple and Compound Interest
Document2 pages
CIDAM On Simple and Compound Interest
Tyrone Pel
No ratings yet
Communication Strategies - Luciano Mariani
Document64 pages
Communication Strategies - Luciano Mariani
Nhi Nguyễn
No ratings yet
Career Talk Psychology
Document13 pages
Career Talk Psychology
nearraine guintu
No ratings yet
Module 4 Socialization
Document11 pages
Module 4 Socialization
xemiho2660
No ratings yet
Thesis With Descriptive Method
Document6 pages
Thesis With Descriptive Method
denisemillerdesmoines
100% (1)
Activity 5
Document2 pages
Activity 5
flory mae gudia
No ratings yet
Peer Group Influence: Effects On The Academic Performance of Beed 1St and 2Nd Year Students
Document28 pages
Peer Group Influence: Effects On The Academic Performance of Beed 1St and 2Nd Year Students
Dhime Aguilando II
No ratings yet
Report Paper: Field Reports
Document3 pages
Report Paper: Field Reports
VAN
No ratings yet
PedsQL Full
Document51 pages
PedsQL Full
Elsie Dyana Pretty Stephanie
No ratings yet
Creative Arts, Music and Drama For Young Children: Final Examination
Document2 pages
Creative Arts, Music and Drama For Young Children: Final Examination
Violet Silver
No ratings yet
Current Issues in The Teaching of Grammar
Document5 pages
Current Issues in The Teaching of Grammar
16040484 Phạm Ngọc Việt Anh
No ratings yet
Compensation in PID Control System For Valve Stiction Based On Equivalent-Input-Disturbance Approach
Document6 pages
Compensation in PID Control System For Valve Stiction Based On Equivalent-Input-Disturbance Approach
damara fernando
No ratings yet
Final Announcement - WWETC2021 Workshop
Document2 pages
Final Announcement - WWETC2021 Workshop
Giannis Dow
No ratings yet
Module 9 Professional Ed
Document6 pages
Module 9 Professional Ed
Aubrey Fabroa Bendijo
No ratings yet
Instrumen Kepuasan Pasien Terhadap Pelayanan Keperawatan: Literature Review
Document11 pages
Instrumen Kepuasan Pasien Terhadap Pelayanan Keperawatan: Literature Review
Erfina Fadilatul Hamidah
No ratings yet
19752-Article Text-42885-1-10-20191231
Document9 pages
19752-Article Text-42885-1-10-20191231
Sunita Aprilia Dewi
No ratings yet
Conceptual Framework and Theoretical Framework
Document3 pages
Conceptual Framework and Theoretical Framework
kimberlydonozomonternel
No ratings yet
HR Manpower Forecasting and Pooling - 20240205 - 125633 - 0000
Document32 pages
HR Manpower Forecasting and Pooling - 20240205 - 125633 - 0000
Realyn Zambas
No ratings yet
ICET2019 094TeacherspreparationforthefourthindustrialRevolution AcaseofSouthAfrica
Document16 pages
ICET2019 094TeacherspreparationforthefourthindustrialRevolution AcaseofSouthAfrica
Jane Ganado
No ratings yet