
VIRUS DETECTION USING DEEP

LEARNING

By
Saurabh Malusare
Rojan Sudev
Rishabh Nrupnarayan

Under The Guidance of


Prof. Anil M. Bhadgale
INTRODUCTION

A computer virus is a program or piece of code
that, when executed, replicates by reproducing
itself or by infecting other computer programs,
modifying them in the process.
VIRUS DETECTING TECHNIQUES

• Signature Based Detection


• Heuristic Based Detection
• Detection using Bait
LIMITATIONS OF CONVENTIONAL
TECHNIQUES

• Time lag between virus creation and its
detection
• Large signature databases have to be maintained
• New virus patterns cannot be detected
PROBLEM DEFINITION

Using deep learning to classify whether a file is
a virus or legitimate, while overcoming the
existing limitations of conventional techniques.
System Architecture
Important fields of the PE (Portable Executable) header:
Feature Selection
• Extract only features relevant to classification
• Fisher Score algorithm used for feature selection
• Fisher Score assigns each feature a rank
• Ranks lie between 0 and 1
• Higher rank, more relevance
Fisher Score formula:

F(i) = (µi,p − µi,n)^2 / (σi,p^2 + σi,n^2)

• µi,p = mean of positive samples for the ith PE header feature
• µi,n = mean of negative samples for the ith PE header feature
• σi,p = standard deviation of positive samples for the ith PE
header feature
• σi,n = standard deviation of negative samples for the ith PE
header feature
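Under these definitions, the score for the ith feature is (µi,p − µi,n)^2 / (σi,p^2 + σi,n^2). A minimal NumPy sketch (the toy data and function name are illustrative, not from the project):

```python
import numpy as np

def fisher_score(X, y):
    """Per-feature Fisher score: (mu_p - mu_n)^2 / (sigma_p^2 + sigma_n^2).

    X: (n_samples, n_features) real-valued PE-header features.
    y: (n_samples,) labels, 1 = virus (positive), 0 = legitimate (negative).
    """
    pos, neg = X[y == 1], X[y == 0]
    mu_p, mu_n = pos.mean(axis=0), neg.mean(axis=0)
    var_p, var_n = pos.var(axis=0), neg.var(axis=0)
    return (mu_p - mu_n) ** 2 / (var_p + var_n + 1e-12)  # epsilon avoids /0

# Toy data: feature 0 separates the classes well, feature 1 does not.
X = np.array([[0.1, 5.0], [0.2, 4.9], [0.9, 5.1], [1.0, 5.0]])
y = np.array([0, 0, 1, 1])
scores = fisher_score(X, y)
top = np.argsort(scores)[::-1]  # features ranked by relevance
```

Sorting the scores in descending order and keeping the first 21 features gives the selection described above.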
Feature Extraction
• Extract 21 most relevant features determined
using Fisher Score.
• These features are real values.
• Normalize features using min-max
normalization
• Features are scaled to [0,1]
• Normalized feature values are then converted
to binary values using the condition:

If feature > mean(feature)
feature = 1
else
feature = 0
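The min-max normalization and mean-threshold binarization steps above can be sketched as follows (a minimal illustration; the strict > comparison follows the condition above):

```python
import numpy as np

def binarize_features(X):
    """Min-max normalize each column to [0, 1], then threshold at its mean."""
    lo, hi = X.min(axis=0), X.max(axis=0)
    norm = (X - lo) / np.where(hi > lo, hi - lo, 1.0)  # guard constant columns
    return (norm > norm.mean(axis=0)).astype(int)      # 1 if above mean, else 0

# Toy feature matrix (rows = files, columns = PE-header features).
X = np.array([[10.0, 200.0],
              [20.0, 100.0],
              [30.0, 400.0]])
B = binarize_features(X)  # binary matrix fed to the DBN's visible layer
```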
DBN
• A deep belief network is obtained by stacking
several RBMs (Restricted Boltzmann Machines)
on top of each other.
• The hidden layer of the RBM at layer `i`
becomes the input of the RBM at layer `i+1`.
• When used for classification, the DBN is
treated as an MLP, by adding a logistic
regression layer on top.
RBM

Fig. RBM

Fig. Forward phase

Fig. Backward phase


RBM Training
Contrastive Divergence-k(CD-k):
• Take a training sample v, compute the
probabilities of the hidden units and sample a
hidden activation vector h from this
probability distribution.
• Compute the outer product of v and h and call
this the positive gradient.
• From h, sample a reconstruction v1 of the
visible units, then resample the hidden
activations h1 from this.
• Repeat the above step k times to obtain vk and hk;
the outer product of vk and hk is the negative gradient.
• The weight update is the learning rate times the
difference between the positive and negative gradients.
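The steps above, for k = 1 (CD-1), can be sketched for a binary RBM as follows; bias terms are omitted for brevity, and all names are illustrative:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cd1_update(W, v0, rng, lr=0.1):
    """One CD-1 step for a binary RBM (bias terms omitted for brevity).

    W: (n_visible, n_hidden) weights; v0: (n_visible,) binary training sample.
    """
    # Positive phase: hidden probabilities given v0, then sample h0.
    p_h0 = sigmoid(v0 @ W)
    h0 = (rng.random(p_h0.shape) < p_h0).astype(float)
    # Negative phase: reconstruct v1 from h0, then recompute hidden probs.
    p_v1 = sigmoid(h0 @ W.T)
    v1 = (rng.random(p_v1.shape) < p_v1).astype(float)
    p_h1 = sigmoid(v1 @ W)
    # Update: positive gradient minus negative gradient (outer products).
    return W + lr * (np.outer(v0, p_h0) - np.outer(v1, p_h1))

rng = np.random.default_rng(0)
W = rng.normal(scale=0.1, size=(6, 3))
v = np.array([1.0, 0.0, 1.0, 0.0, 1.0, 0.0])
W = cd1_update(W, v, rng)
```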
Training DBN
• The DBN is trained in a semi-supervised way,
in 2 phases:
1) Unsupervised training phase
2) Supervised training phase
Unsupervised Training
Algorithm:
• 1. Train the first layer as an RBM that models the raw input as its visible
layer.
• 2. Use that first layer to obtain a representation of the input that will be
used as data for the second layer.
• 3. Train the second layer as an RBM, taking the transformed data
(samples or mean activations) as training examples (for the visible layer of that RBM).
• 4. Iterate (2 and 3) for the desired number of layers, each time
propagating upward either samples or mean activations.
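The greedy layer-wise procedure above can be sketched as below; the layer sizes, epoch count, and toy data are illustrative, CD-1 is used for each layer, and mean activations are propagated upward:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def train_rbm(data, n_hidden, epochs=5, lr=0.1, rng=None):
    """Toy CD-1 training of one RBM layer; returns its weight matrix."""
    if rng is None:
        rng = np.random.default_rng(0)
    n_visible = data.shape[1]
    W = rng.normal(scale=0.1, size=(n_visible, n_hidden))
    for _ in range(epochs):
        for v0 in data:
            p_h0 = sigmoid(v0 @ W)
            h0 = (rng.random(n_hidden) < p_h0).astype(float)
            v1 = (rng.random(n_visible) < sigmoid(h0 @ W.T)).astype(float)
            W += lr * (np.outer(v0, p_h0) - np.outer(v1, sigmoid(v1 @ W)))
    return W

def pretrain_dbn(data, layer_sizes):
    """Greedy layer-wise pretraining: each RBM models the layer below's output."""
    weights, rep = [], data
    for n_hidden in layer_sizes:
        W = train_rbm(rep, n_hidden)
        weights.append(W)
        rep = sigmoid(rep @ W)  # propagate mean activations upward
    return weights

data = (np.random.default_rng(1).random((20, 8)) > 0.5).astype(float)
weights = pretrain_dbn(data, [5, 3])  # a 2-layer stack of RBMs
```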
Supervised Training
• Uses logistic regression on top of the DBN
• The logistic regression model is trained in a
supervised way, using labelled virus and
legitimate files
• Logistic regression is a probabilistic, linear
classifier parameterized by a weight
matrix W and a bias vector b.
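A minimal sketch of this supervised layer as binary logistic regression trained by gradient descent; the toy features stand in for the DBN's top-layer representation, and all names are illustrative:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def train_logistic(X, y, lr=0.5, epochs=200):
    """Binary logistic regression: p(virus | x) = sigmoid(x @ w + b)."""
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(epochs):
        p = sigmoid(X @ w + b)
        grad = p - y                    # gradient of the cross-entropy loss
        w -= lr * (X.T @ grad) / len(y)
        b -= lr * grad.mean()
    return w, b

# Separable toy data: rows 0-1 legitimate (label 0), rows 2-3 virus (label 1).
X = np.array([[0.0, 0.1], [0.1, 0.0], [0.9, 1.0], [1.0, 0.9]])
y = np.array([0, 0, 1, 1])
w, b = train_logistic(X, y)
pred = (sigmoid(X @ w + b) > 0.5).astype(int)
```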
Fine Tuning Parameters

• Number of hidden layers


• Number of processing units per hidden layer
• Learning rate
PERFORMANCE
EVALUATION

03/06/17 CS-152 23
SNAPSHOTS
RESULTS
• The feature extractor is capable of extracting
relevant features from the dataset and the input
file.
• The DBN is capable of classifying a given PE
file as virus or legitimate with an
accuracy of 94.5%.
CONCLUSION
