Professional Documents
Culture Documents
6th - Sem - Detection of Transaction Fraud Using Deep Learning
6th - Sem - Detection of Transaction Fraud Using Deep Learning
Deep Learning
A project report submitted for the fulfilment of
the
Bachelor Of Technology Degree
In
Computer Science and Engineering
Under
Maulana Abul Kalam Azad University Of
Technology
Under the supervision of
Prof. Bavrabi Ghosh
Submitted by:
Adrineel Saha
(university roll no. 10400319150,
reg. No. 014117 )
Ayush Kumar Jha
(university roll. No. 10400719072,
reg. No. 012073)
Academic session: 2019-2023
Department of Computer Science &
Engineering
2. Literature Survey
3.Proposed Framework
3.1 Data-Set Used
3.2 Proposed Methodology
3.3 Implementation
4. Implementation
4.1 Import Libraries
4.2 Load Dataset
4.3 Exploring Datset and describe numerical attributes
4.4 Check Data Type of the Features and missing values
4.5 Exploring all types of data by Plotting
4.5.1 Univariate Analysis
4.5.2 Bivariate Analysis
4.5.3 Multivariate Analysis
4.6 Machine Learning Modelling
4.6.1 Baseline
4.6.2 Logistic Regression
4.6.3 K Nearest Neighbours
4.7 Comparing Model Performance
6.Conclusion
7. References
8. List of Tables
8.1 Summary of collected dataset
8.2 Architecture of proposed RNN Model
8.3 Exploring Datset and descbibe numerical attributes
9. List Of Figures
9.1 Exploring Datset and describe numerical attributes
9.2 Check Data Type of the Features and missing values
9.3 Exploring all types of data by Plotting
9.3.1 Univariate Analysis
9.3.2 Bivariate Analysis
9.3.3 Multivariate Analysis
9.4 Machine Learning Modelling
9.4.1 Baseline
9.4.2 Logistic Regression
9.4.3 K Nearest Neighbours
Challenges to Overcome
The sheer volume of fraud claims isn’t the only challenge facing fraud
management teams during the pandemic. Fraud and dispute
financial crime teams surveyed aid that increased fraud attacks
during COVID-19 had drastically impacted their operations. Survey
respondents specifically noted challenges related to obtaining skilled
staff, having insufficient resources, strained customer experience,
and disrupted cross-departmental communication channels. The
ability to find experienced staff in the fraud and compliance industry
has been a pressing issue. FICO reported that 38% of their
operational challenges stemmed from the inability to obtain skilled
and experienced staff.
The preference for E-commerce websites for purchasing various
products at a more economic or reasonable price have a positive
impact on growth of the target market. Mobile payment system
facilitate nearly any type of payments. Most merchants prefer
online system of operations for online shopping during COVID-
19. Any payment can be made if a person has online transaction
facility and has mobile. Mobile is required for receiving One Time
Password (OTP). Mobile wallets helps to increase the overall use of
mobile payment. It is found that mobile payments have reached
$194.1 billion in 2017 and $30.2 billion in 2017 as compared to $18.7
billion in 2016.
Objective Of The Proposed Work
The objective of this paper is to establish an effective and accurate
fraudulent financial mobile money transaction detection model with
high efficiency and low error rate. It utilises Deep Learning (DL)
techniques for implementing this model. These techniques are
beneficial since it automatically captures hierarchical features
present in the financial dataset. Recurrent Neural Network (RNN)
follows DL architecture which is utilised in this paper. A stacked RNN
model is proposed as a recommender system for detection of fraud
transaction. Automatic recognising of suspicious activities that
trigger illegal attempts will alarm the customers so that economic
loss can be prevented. Analysis of the proposed algorithms includes
determination of quantitative, qualitative, comparative and
complexity measures. The proposed methods have been rigorously
tested using dataset.
Literature Survey
Altab Althar Taha and Sareef Jameel Malbery described that up-
gradation in e-commerce and communication technology have made
credit card usage more popular way of payment and the fraud associated
with transactions is also increasing. They have used the optimized light
gradient boosting machine, where Bayesian based hyper-parameter
optimization are combined to tune with parameter of light gradient
boosting machine (LightGBM). In this approach they used two set of real
world public dataset consisting of fraudulent and non-fraudulent
transactions. Based on the comparison with other techniques, their
proposed system outperformed in terms of accuracy. The proposed
system produces the accuracy of 98.40%, area under receiver operating
characteristics curve (AUC) of 92.88%, Precision of 97.34% and F1-score
of 56.95% [1].
S. Makki, Z. Assaghir, Y. Taher, R. Haque, M. Hacid and H. Zeineddine
research describes that the credit card fraud cause huge financial loss.
Most of the researchers have been working on this to provide an
innovative ways to eradicate this loss and most of the available methods
are costly, time consuming and labor incentive task. The authors have
found out that the imbalanced classification of dataset is the main reason
for the inaccurate results after many experimental studies. These
imbalance classifications consist of un-balanced dataset, which caused
the model to predict inaccurate and causes the financial loss. Therefore,
they have found that LR, C5.0 decision tree algorithm, SVM and ANN
are best algorithm based on accuracy, AUCPR and sensitivity. They have
used the balanced dataset in order to train these models [2].
Proposed Framework
Data-Set Used
Financial dataset simulated by Pay-Sim that identifies mobile money
transactions based on a sample of real transactions. These
transactions are collected from one month financial logs of a mobile
money service. The dataset consists of 6362620 online transaction
records during COVID-19 and each record is formulated as a
collection of several attributes. The non-numeric data present in the
dataset is transformed into numeric data. Next, all the numeric data
are scaled down into a specific range from 0 to 1. This will help in
pre-processing dataset on which proposed classifier is applied. Cash-
out and transfer type transactions are having suspicious transaction
set. The attribute ‘isFraud’ is kept as target variable of classification
procedure.
Proposed Methodology
Load Dataset:-
fraud_0.1origbase.csv
Describe Numeric Features and check Data
Type of the Features and missing values:-
Exploring all types of data by Plotting:-
Univariate Analysis:-
Numerical Variables:-
Categorical Variables:-
Bivariate Analysis:-
The majority fraud transaction occours for the same user:-
All the fraud amount is greater than 10.000:-
60% of fraud transaction occours using cash-out-type method:-
Multivariate Analysis:-
Machine Learning Modelling:-
Baseline:-
Logistic Regression:-
K Nearest Neighbours:-
Comparing Model Performance:-
HyperParameter Fine Tuning:-
Conclusion:-
With more people shopping online during the
pandemic, goods are having to be shipped and
fraudsters know this. they are targeting either
redirecting genuine orders or alternatively placing
fraudulent orders with compromised consumer
accounts to genuine customer addresses, and then
redirecting them once shipped Due to increasing
demand of mobile money transfer, it is necessary to
discover fraud activities during transactions. This is
now inevitable during COVID-19. Discovering illegal
attempts will prevent the customers to be harassed
from financial dispute. The study has been made from
the announcement of Covid-19 to first unlock period
announced by the Government. The main aim of the to
minimize fraud as far as possible. It shows that the
method is practical and is highly suitable for
implementation at the present scenario. This proposed
method is very favourable as of its applicability on large
financial dataset. An efficient and low error system is
required in the field of mobile transaction since it will
notify the customers by triggering deceptive
transactions. It is quite clear that the proposed model is
capable of recognizing suspicious transactions with
promising efficiency. This
proposed method is favourable because of its
applicability on large financial dataset. An efficient and
low error system is required in the field of mobile
transaction since it will notify the customers by
triggering deceptive transactions.
References
[1]A.A. Taha, S.J. Malebary.“An intelligent approach to credit card fraud detection
using an optimized light gradient boosting machine”. IEEE Access, 8 (2020), pp.
25579-25587, 10.1109/ACCESS.2020.2971354
[2] S. Makki, Z. Assaghir, Y. Taher, R. Haque, M. Hacid, H. Zeineddine. “An
experimental study with imbalanced classification approaches for credit card fraud
detection”.IEEE Access, 7 (2019), pp. 93010-
93022,10.1109/ACCESS.2019.2927266
[3] D. Prusti, S.K. Rath.“Web service based credit card fraud detection by applying
machine learning
techniques”.Proceedings of the TENCON 2019 - 2019 IEEE Region 10 Conference
(TENCON), Kochi, India (2019), pp. 492-497,10.1109/TENCON.2019.8929372
[4] Alae Chouiekha, EL Hassane Ibn EL Haj. “ConvNets for Fraud Detection
analysis”. Procedia Computer Science 127, pp.133–138.2018.
[5] Tom Sweers. “Autoencoding Credit Card Fraud”. Bachelor Thesis, Radboud
University. June 2018.