Data Mining: (Kumar, Viswanath and Rao, 2016)

You might also like

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 3

1

Data Mining
model. By dividing the material into smaller
Abstract-- This research paper provides details about the portions, an associated decision tree is gradually
Artificial Intelligent algorithms. This paper provides with built (Kumar, Viswanath and Rao, 2016).
the information about the AI algorithms that are being
used. This paper also contains the working and 2) AIMS AND OBJECTIVES
comparisons of the two algorithms that are used in Data
mining in Banking Sector. This paper explains about the The aim is to use an AI algorithm to solve a
problems related to banking sector that can be solved by problem of Banking Sector. The objective is to
these two algorithms that are Random Forest Tree and
Decision Tree. solve the following problem using Random Forest
Keywords: Random Forest Tree, Decision Tree, Data Tree algorithm and Decision Tree algorithm.
Mining, Artificial Intelligence, Algorithms.
2.1) Problem
1) INTRODUCTION An item is given in promotional efforts by phoning

T he process of obtaining information from a consumer directly. All enrolled consumers are
massive amounts of data is referred to as "data contacted as part of an advertising campaign
mining." Data kinds include spatial data, strategy. In any case, such a quick approach is
multimedia data, time series data, text data, and
internet data. Data mining is the process of ineffective because the great majority of the phoned
extracting fascinating, nontrivial, implicit, clients aren't interested in the provided goods. In
previously unknown, and potentially helpful these circumstances, a large number of calls will be
patterns or information from enormous amounts of
data. It is a collection of activities for identifying wasted, compounding the mission's misery.
new, hidden, or surprising patterns in data, as well However, a few consumers may not be on the call
as unusual data patterns. By exploiting the list, and the mission supervisor may overlook
information included inside a data warehouse, data
mining may frequently uncover answers to clients who may potentially purchase the item.
questions about a firm that a decision maker had 2.2) Solution
not previously considered asking. (Imran Moin
and Baseer Ahmed). The goal of a Direct Marketing Campaign is to
Data mining techniques include data cleansing, achieve the expected arrangement of consumers
data integration, data transformation, data mining,
pattern evaluation, and data display. The Principal with enrolled profiles by calling the customers and
Component Analysis (PCA) deals with data using their information to get to the items provided
dimensionality, whereas the cluster analysis deals directly. To analyses the data and acquire the best
with data likeness. The purpose of cluster analysis
is to discover likely patterns in a set of data. Data results, we will do various tests on decision trees,
containing out-of-the-ordinary observations is random forests, artificial neural networks, and
considered inconsistent and may be eliminated gradient boosting.
using cleaning techniques (Fawei and Ludera,
2020).
Data mining in banking sector is helping the banks 3) RELATED WORK
a lot. Using the Artificial Intelligence techniques,
it can resolve a lot of problems in banking sector. 3.1) Literature Review
The Artificial Intelligence algorithms thar are
being used are Random Forest Tree and Decision
Tree.
Random Forest Tree as its name suggests, it is a 4) METHOD
Supervised Classification Algorithm, as the name
implies, that classifies data by producing a number
of Classifiers with the purpose of improving
prediction accuracy. To analyses a data collection,
the Random Forest techniques are employed, with
trees being formed and individuals being linked to
forecast the class label. It is not suggested to
classify a large quantity of data with a single
classifier since it may result in poorer accuracy
(Gashler, Giraud-Carrier and Martinez, 2009).
Whereas A decision tree is a categorization
approach that use a tree to construct a regression
2

Banking is an intensely competitive sector. It is


aware of political and economic developments in
their home nations as well as throughout the
world. Because of the numerous hazards, many
banks are focusing on improving their
performance by lowering expenses and boosting
revenues. Data is a great asset in the banking
industry since it contains useful knowledge and
fascinating trends. Manual techniques will be used
to make decisions in conservative situations. Data
may be one of a bank's most important assets, but Figure 2: Decision making with data mining.[3]
only if it learns how to extract meaningful insights
from raw data. Data mining allows for the Banking has been a favorite implementation topic
extraction of information from previous data and
the prediction of future outcomes [3]. The for academics with DM talents during the last
conservative decision-making process is depicted decades of the information science revolution since
in Figure 1. Data mining techniques are suited for it is a data-intensive issue. Banks have realized that
making successful decisions in different domains
such as marketing, risk management, fraud information, rather than cash resources, is the new
detection, and customer relationship management. most valuable asset. There are different techniques
Figure 2 shows how data mining may be used to that are being used to do data mining in banking.
make decisions [3].
Classification is one of the machines learning based
model that is used for predicting the future instances
by the given training data set. Classification
algorithms cover a wide range of learning
approaches, including decision trees, Bayesian
classification, Rule-based methods, Memory-based
Learning, and support vector machines, all of which
are widely utilized in a variety of applications and
have demonstrated their accuracy and correctness
[5]. Decision trees are simple to understand and do
not require any subject expertise. The use of a
decision tree to categories a huge amount of data
into appropriate class labels may result in prediction
inaccuracy and uncertain results. Instead of
employing a single classifier, we employed
ensembles, which are a type of decision forest that
Figure 1: Conservative Decision Making[3] uses several classifiers [7]. Random Forests are
simple to understand for both computer specialists
and non-statistical end-users. There is no need for
cross-validation with a random forest, and it is not
over-fitting. To build many classifiers, the Random
Forest employs Adaboost and Bootstrapping
algorithms [9].
3

Figure 3: Visual Representation of Random Forest

Random Forest have some advantages [10]


Precision.
Intuitive in dealing with large datasets.
Handles hundreds of input variables quickly and
easily.
Provides information on essential factors that aren't
included in the categorization.
Methods for guessing missing data are provided.
Missing data is handled without sacrificing
accuracy.
Prototypes are used to provide information (meta
data) on the relationship between several variables.
Allows for the investigation of variable interactions.

5) REFRENCES
Fawei, T. and Ludera, D.T.J. (2020) Data Mining
Solutions for Direct Marketing Campaign Data
Mining Solutions for Direct Marketing Campaign
View project Data Mining Solutions for Direct
Marketing Campaign.
Gashler, M., Giraud-Carrier, C. and Martinez, T.
(2009) “Decision Tree Ensemble: Small
Heterogeneous Is Better Than Large
Homogeneous,” pp. 900–905.
doi:10.1109/ICMLA.2008.154.
Imran Moin, K. and Baseer Ahmed, Q. (no date)
“Use of Data Mining in Banking,” International
Journal of Engineering Research and Applications
(IJERA), 2, pp. 738–742. Available at:
www.ijera.com (Accessed: January 16, 2022).
Kumar, G.K., Viswanath, P. and Rao, A.A. (2016)
“Ensemble of randomized soft decision trees for
robust classification,” Sādhanā 2016 41:3, 41(3),
pp. 273–282. doi:10.1007/S12046-016-0465-Z.
 

You might also like