Synopsis Machine Learning


MACHINE LEARNING
BUSINESS REPAYMENT DATE PREDICTION
Made with Jupyter

OUR GROUP:

1. Vismay Agarwal (CSE/074/19) - Team Lead
2. Aayush Marwah (CSE/091/19) - Research Analyst
3. Pushkar Pandey (CSE/101/19) - Debugging
4. Nadeem Ahmed (CSE/088/19) - Quality
5. Shivansh Kumar Singh (CSE/117/19) - Testing


Made possible with the esteemed guidance of:

Prof. Tapan Kumar Dey
&
Prof. Rahul Ranjan
Objective

In the business industry, manufacturers provide goods on credit in exchange for the promise of repayment once the goods are sold.

If the borrower repays the loan, the lender earns a profit. However, if the borrower fails to repay, the lender loses money and may even have to cease manufacturing operations.

Lenders therefore face the problem of predicting the risk of a borrower being unable to repay a loan, or delaying its repayment.

Use Case: In this study, data from Lending Club is used to train several machine learning models to determine whether a borrower is able to repay a loan within the promised time.

In addition, we analyze the performance of four models: Random Forest, Logistic Regression, Support Vector Machine, and K-Nearest Neighbors.

As a result, the logistic regression model is found to be the optimal predictive model, and FICO score and annual income are expected to influence the forecast significantly.
Methodology

Null Elimination
Pre-Processing
Feature Engineering
EDA (Exploratory Data Analysis)
Label Encoding
Feature Selection
Linear Regression
Decision Tree
NULL
ELIMINATION

Finds empty columns in the data set. Returns true for a parameter if a null column is found, else returns false. If a null column parameter is found, we drop that parameter, i.e., remove it from the data set.
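The step above can be sketched in pandas; the column names below are hypothetical examples, not from the actual data set:

```python
import pandas as pd

def drop_null_columns(df: pd.DataFrame) -> pd.DataFrame:
    """Drop every column that contains only null values."""
    null_cols = [col for col in df.columns if df[col].isnull().all()]
    return df.drop(columns=null_cols)

# Hypothetical example: 'empty_col' is entirely null, so it is removed.
df = pd.DataFrame({"amount": [100, 250, 80], "empty_col": [None, None, None]})
cleaned = drop_null_columns(df)
print(list(cleaned.columns))  # ['amount']
```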


PRE-PROCESSING

Converts non-readable (i.e., non-int or non-float) target and associated data frames into readable data frames. We do so using regular expressions. Here, each date is split into day, month and year using Python's datetime library.
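A minimal sketch of this parsing step, assuming the raw dates arrive as strings in year-month-day order (the exact raw format in the Lending Club data may differ):

```python
import re
from datetime import datetime

def parse_date_parts(raw):
    """Extract (day, month, year) from a raw date string like '20190126' or '2019-01-26'."""
    digits = re.sub(r"\D", "", str(raw))       # strip any non-digit separators
    dt = datetime.strptime(digits, "%Y%m%d")   # assumes year-month-day ordering
    return dt.day, dt.month, dt.year

print(parse_date_parts("2019-01-26"))  # (26, 1, 2019)
```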


FEATURE
ENGINEERING

After pre-processing, we have the day, month and year of due_in_date, posting_date and clear_date. Instead of storing all three dates in new columns, we find the difference in days between each pair of dates to establish a relationship between them. This saves storage that would otherwise be wasted.
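The pairwise day differences can be computed directly on datetime columns; the sample values below are hypothetical:

```python
import pandas as pd

df = pd.DataFrame({
    "posting_date": pd.to_datetime(["2019-01-10", "2019-02-01"]),
    "due_in_date":  pd.to_datetime(["2019-02-10", "2019-03-03"]),
    "clear_date":   pd.to_datetime(["2019-02-05", "2019-03-10"]),
})

# Store pairwise differences in days instead of three separate date columns.
df["due_minus_posting"] = (df["due_in_date"] - df["posting_date"]).dt.days
df["clear_minus_posting"] = (df["clear_date"] - df["posting_date"]).dt.days
df["clear_minus_due"] = (df["clear_date"] - df["due_in_date"]).dt.days

# A negative clear_minus_due means the invoice was cleared before its due date.
print(df[["due_minus_posting", "clear_minus_posting", "clear_minus_due"]])
```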
EDA

We use displot and boxplot on the training data set to visualize and find outliers. After identifying outliers, we use a scatterplot to check whether there are any trends in the data set. If we find any unique data frames, we can drop them, since they won't affect the prediction.
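The points a boxplot flags as outliers are, numerically, those outside the interquartile-range whiskers. A small sketch of that rule (the sample series is hypothetical):

```python
import pandas as pd

def iqr_outliers(s: pd.Series) -> pd.Series:
    """Return the values a boxplot would flag: outside [Q1 - 1.5*IQR, Q3 + 1.5*IQR]."""
    q1, q3 = s.quantile(0.25), s.quantile(0.75)
    iqr = q3 - q1
    return s[(s < q1 - 1.5 * iqr) | (s > q3 + 1.5 * iqr)]

amounts = pd.Series([10, 12, 11, 13, 12, 11, 500])  # 500 is an obvious outlier
print(iqr_outliers(amounts).tolist())  # [500]
```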
LABEL
ENCODING

Now that we have all the necessary data frames, we convert them into machine-readable data, i.e., integer type. This is done using encoding; we have implemented label encoding here, applying it to data frames that are of object type.


Before label encoding: After label encoding:
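Label encoding of object-typed columns can be sketched as below. The column names are hypothetical; the same effect is obtained with scikit-learn's LabelEncoder, while pandas category codes are used here to keep the sketch dependency-free:

```python
import pandas as pd

df = pd.DataFrame({
    "business_code": ["U001", "CA02", "U001"],   # object (string) column
    "amount": [10.0, 20.0, 15.0],                # already numeric, left untouched
})

# Apply label encoding only to object-typed columns: each distinct string
# becomes an integer code (assigned in sorted order of the categories).
for col in df.select_dtypes(include="object").columns:
    df[col] = df[col].astype("category").cat.codes

print(df["business_code"].tolist())  # [1, 0, 1]
```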
FEATURE
SELECTION

We now search for correlations in the train data set using a heatmap. We drop highly correlated items that have a correlation of over 90%. Next, we find the variance in these correlations and drop unique data frames.
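A sketch of dropping one column from each pair with absolute correlation above 0.9; the toy data frame is hypothetical:

```python
import numpy as np
import pandas as pd

def drop_highly_correlated(df: pd.DataFrame, threshold: float = 0.9) -> pd.DataFrame:
    """Drop one column from every pair whose absolute correlation exceeds the threshold."""
    corr = df.corr().abs()
    # Keep only the upper triangle so each pair is considered once.
    upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))
    to_drop = [col for col in upper.columns if (upper[col] > threshold).any()]
    return df.drop(columns=to_drop)

df = pd.DataFrame({
    "a": [1, 2, 3, 4],
    "b": [2, 4, 6, 8],   # perfectly correlated with 'a', so it is dropped
    "c": [5, 1, 4, 2],
})
print(list(drop_highly_correlated(df).columns))  # ['a', 'c']
```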
LINEAR
REGRESSION

Using the linear regression algorithm, we now measure the accuracy between the train data set and the prediction data set. The lineplot graph shows the disturbance between the two data sets. According to the graph and result, the accuracy is only 59.99%.
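The fit-and-score step can be sketched with scikit-learn; the synthetic X and y below stand in for the real features and the repayment-delay target, and `score` here is the R² value on held-out data:

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the numeric feature matrix and delay target.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = X @ np.array([2.0, -1.0, 0.5]) + rng.normal(scale=0.1, size=200)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
model = LinearRegression().fit(X_train, y_train)
accuracy = model.score(X_test, y_test)   # R^2 on held-out data
print(round(accuracy, 4))
```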


DECISION TREE -
RANDOM FOREST
ALGORITHM

Using the decision tree regression algorithm, we again measure the accuracy between the train data set and the prediction data set. The lineplot graph shows that the disturbance between the two data sets is smaller than with linear regression. According to the graph and result, the accuracy is over 99%. Hence we use the decision tree algorithm, and our project is a success.
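The corresponding decision tree sketch, again on synthetic stand-in data with a step-shaped target that a tree captures much better than a straight line would:

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor
from sklearn.model_selection import train_test_split

# Synthetic stand-in data: a step-shaped target suits a tree's axis-aligned splits.
rng = np.random.default_rng(1)
X = rng.uniform(size=(300, 2))
y = (X[:, 0] > 0.5).astype(float) * 10 + X[:, 1]

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=1)
tree = DecisionTreeRegressor(max_depth=5, random_state=1).fit(X_train, y_train)
tree_accuracy = tree.score(X_test, y_test)   # R^2 on held-out data
print(round(tree_accuracy, 4))
```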


Graphical Representation:
CONCLUSION

We did exploratory data analysis on the features of this dataset and saw how each feature is distributed.
We did bivariate and multivariate analysis, using charts to see the impact of the features on one another.
We analysed each variable to check whether the data is clean and normally distributed.
We cleaned the data and removed NA values.
We also generated hypotheses to test for an association between the independent variables and the target variable, and based on the results we assumed whether or not there is an association.
We calculated the correlation between independent variables and found that clear_date and posting_date have a significant relation.
We created dummy variables for constructing the model.
We constructed models taking different variables into account and found through odds ratios that due_in_date, posting_date & baseling_create_date create the most impact on the target decision.
Finally, we got a model with co-applicant income and credit history as independent variables with the highest accuracy.
We tested the data and got an accuracy of 99.97%.
Thank you
