Welcome to Scribd!

Skip carousel

Predective Modelling Business Report Set2

Uploaded by

priyada16

0% found this document useful (0 votes)

4 views11 pages

Original Title

Predective Modelling Business Report set2

Copyright

Available Formats

DOC, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOC, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as doc, pdf, or txt

0% found this document useful (0 votes)

4 views11 pages

Predective Modelling Business Report Set2

Uploaded by

priyada16

Copyright:

Available Formats

Download as DOC, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as doc, pdf, or txt

Jump to Page

You are on page 1of 11

Search inside document

1 Part 2 : Logistic Regression, LDA and CART

1.1 Problem Statement:

You are a statistician at the Republic of Indonesia Ministry of Health and you are provided with a data of
1473 females collected from a Contraceptive Prevalence Survey. The samples are married women who
were either not pregnant or do not know if they were at the time of the survey.

The problem is to predict do/don't they use a contraceptive method of choice based on their
demographic and socio-economic characteristics.

Dataset for Problem 2: Contraceptive_method_dataset.xlsx

1.2 Data Dictionary:

1. Wife's age (numerical)
2. Wife's education (categorical) 1=uneducated, 2, 3, 4=tertiary
3. Husband's education (categorical) 1=uneducated, 2, 3, 4=tertiary
4. Number of children ever born (numerical)
5. Wife's religion (binary) Non-Scientology, Scientology
6. Wife's now working? (binary) Yes, No
7. Husband's occupation (categorical) 1, 2, 3, 4(random)
8. Standard-of-living index (categorical) 1=verlow, 2, 3, 4=high
9. Media exposure (binary) Good, Not good
10. Contraceptive method used (class attribute) No,Yes
1.3 Assumption & Solution:
1.3.1 Data Ingestion: Read the dataset. Do the descriptive statistics and do null
value condition check, check for duplicates and outliers and write an inference on it.
Perform Univariate and Bivariate Analysis and Multivariate Analysis.
Summarizing the PCA India Data Census upon Exploratory Data Analysis

 Shape of the Data set

(1473, 10)
 A Glipmpse of the first 5 rows of the data

 Data Summary

 Statistical Description

 Duplicate Checks
 Null Value checks

Univariate Analysis:

Post Outlier Treatment

Bivariate Analysis:
Multivariate Analysis:
1.3.2 Do not scale the data. Encode the data (having string values) for Modelling.
Data Split: Split the data into train and test (70:30). Apply Logistic Regression and
LDA (linear discriminant analysis) and CART.

Encoding the Data:

feature: Husband_education
['Secondary', 'Primary', 'Tertiary', 'Uneducated']
Categories (4, object): ['Primary', 'Secondary', 'Tertiary',
'Uneducated']
[1 0 2 3]

feature: Wife_religion
['Scientology', 'Non-Scientology']
Categories (2, object): ['Non-Scientology', 'Scientology']
[1 0]

feature: Wife_Working
['No', 'Yes']
Categories (2, object): ['No', 'Yes']
[0 1]

feature: Standard_of_living_index
['High', 'Very High', 'Low', 'Very Low']
Categories (4, object): ['High', 'Low', 'Very High', 'Very Low']
[0 2 1 3]

feature: Media_exposure
['Exposed', 'Not-Exposed']
Categories (2, object): ['Exposed', 'Not-Exposed']
[0 1]

feature: Contraceptive_method_used
['No', 'Yes']
Categories (2, object): ['No', 'Yes']
[0 1]

Post Encoding:
Univariate Analysis post Encoding and Treating Outliers:

Multivariate Analysis post Encoding and Treating Outliers:

Cart Model:
Importance of Features:
Imp
Wife_age 0.335927
No_of_children_born 0.301169
Standard_of_living_index 0.112077
Husband_Occupation 0.075047
Husband_education 0.066805
Wife_ education 0.062557
Wife_Working 0.046418
Wife_religion 0.000000
Media_exposure 0.000000

Pruned Tree:

Imp
Wife_age 0.335927
No_of_children_born 0.301169
Standard_of_living_index 0.112077
Husband_Occupation 0.075047
Husband_education 0.066805
Wife_ education 0.062557
Wife_Working 0.046418
Wife_religion 0.000000
Media_exposure 0.000000

The confusion matrix for train_labels, ytrain_predict

The confusion matrix for test_labels, ytest_predict

1.3.3 Performance Metrics: Check the performance of Predictions on Train and Test
sets using Accuracy, Confusion Matrix, Plot ROC curve and get ROC_AUC score for
each model Final Model: Compare Both the models and write inference which model
is best/optimized.
1.3.4 Inference: Basis on these predictions, what are the insights and
recommendations. Please explain and summarise the various steps performed in this
project. There should be proper business interpretation and actionable insights
present.

ML Interview Questions PDF
Document20 pages
ML Interview Questions PDF
nandex777
100% (4)
Employee Attrition Study Case
Document88 pages
Employee Attrition Study Case
Ahza Azr
No ratings yet
Final Capstone Project Report
Document35 pages
Final Capstone Project Report
chinudash
100% (1)
Top 30 Data Analytics Interview Questions & Answers
Document16 pages
Top 30 Data Analytics Interview Questions & Answers
kumar kumar
No ratings yet
Thyroid Disease Classification Using Machine Learning Project
Document34 pages
Thyroid Disease Classification Using Machine Learning Project
dharugayu13475
No ratings yet
Predictive - Modelling - Project - PDF 1
Document31 pages
Predictive - Modelling - Project - PDF 1
Shiva Kumar
No ratings yet
Hypothesis Testing
Document17 pages
Hypothesis Testing
Jose Maria Arjona Caballero
No ratings yet
Why Are We Using Logistic Regression To Analyze Employee Attrition?
Document4 pages
Why Are We Using Logistic Regression To Analyze Employee Attrition?
Akash Kumar
No ratings yet
Thyroid Disease Classification Using ML
Document37 pages
Thyroid Disease Classification Using ML
dharugayu13475
No ratings yet
Workflow of A Machine Learning Project
Document12 pages
Workflow of A Machine Learning Project
ashish
No ratings yet
Code
Document16 pages
Code
Ali Malik
No ratings yet
ML Unit 2
Document18 pages
ML Unit 2
SUJATA SONWANE
No ratings yet
Machine Learnin-WPS Office PDF
Document11 pages
Machine Learnin-WPS Office PDF
Soham Chatterjee
No ratings yet
Machine Learning: Interview Guide
Document21 pages
Machine Learning: Interview Guide
Ronald Alejandro Chaupin Bautista
No ratings yet
B341 MSA Clinic
Document21 pages
B341 MSA Clinic
nancy chan
No ratings yet
Reportprediction of Employee Atrition Uisng Machine Learning
Document6 pages
Reportprediction of Employee Atrition Uisng Machine Learning
Areena Mahek
No ratings yet
Predicting Credit Card Approvals
Document14 pages
Predicting Credit Card Approvals
as
100% (1)
Ranking Features Based On Predictive Power - Importance of The Class Labels
Document11 pages
Ranking Features Based On Predictive Power - Importance of The Class Labels
Juan
No ratings yet
1624106057@g.us
Document13 pages
1624106057@g.us
Nidhi Mallya
No ratings yet
What Is The Difference Between Test Data and Validation Data
Document2 pages
What Is The Difference Between Test Data and Validation Data
pitaninikhitha
No ratings yet
Guide Data Exploration
Document16 pages
Guide Data Exploration
Hakan Yılmaz
No ratings yet
Problem 2 - Logistic Regression and LDA
Document24 pages
Problem 2 - Logistic Regression and LDA
saarang K
No ratings yet
Data Mining Journal 2 Kashan
Document14 pages
Data Mining Journal 2 Kashan
Kashan Riaz
No ratings yet
4 Solved
Document14 pages
4 Solved
Kinza ALvi
100% (1)
Employee Attrition Prediction
Document21 pages
Employee Attrition Prediction
user user
100% (1)
Data Mining Project DSBA PCA Report Final
Document21 pages
Data Mining Project DSBA PCA Report Final
indraneel120
No ratings yet
Study On Performance Evaluation For Non-Profit Organizations Based On GA-BP Algorithm
Document6 pages
Study On Performance Evaluation For Non-Profit Organizations Based On GA-BP Algorithm
Nitesh Solanki
No ratings yet
PFDA
Document23 pages
PFDA
Leong Chun Hui
No ratings yet
Whole ML PDF 1614408656
Document214 pages
Whole ML PDF 1614408656
Kshatrapati Singh
100% (1)
Project Lit Final1
Document15 pages
Project Lit Final1
poornima
No ratings yet
Data Mining Journal 2 Kashan
Document13 pages
Data Mining Journal 2 Kashan
Kashan Riaz
No ratings yet
Stress Managemnet Report
Document6 pages
Stress Managemnet Report
Kashish Sharma
No ratings yet
Proccalis Sasgf2015
Document10 pages
Proccalis Sasgf2015
sysokuva
No ratings yet
Predictive Modeling Business Report Seetharaman Final Changes PDF
Document28 pages
Predictive Modeling Business Report Seetharaman Final Changes PDF
Ankita Mishra
100% (1)
Assignment1 6
Document5 pages
Assignment1 6
api-337039820
No ratings yet
Assignment of Computer1.odt
Document14 pages
Assignment of Computer1.odt
Muhammad Yousaf
No ratings yet
CSL0777 L08
Document29 pages
CSL0777 L08
Konkobo Ulrich Arthur
No ratings yet
Machine Learning Unit 1
Document72 pages
Machine Learning Unit 1
Manshi Jain
No ratings yet
Evaluation of Predictive Models Final
Document6 pages
Evaluation of Predictive Models Final
Ma. Jessabel Azurin
No ratings yet
ES Lecture Note
Document96 pages
ES Lecture Note
Ming Yan Lo
No ratings yet
All Lectures Quiz's With Answer
Document29 pages
All Lectures Quiz's With Answer
ayeshasghar50
No ratings yet
Web Application
Document13 pages
Web Application
ouiam ouhdifa
No ratings yet
Develop A Program To Implement Data Preprocessing Using
Document19 pages
Develop A Program To Implement Data Preprocessing Using
Fucker Jamun
No ratings yet
All Types of Cross Validation
Document9 pages
All Types of Cross Validation
Priya dharshini.G
No ratings yet
Spss Tasks
Document11 pages
Spss Tasks
Alexandru Neagoe
No ratings yet
NAME-Rajat Gupta Section - B2B2 (Marketing and Analytics) UID - 2019-1706-0001-0007
Document9 pages
NAME-Rajat Gupta Section - B2B2 (Marketing and Analytics) UID - 2019-1706-0001-0007
Rajat Gupta
No ratings yet
Validation Over Under Fir Unit 5
Document6 pages
Validation Over Under Fir Unit 5
Harpreet Singh Bagga
No ratings yet
Anshul Dyundi Predictive Modelling Alternate Project July 2022
Document11 pages
Anshul Dyundi Predictive Modelling Alternate Project July 2022
Anshul Dyundi
No ratings yet
Measurement, Reliability, Validity
Document36 pages
Measurement, Reliability, Validity
Hailemariam Atsbeha
No ratings yet
Advance Statistic Assignment - Sudhanva
Document27 pages
Advance Statistic Assignment - Sudhanva
Sudhanva S
No ratings yet
Ai Powered Medical Diagnosis-Phase 3
Document10 pages
Ai Powered Medical Diagnosis-Phase 3
faizalahamed345690
No ratings yet
Project 1
Document17 pages
Project 1
sachin chaudhary
No ratings yet
Chapter-3-Common Issues in Machine Learning
Document20 pages
Chapter-3-Common Issues in Machine Learning
codeavengers0
No ratings yet
Evaluating A Machine Learning Model
Document14 pages
Evaluating A Machine Learning Model
Jean
No ratings yet
Machine Learning Lpu Notes
Document187 pages
Machine Learning Lpu Notes
Shashank Kumar Verma
No ratings yet
Unit 2
Document63 pages
Unit 2
Mdv Prasad
No ratings yet
AIML Expt
Document7 pages
AIML Expt
D Slm
No ratings yet
Churn Analysis of Bank Customers
Document12 pages
Churn Analysis of Bank Customers
Vaibhav Agarwal
100% (1)
UAS 44 Scoring Explanation
Document6 pages
UAS 44 Scoring Explanation
ROBERTO ENRIQUE GARCÍA AGUILAR
No ratings yet
Laboratory Information System A Complete Guide - 2020 Edition
From Everand
Laboratory Information System A Complete Guide - 2020 Edition
Gerardus Blokdyk
No ratings yet
Using MFP in Stata To Get The Best Polynomial Equation For Stset
Document12 pages
Using MFP in Stata To Get The Best Polynomial Equation For Stset
alicorpanao
No ratings yet
Core Mathematics C12: Pearson Edexcel
Document52 pages
Core Mathematics C12: Pearson Edexcel
Abdulrahim Saiid
No ratings yet
Tutorial Topic 8
Document3 pages
Tutorial Topic 8
aida
No ratings yet
Chapter 2 Simple Linear Regression - Jan2023
Document66 pages
Chapter 2 Simple Linear Regression - Jan2023
Lachyn Seidova
No ratings yet
Learning Objectives Learning Objectives: After Completing This Chapter, Students Will Be Able To
Document25 pages
Learning Objectives Learning Objectives: After Completing This Chapter, Students Will Be Able To
creepy444
No ratings yet
Maxima and Minima
Document14 pages
Maxima and Minima
Nitin Grover
No ratings yet
Differential Equations (MA 1150) : Sukumar
Document52 pages
Differential Equations (MA 1150) : Sukumar
Mansi Nanavati
No ratings yet
MIT18 05S14 Cl5cont Slides PDF
Document12 pages
MIT18 05S14 Cl5cont Slides PDF
MoMo
No ratings yet
Syllabus: ME6603 Finite Element Analysis
Document1 page
Syllabus: ME6603 Finite Element Analysis
Bharathi Kanna
No ratings yet
An HPLC Analysis of Sweeteners in Beverages: CHEM 311L Quantitative Analysis Laboratory
Document6 pages
An HPLC Analysis of Sweeteners in Beverages: CHEM 311L Quantitative Analysis Laboratory
Syahrul Ramadhan
No ratings yet
Riemann Surfaces HW 11 Def
Document2 pages
Riemann Surfaces HW 11 Def
Guifré Sánchez Serra
No ratings yet
Data Analytics
Document4 pages
Data Analytics
Narakatla Srinu
No ratings yet
Stirling's Approximation
Document9 pages
Stirling's Approximation
Muhammad Naeem Khan
No ratings yet
Bisection PDF
Document3 pages
Bisection PDF
Danny Bakani
No ratings yet
Hypothesis - Testing - Mohammed Nayeemuddin
Document6 pages
Hypothesis - Testing - Mohammed Nayeemuddin
Nayeem Uddin
0% (1)
Colorado School of Mines CHEN403 Ziegler-Nichols Example: M Is The Amplitude Ratio. K M P
Document6 pages
Colorado School of Mines CHEN403 Ziegler-Nichols Example: M Is The Amplitude Ratio. K M P
arpit garg
No ratings yet
3.1 Differential Calculus - Integration Course 1 For CE PDF
Document11 pages
3.1 Differential Calculus - Integration Course 1 For CE PDF
KenAbejuela
No ratings yet
MS&E 318 (CME 338) Large-Scale Numerical Optimization: Stanford University, Management Science & Engineering (And ICME)
Document6 pages
MS&E 318 (CME 338) Large-Scale Numerical Optimization: Stanford University, Management Science & Engineering (And ICME)
Jerry
No ratings yet
Faculty of Engineering and Technology Model Question Paper - B. Tech
Document5 pages
Faculty of Engineering and Technology Model Question Paper - B. Tech
Manu a
No ratings yet
Invariants of Characteristics of Some Systems of Partial Differential Equations
Document13 pages
Invariants of Characteristics of Some Systems of Partial Differential Equations
Animesh Saha
No ratings yet
Linear Law
Document13 pages
Linear Law
Erwani Kamaruddin
No ratings yet
Kinesthetic Perception of Human in Architectural Space Ameena Shirin
Document4 pages
Kinesthetic Perception of Human in Architectural Space Ameena Shirin
amla
No ratings yet
IOT Implementation - Dissertation (1-3)
Document23 pages
IOT Implementation - Dissertation (1-3)
amjadpsm
No ratings yet
Funcnd Limcont PDF
Document24 pages
Funcnd Limcont PDF
Surabhi mallick
No ratings yet
Numerical Computing Question Bank Unit I
Document5 pages
Numerical Computing Question Bank Unit I
ankit28345
No ratings yet
Metric Note
Document68 pages
Metric Note
zikibruno
No ratings yet
John Chirillo - Hack Attacks Testing - How To Conduct Your Own Security Audit
Document6 pages
John Chirillo - Hack Attacks Testing - How To Conduct Your Own Security Audit
Shane
No ratings yet
Math 131
Document1 page
Math 131
azizaldebasi1
No ratings yet
Basic Data Analysis: Descriptive Statistics
Document23 pages
Basic Data Analysis: Descriptive Statistics
amul65
No ratings yet