Module 14

Uploaded by

tousif Ahmed

0% found this document useful (0 votes)

29 views6 pages

Original Title

Module-14 (1)

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as docx, pdf, or txt

0% found this document useful (0 votes)

29 views6 pages

Module 14

Uploaded by

tousif Ahmed

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as docx, pdf, or txt

Jump to Page

You are on page 1of 6

Search inside document

Topic(s): Decision Tree & Random Forest

1.) A cloth manufacturing company is interested to know about the segment or

attributes contributing to high sale. Approach - A decision tree & random forest
model can be built with target variable 'Sale' (we will first convert it into categorical
variable) & all other variables will be independent in the analysis.

ANSWER:

Business Problem: To know what segments or attributes that are contributing to high sale.

Import the Dataset and Data Preprocessing:

- Converting Sales column into categorical variable
- Removing the 1st column
- Exploratory Data Analysis
- Graphical Representation
- Shuffled the dataset

Data Partition:
- Checking the proportion of class variable and dividing dataset into test and train.

Model Building:
- DECISION TREE
- Train accuracy : 1.0 and Test accuracy : 0.75
- RANDOM FOREST
- Train accuracy : 1.0 and Test accuracy : 0.8

2.) Divide the data (Diabetes) into training and test datasets and create a Random
Forest Model to classify 'Class Variable'.

Answer:
Business Problem: Classify if the person has diabetes or not
Import the dataset and Data Preprocessing:
- Graphical Representation
- Label encoding is done
- Predictors and target columns are rearranged and set for partition
Data Partition:
- Checking the proportion of the output variable. Test data taken as 30% and train as
70%
Model Building:
- Train accuracy is 77.05% and test accuracy is 94.13%.

3.) Use decision trees & random forest algorithm to prepare a model on fraud data,
treating those who have taxable_income <= 30000 as "Risky" and others are "Good".

ANSWER:
Business Problem: Classify the people having a taxable_income <= 30000 as "Risky" and
others are "Good
Import the data set and Data Preprocessing:
- Dataset is imported
- 476 Good customers and 124 risky customers.
- Prediction and confusion matrix- training data.
- Looks like the first six predicted value and original value matches.
- Train accuracy is 1.0 and test accuracy is 0.661 using decision tree
- On the graph, if taxable income is 3000 or greater than they are good customers else
those are risky customers.
- Train accuracy is 0.99 and test accuracy is 0.71 using random forest model.

Hints:
1. Business Problem
1.1. Objective
1.2. Constraints (if any)
2. Data Pre-processing
2.1 Data cleaning, Feature Engineering, EDA etc.
3. Model Building
3.1 Partition the dataset
3.2 Model(s) - Reasons to choose any algorithm
3.3 Model(s) Improvement steps
3.4 Model Evaluation
3.5 Python and R codes
4. Deployment
4.1 Deploy solutions using R shiny and Python Flask.
5. Result Share the benefits/impact of the solution - how or in what way the business (client)
gets benefit from the solution provided.

Note:
© 2013 - 2020 360DigiTMG. All Rights Reserved.
1. For each assignment the solution should be submitted in the format
2. Research and Perform all possible steps for improving the model(s) accuracy
Ex: Feature Engineering, Hyper Parameter tuning etc.
3. All the codes (executable programs) are running without errors
4. Documentation of the module should be submitted along with R & Python codes, elaborating
on every step mentioned here

The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Rating: 4 out of 5 stars
4/5 (5820)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Rating: 4 out of 5 stars
4/5 (1093)
Never Split the Difference: Negotiating As If Your Life Depended On It
From Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Rating: 4.5 out of 5 stars
4.5/5 (852)
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
Rating: 4 out of 5 stars
4/5 (609)
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
Rating: 4.5 out of 5 stars
4.5/5 (1717)
Grit: The Power of Passion and Perseverance
From Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Rating: 4 out of 5 stars
4/5 (590)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
From Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Rating: 4 out of 5 stars
4/5 (898)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Rating: 4 out of 5 stars
4/5 (1104)
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Rating: 4.5 out of 5 stars
4.5/5 (540)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
Rating: 4.5 out of 5 stars
4.5/5 (2104)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
From Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Rating: 4.5 out of 5 stars
4.5/5 (349)
Bad Feminist: Essays
From Everand
Bad Feminist: Essays
Roxane Gay
Rating: 4 out of 5 stars
4/5 (1018)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
From Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Rating: 4.5 out of 5 stars
4.5/5 (474)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
Rating: 4 out of 5 stars
4/5 (1867)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Rating: 4 out of 5 stars
4/5 (822)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
From Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Rating: 4.5 out of 5 stars
4.5/5 (122)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
Rating: 4.5 out of 5 stars
4.5/5 (441)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Rating: 4.5 out of 5 stars
4.5/5 (271)
Brooklyn: A Novel
From Everand
Brooklyn: A Novel
Colm Toibin
Rating: 3.5 out of 5 stars
3.5/5 (1947)
The Little Book of Hygge: Danish Secrets to Happy Living
From Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Rating: 3.5 out of 5 stars
3.5/5 (403)
A Man Called Ove: A Novel
From Everand
A Man Called Ove: A Novel
Fredrik Backman
Rating: 4.5 out of 5 stars
4.5/5 (4771)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Rating: 3.5 out of 5 stars
3.5/5 (2259)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
Rating: 4.5 out of 5 stars
4.5/5 (808)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
From Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Rating: 4.5 out of 5 stars
4.5/5 (266)
A Tree Grows in Brooklyn
From Everand
A Tree Grows in Brooklyn
Betty Smith
Rating: 4.5 out of 5 stars
4.5/5 (1929)
The Art of Racing in the Rain: A Novel
From Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Rating: 4 out of 5 stars
4/5 (4208)
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Rating: 4 out of 5 stars
4/5 (98)
Yes Please
From Everand
Yes Please
Amy Poehler
Rating: 4 out of 5 stars
4/5 (1903)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Rating: 3.5 out of 5 stars
3.5/5 (231)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Rating: 4.5 out of 5 stars
4.5/5 (234)
The Woman in Cabin 10
From Everand
The Woman in Cabin 10
Ruth Ware
Rating: 3.5 out of 5 stars
3.5/5 (2522)
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
Rating: 3.5 out of 5 stars
3.5/5 (738)
Wolf Hall: A Novel
From Everand
Wolf Hall: A Novel
Hilary Mantel
Rating: 4 out of 5 stars
4/5 (3973)
John Adams
From Everand
John Adams
David McCullough
Rating: 4.5 out of 5 stars
4.5/5 (2409)
On Fire: The (Burning) Case for a Green New Deal
From Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Rating: 4 out of 5 stars
4/5 (74)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
Rating: 4.5 out of 5 stars
4.5/5 (789)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
Rating: 3.5 out of 5 stars
3.5/5 (792)
The Constant Gardener: A Novel
From Everand
The Constant Gardener: A Novel
John le Carré
Rating: 3.5 out of 5 stars
3.5/5 (104)
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
Rating: 4 out of 5 stars
4/5 (45)
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Rating: 3.5 out of 5 stars
3.5/5 (137)
Little Women
From Everand
Little Women
Louisa May Alcott
Rating: 4 out of 5 stars
4/5 (105)
Cryptographic Hardware and Embedded Systems CHES 2016
Document649 pages
Cryptographic Hardware and Embedded Systems CHES 2016
saurabhade
No ratings yet
Business - Modelling - Full
Document34 pages
Business - Modelling - Full
tousif Ahmed
No ratings yet
Mcom Business Plan
Document15 pages
Mcom Business Plan
tousif Ahmed
No ratings yet
2012 African Elephants
Document2 pages
2012 African Elephants
tousif Ahmed
No ratings yet
Crisis at Bhilpur
Document9 pages
Crisis at Bhilpur
tousif Ahmed
No ratings yet
Kohler
Document25 pages
Kohler
tousif Ahmed
No ratings yet
Managerial Report On African Elephant Populations
Document8 pages
Managerial Report On African Elephant Populations
tousif Ahmed
No ratings yet
Managerial Communication Non Verbal
Document7 pages
Managerial Communication Non Verbal
tousif Ahmed
No ratings yet
Grocery Purchase Behavior 150052
Document18 pages
Grocery Purchase Behavior 150052
tousif Ahmed
No ratings yet
Cognitive Biases & Its Impact On Communication
Document13 pages
Cognitive Biases & Its Impact On Communication
tousif Ahmed
No ratings yet
Banker'S Algorithm
Document17 pages
Banker'S Algorithm
Muhammad Faisal
No ratings yet
Factoring GCF - Demo Deped - Edited
Document35 pages
Factoring GCF - Demo Deped - Edited
romeo escarial
No ratings yet
CS 229 Project Report: Predicting Used Car Prices
Document5 pages
CS 229 Project Report: Predicting Used Car Prices
shantanu singh
100% (1)
CST201 Data Structures, December 2021
Document2 pages
CST201 Data Structures, December 2021
Anas Ansar
No ratings yet
19 Web Mining 2
Document41 pages
19 Web Mining 2
Chellamuthu Haripriya
No ratings yet
RT Exercises
Document220 pages
RT Exercises
Jhonny t
No ratings yet
Stability Analysis and Controller Tuning
Document19 pages
Stability Analysis and Controller Tuning
Jenny Azzahra
No ratings yet
Wavelet Transform
Document28 pages
Wavelet Transform
kaka_dinh_1
No ratings yet
Knowledge-Based Systems: Oscar Araque, Ganggao Zhu, Carlos A. Iglesias
Document14 pages
Knowledge-Based Systems: Oscar Araque, Ganggao Zhu, Carlos A. Iglesias
laddu
No ratings yet
Robotic Disassembly Line Balancing Problem A Mathematical Model and Ant Colony Optimization Approach
Document24 pages
Robotic Disassembly Line Balancing Problem A Mathematical Model and Ant Colony Optimization Approach
Ahmet Koç
No ratings yet
Tabel Seismologi PDF
Document168 pages
Tabel Seismologi PDF
Dewi Yuanita
No ratings yet
DS-DS Lab-1
Document4 pages
DS-DS Lab-1
Chaitanya 2003
No ratings yet
Data Structure MCQ
Document16 pages
Data Structure MCQ
Maaz Adhoni
No ratings yet
Doubly Linked List: - Ed. 2 and 3.: Chapter 4 - Ed. 4: Chapter 3
Document47 pages
Doubly Linked List: - Ed. 2 and 3.: Chapter 4 - Ed. 4: Chapter 3
sh00008
No ratings yet
Solution of The Sample Problem of CPM
Document2 pages
Solution of The Sample Problem of CPM
syed emon
100% (3)
Financial Management: Academy of Accounts
Document10 pages
Financial Management: Academy of Accounts
Ashish
No ratings yet
EE 650A - Assignment 8 (2020-21-I)
Document2 pages
EE 650A - Assignment 8 (2020-21-I)
Pradeep Kumar
No ratings yet
Full Chapter Machine Learning With Lightgbm and Python 2Nd Edition Andrich Van Wyk PDF
Document53 pages
Full Chapter Machine Learning With Lightgbm and Python 2Nd Edition Andrich Van Wyk PDF
nancey.johnsen783
100% (2)
Co Integration
Document42 pages
Co Integration
Fernando B. Sabino da Silva
No ratings yet
Design and Analysis of Algorithms-TCS-503
Document3 pages
Design and Analysis of Algorithms-TCS-503
Adya Sharma
No ratings yet
TOC - Central Concept of Automata Theory
Document10 pages
TOC - Central Concept of Automata Theory
Dilshad Pt
100% (2)
Evans Analytics2e PPT 10 Data Mining
Document69 pages
Evans Analytics2e PPT 10 Data Mining
Cajetan1957
No ratings yet
UCS551 Chapter 7 - Clustering
Document6 pages
UCS551 Chapter 7 - Clustering
nur ashfaraliana
No ratings yet
CES513 - Notes - Topic 1.2
Document9 pages
CES513 - Notes - Topic 1.2
Athirah Dinata
No ratings yet
Personalised AI Mastery Guide - My HandCrafted
Document25 pages
Personalised AI Mastery Guide - My HandCrafted
raju bp
No ratings yet
Image Steganography and Steganalysis A Survey
Document12 pages
Image Steganography and Steganalysis A Survey
Iwp 201
No ratings yet
Reinforcement Learning
Document2 pages
Reinforcement Learning
mohith
No ratings yet
A Dynamic Programming Algorithm For The Shortest Path Problem With Time Windows and Linear Node Costs
Document13 pages
A Dynamic Programming Algorithm For The Shortest Path Problem With Time Windows and Linear Node Costs
Divyanshu Shukla
No ratings yet
RSA Cryptosystem Using Python
Document3 pages
RSA Cryptosystem Using Python
Editor IJTSRD
No ratings yet