PSR 0607 Chap10


10. Decision Trees

Informal Definition:
Telling things apart

Nominal data
No numeric feature vector
Just a list of properties:
Banana: longish, yellow
Apple: round, medium-sized, different colors
such as red and green, but not blue

Basic Classification Tree

Advantages of Decision Trees


Trees can be interpreted
Possibility to include prior knowledge

Conversion to Binary Tree

Every tree can be converted to a binary tree
(e.g. a three-way split on color becomes a chain of
yes/no questions: "Is it red?" and, if not, "Is it green?")

Consider binary trees for simplicity

Issues when building a decision tree

Structure of tree (binary?)


What are the right questions to ask?
When should we add a node (splitting)?
When should we remove a node (pruning)?

Types of questions
(also for continuous data)

Questions for nominal data (e.g. text)

E.g. for text classification:
Is keyword x present?
Is the frequency of word x larger than y?
Is a key phrase present?

It all depends on your intuition about the task

Usually asking for presence/absence of words
is a good baseline (see the sketch below)
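As an illustration of this baseline, a minimal sketch that maps documents to binary presence/absence features; the keyword list is hypothetical, not from the slides:

```python
# Hypothetical keyword list; in practice chosen per task.
KEYWORDS = ["free", "money", "meeting", "deadline"]

def presence_features(document):
    """Map a document to a 0/1 vector: is keyword i present?"""
    tokens = set(document.lower().split())
    return [1 if kw in tokens else 0 for kw in KEYWORDS]

print(presence_features("free money now"))  # [1, 1, 0, 0]
```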

Application of CART to Feature Vectors

Feature vector $(x_1, \dots, x_N)$
Type of questions: $x_i < a$
→ ordinary binary tree

Decision boundary induced by ordinary
binary tree in 2 dimensions

Decision boundaries are always parallel to the axes
Only very simple decision boundaries can be modelled


Decision boundary induced by ordinary
binary tree in 3 dimensions


Difficult case for ordinary binary tree

What does the decision boundary look like?


Difficult case for ordinary binary tree


Simple questions of type $x_i < a$ are not capable of
efficiently describing sophisticated decision boundaries


Better Decision Boundary

What's the corresponding question
and the resulting tree?

Corresponding Tree and Question
However, the number of possible
questions of this type is huge!

Binary space partition (BSP) trees

Type of questions:
$\sum_{i=1}^{d} a_i x_i < b$

Induced partitioning of feature space:
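A one-function sketch of this question type in plain Python; the coefficient and threshold names are illustrative:

```python
# Sketch of a BSP-tree question: is sum_i a_i * x_i < b?
# The coefficients a and threshold b are per-node parameters.
def bsp_question(x, a, b):
    """Is point x on the 'yes' side of the hyperplane a.x = b?"""
    return sum(ai * xi for ai, xi in zip(a, x)) < b

# 0.5*1.0 + (-1.0)*2.0 = -1.5 < 0.0 -> True
print(bsp_question([1.0, 2.0], a=[0.5, -1.0], b=0.0))
```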


Sphere trees

Type of questions:
$\sum_{i=1}^{d} (z_i - x_i)^2 < r$

Induced partitioning of feature space:
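The corresponding sketch for a sphere-tree question (here r plays the role of the squared radius, matching the formula above):

```python
# Sketch of a sphere-tree question: does x lie inside the
# sphere of (squared) radius r around the center point z?
def sphere_question(x, z, r):
    return sum((zi - xi) ** 2 for zi, xi in zip(z, x)) < r

# (0-1)^2 + (0-1)^2 = 2.0 < 4.0 -> True
print(sphere_question([1.0, 1.0], z=[0.0, 0.0], r=4.0))
```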


Nonlinear generalizations

Type of questions:
$\varphi(\mathbf{x}, \mathbf{a}) > 0$
with $\varphi$ some nonlinear function
May be different at each node
Issue: lack of simplicity


Adding nodes: splitting

Impurity of a node
Do all the data at a leaf belong to one class?
Example:
At one of the present nodes, the distribution is
70% Grapefruit
20% Lemons
10% Apples

Introduce $P_N(i)$: the fraction of data
at node N that belongs to class i

What's a measure to decide which node to split next?


Misclassification Impurity

$i(N) = 1 - \max_j P_N(j)$

Measures the fraction of data that does not belong
to the dominant class
Turns out to be too greedy for multiple splits


Entropy Impurity

$i(N) = -\sum_j P_N(j) \log_2 P_N(j)$

Very popular


Gini Impurity

$i(N) = \frac{1}{2} \left( 1 - \sum_j P_N(j)^2 \right)$

Very popular as well
No practical difference to entropy impurity
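A small sketch of all three impurity measures, evaluated on the 70/20/10 node from the earlier example (plain Python; the function names are our own):

```python
import math

# The three impurity measures from the slides, given the
# class probabilities P_N(j) at a node as a list p.
def misclassification(p):
    return 1.0 - max(p)

def entropy(p):
    return -sum(pj * math.log2(pj) for pj in p if pj > 0)

def gini(p):
    return 0.5 * (1.0 - sum(pj ** 2 for pj in p))

# The 70% grapefruit / 20% lemons / 10% apples node from before:
p = [0.7, 0.2, 0.1]
print(misclassification(p))  # 0.30
print(entropy(p))            # ~1.157
print(gini(p))               # 0.23
```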


Comparison of impurity functions:
example for a 2-class problem


Training Algorithms (Greedy)

Until the tree is large enough:
Test all nodes N:
Test all questions Q:
Calculate the change in impurity

$\Delta i(N) = i(N) - P_L \, i(N_L(Q)) - (1 - P_L) \, i(N_R(Q))$

Add the node/question that achieves the largest drop
in impurity (see the sketch below)

$N_L(Q)$: new left node
$N_R(Q)$: new right node
$P_L$: fraction of data in $N_L(Q)$
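A minimal sketch of one greedy step, assuming axis-aligned questions $x_i < a$ and the Gini impurity; the data layout and helper names are assumptions, not from the slides:

```python
# One greedy step: try all axis-aligned questions x_i < a on a
# node's data and keep the largest impurity drop
# delta_i(N) = i(N) - P_L * i(N_L) - (1 - P_L) * i(N_R).
def gini(p):
    return 0.5 * (1.0 - sum(pj ** 2 for pj in p))

def node_impurity(labels):
    n = len(labels)
    return gini([labels.count(c) / n for c in set(labels)])

def best_split(X, y):
    base, best = node_impurity(y), None
    for i in range(len(X[0])):                   # all features
        for a in sorted({x[i] for x in X}):      # all candidate thresholds
            left = [yj for xj, yj in zip(X, y) if xj[i] < a]
            right = [yj for xj, yj in zip(X, y) if xj[i] >= a]
            if not left or not right:            # skip empty splits
                continue
            p_l = len(left) / len(y)
            drop = (base - p_l * node_impurity(left)
                    - (1 - p_l) * node_impurity(right))
            if best is None or drop > best[0]:
                best = (drop, i, a)
    return best  # (impurity drop, feature index, threshold)

print(best_split([[1.0], [2.0], [3.0], [4.0]], ["A", "A", "B", "B"]))
# -> (0.25, 0, 3.0): the question "x_0 < 3.0" separates the classes perfectly
```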


Training of a CART Tree:
Spam-Mail detection (from Hastie et al.)

(Figure: misclassification rate;
green: 10-fold cross-validation, red: test set)

Root node question: frequency of the $ sign



Influence of Amount of Training Data:
Text Classification (from Manning + Schütze)

High variability for small amounts of training data
Saturation for large amounts of data

Pruning a CART-tree
(Text Classification)

Tune the pruning on a validation set



Example for Instability of Tree

Example data for a two-class problem


Example for Instability of Tree


Example for Instability of Tree

Small changes in the data can result in large changes
of the tree; however, classification accuracy is usually stable.

Summary
Decision trees are easy to understand
Decision trees can classify both categorical and
numerical data, but the output attribute must be categorical
Decision tree algorithms are unstable
Trees created from numeric data sets can be quite complex

