
Randomized Decision Trees II

compiled by Alvin Wan from Professor Jitendra Malik's lecture

1 Feature Selection

Note that depth-limited trees have a finite number of combinations.

1.1 Randomized Feature Selection

Suppose X is 1000-dimensional. We can randomly select a subset of features to create a new
decision tree. This is called randomized feature selection. If the features are nominal,
such as hair color, our questions will simply compare categories: black or brown? brown?
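As a rough sketch of this idea in Python (the subset size of 32 features and the count of 10 trees are illustrative choices, not values from the lecture), each tree draws its own random subset of feature indices:

import numpy as np

def random_feature_subset(n_features, subset_size, rng):
    # Pick a random subset of feature indices for one tree.
    return rng.choice(n_features, size=subset_size, replace=False)

# Example: X is 1000-dimensional; each tree sees only 32 random features.
rng = np.random.default_rng(0)
n_features, subset_size, n_trees = 1000, 32, 10

feature_subsets = [random_feature_subset(n_features, subset_size, rng)
                   for _ in range(n_trees)]

# Tree t would then be trained on the columns X[:, feature_subsets[t]].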

2 Boosting

Our first intuition is the wisdom of the crowds. Our second is that we want experts for
different types of samples. In other words, some trees perform better on particular samples.
How do we give each tree a different weight? This note will cover only the algorithm and
not the proof.

2.1 Intuition

Let us consider a trimmed version of the boosting algorithm as first proposed.

1. Train weak learner.

2. Get weak hypothesis, h_t : X → {−1, +1}.

3. Choose α_t.

Here, h_t is a single decision tree with error rate ε_t, and α_t is the weighting for decision tree t.

\alpha_t = \frac{1}{2} \ln \frac{1 - \epsilon_t}{\epsilon_t}

The first thing to notice is that the accuracy 1 − ε_t is at least 0.5 for a binary classification
problem. Any less (say, 0.45) and we can simply invert the classifier's predictions for a higher
accuracy (1 − 0.45 = 0.55). Consider the worst-case scenario, where ε_t = 0.5. Plugging in
ε_t = 0.5, we get α_t = 0, as expected.
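To see this behavior concretely, here is a minimal Python sketch (the helper name alpha is ours, not from the lecture) that evaluates the formula for a few error rates:

import math

def alpha(error_rate):
    # Weight for a decision tree with error rate epsilon_t.
    return 0.5 * math.log((1 - error_rate) / error_rate)

print(alpha(0.5))   # 0.0    -> a coin-flip tree gets no weight
print(alpha(0.25))  # ~0.55  -> a better-than-chance tree gets positive weight
print(alpha(0.75))  # ~-0.55 -> worse than chance; the negative weight
                    #           amounts to inverting its predictions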

We pick this weighting per the error rate of a decision tree. Create a classifier h_1, compute
its error ε_1 and weight α_1. Repeat this for the second, third, etc. trees. Here is our scheme
to train an expert: take the samples that h_1 classified incorrectly and train h_2 on those.
Take the samples that h_2 classified incorrectly and train h_3 on those. We can continue in this
fashion to produce an expert. This is the intuition, but in reality, we will instead give more
weight to the samples that h_1 classified incorrectly.

2.2 Full Algorithm

Now, let us consider the original boosting algorithm in its full glory.

1. Train weak learner with distribution D_t.

2. Get weak hypothesis, h_t : X → {−1, +1}.

3. Choose α_t.

4. Update D_t to obtain D_{t+1}.

We compute a probability distribution D_t over the samples. We know that Σ_i D_t(i) = 1,
and we can initialize D_1 to be

D_1(i) = \frac{1}{n}, \quad \forall i

For each sample, we multiply its old weight by a factor, which effectively gives more weight
to examples that were classified incorrectly. First, note that for a correctly classified sample,
we weight its entry in D_{t+1} less, by making our factor e^{−α_t}. Otherwise, for an incorrectly
classified sample, our factor is e^{α_t}, which scales its weight in D_{t+1} up.

D_{t+1}(i) = \frac{D_t(i)}{Z_t} \times
\begin{cases}
e^{-\alpha_t} & \text{if } h_t(x_i) = y_i \\
e^{\alpha_t} & \text{if } h_t(x_i) \neq y_i
\end{cases}

We can summarize this update as follows.

D_{t+1}(i) = \frac{D_t(i) \, \exp(-\alpha_t \, y_i \, h_t(x_i))}{Z_t}
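Putting the four steps together, here is a minimal Python sketch of the loop, assuming a hypothetical helper train_weak_learner(X, y, D) that fits a weak classifier under sample weights D and returns a prediction function; the final sign-of-weighted-vote classifier at the end is the standard way such α_t weights are combined, although the note itself stops at the update rule.

import numpy as np

def boost(X, y, train_weak_learner, n_rounds):
    # Labels y are in {-1, +1}. We assume each weak learner's weighted
    # error rate eps stays in (0, 0.5], so alpha is finite and >= 0.
    n = len(y)
    D = np.full(n, 1.0 / n)                       # D_1(i) = 1/n for all i
    hypotheses, alphas = [], []

    for t in range(n_rounds):
        h = train_weak_learner(X, y, D)           # step 1: train weak learner
        pred = h(X)                               # step 2: weak hypothesis h_t
        eps = np.sum(D[pred != y])                # weighted error rate eps_t
        alpha = 0.5 * np.log((1 - eps) / eps)     # step 3: choose alpha_t

        # Step 4: D_{t+1}(i) = D_t(i) exp(-alpha_t y_i h_t(x_i)) / Z_t
        D = D * np.exp(-alpha * y * pred)
        D /= D.sum()                              # dividing by the sum is Z_t

        hypotheses.append(h)
        alphas.append(alpha)

    # Combine the weak learners by an alpha-weighted vote.
    def H(X_new):
        votes = sum(a * h(X_new) for a, h in zip(alphas, hypotheses))
        return np.sign(votes)

    return H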
