
Data Mining(21CS63)

UNIT 2: PRACTICE QUESTIONS


Course Coordinator: Sandhya B R (Sec A)

1. Consider the training examples shown in Table 4a for a binary classification problem.
a) What is the entropy of this collection of training examples with respect to the
positive class?
b) What are the information gains of A1 and A2 relative to these training examples?
c) What is the best split (between A1 and A2) according to the Gini index?

Table 4a
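For questions like 1(a) and 1(b), the entropy and information-gain computations can be checked with a short helper. This is a minimal sketch with generic class-count inputs (the actual counts come from Table 4a, which must be read off separately); the function names are illustrative, not from the question sheet.

```python
import math

def entropy(counts):
    """Entropy (base 2) of a class-count distribution, e.g. [positives, negatives]."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)

def information_gain(parent_counts, child_counts_list):
    """Gain = entropy(parent) - weighted average of child-node entropies."""
    total = sum(parent_counts)
    weighted = sum(sum(child) / total * entropy(child)
                   for child in child_counts_list)
    return entropy(parent_counts) - weighted
```

For example, a node with 5 positive and 5 negative examples has entropy 1.0, and a split producing the pure children [5, 0] and [0, 5] has an information gain of 1.0.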

2. Consider the training examples shown in Table 4.1 for a binary classification problem.

(a) Compute the Gini index for the overall collection of training examples.
(b) Compute the Gini index for the Customer ID attribute.
(c) Compute the Gini index for the Gender attribute.

(d) Compute the Gini index for the Car Type attribute using multiway split.
(e) Compute the Gini index for the Shirt Size attribute using multiway split.
(f) Which attribute is better, Gender, Car Type, or Shirt Size?
(g) Explain why Customer ID should not be used as the attribute test condition even
though it has the lowest Gini.
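Parts (a)-(e) all reduce to the Gini index of a class-count distribution and its weighted average over the partitions of a (possibly multiway) split. A minimal sketch, with the per-partition class counts for Table 4.1 to be filled in by the reader:

```python
def gini(counts):
    """Gini index of a class-count distribution, e.g. [c0_count, c1_count]."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)

def gini_split(child_counts_list):
    """Weighted Gini of a split; one count list per partition (works for multiway splits)."""
    total = sum(sum(child) for child in child_counts_list)
    return sum(sum(child) / total * gini(child) for child in child_counts_list)
```

A perfectly balanced node such as [10, 10] has Gini 0.5, and a split into pure partitions such as [[10, 0], [0, 10]] has weighted Gini 0.0 — the pattern behind part (g): Customer ID produces all-pure singleton partitions and hence the lowest possible Gini, yet generalizes to nothing.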

3. Consider the following data set for a binary class problem:

a) Calculate the information gain when splitting on A and B. Which attribute would the
decision tree induction algorithm choose?
b) Calculate the gain in the Gini index when splitting on A and B. Which attribute would
the decision tree induction algorithm choose?
4. Consider a training set that contains 100 positive examples and 400 negative
examples. For each of the following candidate rules,

R1: A → + (covers 4 positive and 1 negative example),
R2: B → + (covers 30 positive and 10 negative examples),
R3: C → + (covers 100 positive and 90 negative examples),
determine which is the best and worst candidate rule according to:
(a) Rule accuracy.
(b) FOIL’s information gain.
(c) The likelihood ratio statistic.
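The three metrics in question 4 can be computed directly from the coverage counts given above. A sketch (base-2 logarithms throughout; some texts use natural logs for the likelihood ratio statistic, which only rescales it):

```python
import math

P, N = 100, 400  # positive / negative examples in the training set

# (positives covered, negatives covered) for each candidate rule
rules = {"R1": (4, 1), "R2": (30, 10), "R3": (100, 90)}

def accuracy(p, n):
    """Fraction of covered examples that are positive."""
    return p / (p + n)

def foil_gain(p, n):
    """FOIL's information gain relative to the empty rule, which covers P and N."""
    return p * (math.log2(p / (p + n)) - math.log2(P / (P + N)))

def likelihood_ratio(p, n):
    """2 * sum of f_i * log2(f_i / e_i), with expected counts from the class priors."""
    covered = p + n
    e_pos = covered * P / (P + N)
    e_neg = covered * N / (P + N)
    stat = 0.0
    if p > 0:
        stat += p * math.log2(p / e_pos)
    if n > 0:
        stat += n * math.log2(n / e_neg)
    return 2 * stat

for name, (p, n) in rules.items():
    print(name, accuracy(p, n), foil_gain(p, n), likelihood_ratio(p, n))
```

For R1, for instance, accuracy is 4/5 = 0.8, FOIL's gain is 4 × (log2(4/5) − log2(100/500)) = 8, and the expected counts under the priors are 1 positive and 4 negatives, giving a likelihood ratio of 2 × (4·log2(4) + 1·log2(1/4)) = 12. Ranking all three rules under each metric is left as the exercise.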

5. Briefly outline the major steps of decision tree classification.
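The major steps — check the stopping conditions, select an attribute test, partition the records, and recurse on each partition (Hunt's algorithm) — can be sketched as follows. This is a simplified illustration: the attribute is chosen by position rather than by an impurity measure such as information gain or Gini, and the record format is a hypothetical (attribute-dict, label) pair.

```python
from collections import Counter

def build_tree(records, attributes):
    """Recursive decision-tree induction sketch.
    `records` is a list of (attribute_dict, class_label) pairs."""
    labels = [y for _, y in records]
    # Stopping conditions: node is pure, or no attributes remain -> majority-class leaf
    if len(set(labels)) == 1 or not attributes:
        return Counter(labels).most_common(1)[0][0]
    # Select an attribute test (placeholder: first attribute; a real inducer
    # would pick the attribute minimizing impurity)
    attr = attributes[0]
    # Partition the records on the attribute's values and recurse
    branches = {}
    for value in {x[attr] for x, _ in records}:
        subset = [(x, y) for x, y in records if x[attr] == value]
        branches[value] = build_tree(subset, attributes[1:])
    return (attr, branches)
```

On a toy set where attribute A perfectly separates the classes, the sketch returns the single-split tree ("A", {"T": "+", "F": "-"}).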


6. Why is tree pruning useful in decision tree induction? What is a drawback of using a
separate set of tuples to evaluate pruning?
7. Given a decision tree, you have the option of (a) converting the decision tree to rules
and then pruning the resulting rules, or (b) pruning the decision tree and then
converting the pruned tree to rules. What advantage does (a) have over (b)?
8. Discuss the difference between rule-based and class-based ordering schemes.
9. What is the significance of the gain ratio?
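The gain ratio (used by C4.5) normalizes information gain by the split information, penalizing attributes that fragment the data into many small partitions — the Customer ID problem from question 2(g). A minimal sketch, with illustrative function names:

```python
import math

def entropy(counts):
    """Entropy (base 2) of a class-count distribution."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)

def gain_ratio(parent_counts, child_counts_list):
    """Gain ratio = information gain / split information (C4.5)."""
    total = sum(parent_counts)
    gain = entropy(parent_counts) - sum(
        sum(child) / total * entropy(child) for child in child_counts_list)
    # Split information: entropy of the partition *sizes*, ignoring class labels
    split_info = entropy([sum(child) for child in child_counts_list])
    return gain / split_info if split_info > 0 else 0.0
```

A pure two-way split of a balanced node gives gain 1.0 and split information 1.0, hence gain ratio 1.0; a split that fragments the data without separating the classes gets gain 0 and hence ratio 0.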
10. Discuss the working of a rule-based classifier with a suitable example.
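The working of an ordered rule-based classifier can be sketched as a decision list: rules are tried in order, the first rule whose condition matches fires, and a default class handles uncovered records. The rule set below is a hypothetical illustration (vertebrate-style rules), not taken from the question sheet.

```python
def classify(record, rules, default):
    """Ordered rule-based classifier: the first matching rule decides the class."""
    for condition, label in rules:
        if condition(record):
            return label
    return default  # default rule for records no rule covers

# Hypothetical ordered rule list for illustration
rules = [
    (lambda r: r["blood_type"] == "warm" and r["gives_birth"] == "yes", "mammal"),
    (lambda r: r["blood_type"] == "cold", "reptile"),
]
```

Here {"blood_type": "warm", "gives_birth": "yes"} fires the first rule and is classified as mammal; a warm-blooded, non-birth-giving record matches no rule and falls through to the default class.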
