DA Question Bank UNIT IV

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 5

ZEAL EDUCATION SOCIETY’S

ZEAL COLLEGE OF ENGINEERING AND RESEARCH


NARHE │PUNE -41 │ INDIA
DEPARTMENT OF COMPUTER ENGINEERING

Name of the Teacher: Ms. P. S. Patil

Class: BE Subject: DATA ANALYTICS


AY: 2020-21 SEM: I

UNIT-IV
1) What is decision tree?
a) Flow-Chart
b) Structure in which internal node represents test on an attribute, each branch
represents outcome of test and each leaf node represents class label
c) Flow-Chart & Structure in which internal node represents test on an attribute,
each branch represents outcome of test and each leaf node represents class label
d) None of the mentioned

Ans: c
Explanation:
2) What is classification
a) when the output variable is a category such as red or blue, disease or
no disease
b) when output variable is real value such as dollar weight
c) when dataset is unlabelled
d) when no input is there

Ans: a
Explanation:
3) Chance Nodes are represented by __________
a) Disks
b) Squares
c) Circles
d) Triangles
Ans: c
Explanation:
4) What is supervised learning ?
a) All data is unlabelled and the algorithms learned to inherent
structure from the input data
b) All data is labelled and the algorithms learn to predict the output
from the input data
c) It is the framework for learning where an agent interacts with an
environment and receives reward for each interaction
d) Some data is labelled but most of it is unlabelled and a mixture of
supervised unsupervised techniques can be used .

Ans: b
Explanation:
ZEAL EDUCATION SOCIETY’S
ZEAL COLLEGE OF ENGINEERING AND RESEARCH
NARHE │PUNE -41 │ INDIA
DEPARTMENT OF COMPUTER ENGINEERING

5) What is unsupervised learning ?


a) All data is unlabelled and the algorithms learned to inherent
structure from the input data
b) All data is labelled and the algorithms learn to predict the output
from the input data
c) It is the framework for learning where an agent interacts with an
environment and receives reward for each interaction
d) Some data is labelled but most of it is unlabelled and a mixture of
supervised unsupervised techniques can be used .

Ans: a
Explanation:
6) What is semi supervised learning ?
a) All data is unlabelled and the algorithms learned to inherent
structure from the input data
b) All data is labelled and the algorithms learn to predict the output
from the input data
c) It is the framework for learning where an agent interacts with an
environment and receives reward for each interaction
d) Some data is labelled but most of it is unlabelled and a mixture of
supervised unsupervised techniques can be used

Ans: d
Explanation:
7) Which of the following are the pros of Decision Trees?
a). Possible Scenarios can be added
b). Use a white-box model, If a particular result is provided by a model
c). best, Worst and expected values can be determined for different scenarios
d). All of these

Ans: d
Explanation:
8) How to represent Decision Nodes?
a) . Disks
b) Squares
c) Circles
d) Triangles

Ans: b
Explanation:
ZEAL EDUCATION SOCIETY’S
ZEAL COLLEGE OF ENGINEERING AND RESEARCH
NARHE │PUNE -41 │ INDIA
DEPARTMENT OF COMPUTER ENGINEERING

9) Choose from the following that are Decision Tree nodes?

a)Decision Nodes
b) End Nodes
c) Chance Nodes
d) All of the mentioned
Ans: d
Explanation:
10) Which of the following are the advantage/s of Decision Trees?
Possible Scenarios can be added
b) Use a white box model, If given result is provided by a model
c) Worst, best and expected values can be determined for different scenarios
d) All of the mentioned

Ans: d
Explanation:
11) Previous probabilities in Bayes Theorem that are changed with help of new
available information are classified as _________________

a) independent probabilities
b) posterior probabilities
c) interior probabilities
d) dependent probabilities

Ans: b
Explanation:
12) Tools to evaluate classifier performance
a.Confusion matrix

b.Accuracy

c.Precision

d.Recall

Ans: a

Explanation:
13) Select Ensemble methods that use multiple models
a) Bagging: bootstrap method that uses repeated sampling with
b) Boosting: similar to bagging but iterative procedure
c) Random forest
d) .All of above
ZEAL EDUCATION SOCIETY’S
ZEAL COLLEGE OF ENGINEERING AND RESEARCH
NARHE │PUNE -41 │ INDIA
DEPARTMENT OF COMPUTER ENGINEERING

Ans: d
Explanation:
14) what is Smoothing.
a)small nonzero probability to rare events that are missing in the training data

b)small nonzero probability to rare events that are missing in the testing data

c) small nonzero probability to rare events that are missing in the production data

d)small zero probability to rare events that are missing in the training data
Ans: a
Explanation:
16) Underfitting means…………
a) the model isn't able to capture the dynamics shown by the same
training set
b) the model has an excessive capacity and it's not more able to
generalize considering the original dynamics provided by the
training set.
c) Both a and b
d) the model isn't able to capture the dynamics shown by the same as
well as different data set
Ans: a
Explanation:
17) Overfitting means ……………
a) the model isn't able to capture the dynamics shown by the same
training set
b) the model has an excessive capacity and it's not more able to
generalize considering the original dynamics provided by the
training set.
c) Both a and b
d) the model isn't able to capture the dynamics shown by the same as
well as different data set
Ans: b
Explanation:
18) What is disadvantage of decision trees

a) Sensitive to large variations in the training data


b) Overfitting can occur because each split reduces training data for
subsequent splits
c) Poor if dataset contains many relevant variables
d) not providing accurate results

Ans: b
ZEAL EDUCATION SOCIETY’S
ZEAL COLLEGE OF ENGINEERING AND RESEARCH
NARHE │PUNE -41 │ INDIA
DEPARTMENT OF COMPUTER ENGINEERING

Explanation:
19) Suppose your model is underfitting ,which of the following is not a valid
way to try and reduce the underfitting
a) Increase the amount of training data set
b) Improve the optimization algorithm being used for error
minimization
c) Decreased the model complexity
d) Reduce the noise of training data
Ans: a
Explanation:
20) Suppose your model is overfitting ,which of the following is not a valid way
to try and reduce the overfitting
a) Increase the amount of training data set
b) Improve the optimization algorithm being used for error
minimization
c) Decreased the model complexity
d) Reduce the noise of training data
Ans: b
Explanation:

You might also like