DA Question Bank UNIT IV

ZEAL EDUCATION SOCIETY’S
ZEAL COLLEGE OF ENGINEERING AND RESEARCH

NARHE │PUNE -41 │ INDIA
DEPARTMENT OF COMPUTER ENGINEERING
Name of the Teacher: Ms. P. S. Patil
Class: BE Subject: DATA ANALYTICS

AY: 2020-21 SEM: I
UNIT-IV
1) What is decision tree?
a) Flow-Chart
b) Structure in which internal node represents test on an attribute, each branch
represents outcome of test and each leaf node represents class label
c) Flow-Chart & Structure in which internal node represents test on an attribute,
each branch represents outcome of test and each leaf node represents class label
d) None of the mentioned
Ans: c
Explanation:
2) What is classification
a) when the output variable is a category such as red or blue, disease or
no disease
b) when output variable is real value such as dollar weight
c) when dataset is unlabelled
d) when no input is there
Ans: a
Explanation:
3) Chance Nodes are represented by __________
a) Disks
b) Squares
c) Circles
d) Triangles
Ans: c
Explanation:
4) What is supervised learning ?
a) All data is unlabelled and the algorithms learned to inherent
structure from the input data
b) All data is labelled and the algorithms learn to predict the output
from the input data
c) It is the framework for learning where an agent interacts with an
environment and receives reward for each interaction
d) Some data is labelled but most of it is unlabelled and a mixture of
supervised unsupervised techniques can be used .
Ans: b
Explanation:
5) What is unsupervised learning ?

from the input data
supervised unsupervised techniques can be used .
Ans: a
Explanation:
6) What is semi supervised learning ?
from the input data
supervised unsupervised techniques can be used
Ans: d
Explanation:
7) Which of the following are the pros of Decision Trees?
a). Possible Scenarios can be added
b). Use a white-box model, If a particular result is provided by a model
c). best, Worst and expected values can be determined for different scenarios
d). All of these
Ans: d
Explanation:
8) How to represent Decision Nodes?
a) . Disks
b) Squares
c) Circles
d) Triangles
Ans: b
Explanation:
9) Choose from the following that are Decision Tree nodes?
a)Decision Nodes
b) End Nodes
c) Chance Nodes
d) All of the mentioned
Ans: d
Explanation:
10) Which of the following are the advantage/s of Decision Trees?
Possible Scenarios can be added
b) Use a white box model, If given result is provided by a model
c) Worst, best and expected values can be determined for different scenarios
d) All of the mentioned
Ans: d
Explanation:
11) Previous probabilities in Bayes Theorem that are changed with help of new
available information are classified as _________________
a) independent probabilities
b) posterior probabilities
c) interior probabilities
d) dependent probabilities
Ans: b
Explanation:
12) Tools to evaluate classifier performance
a.Confusion matrix
b.Accuracy
c.Precision
d.Recall
Ans: a
Explanation:
13) Select Ensemble methods that use multiple models
a) Bagging: bootstrap method that uses repeated sampling with
b) Boosting: similar to bagging but iterative procedure
c) Random forest
d) .All of above
Ans: d
Explanation:
14) what is Smoothing.
a)small nonzero probability to rare events that are missing in the training data
b)small nonzero probability to rare events that are missing in the testing data
c) small nonzero probability to rare events that are missing in the production data
d)small zero probability to rare events that are missing in the training data
Ans: a
Explanation:
16) Underfitting means…………
a) the model isn't able to capture the dynamics shown by the same
training set
b) the model has an excessive capacity and it's not more able to
generalize considering the original dynamics provided by the
training set.
c) Both a and b
d) the model isn't able to capture the dynamics shown by the same as
well as different data set
Ans: a
Explanation:
17) Overfitting means ……………
a) the model isn't able to capture the dynamics shown by the same
training set
b) the model has an excessive capacity and it's not more able to
generalize considering the original dynamics provided by the
training set.
c) Both a and b
d) the model isn't able to capture the dynamics shown by the same as
well as different data set
Ans: b
Explanation:
18) What is disadvantage of decision trees
a) Sensitive to large variations in the training data

b) Overfitting can occur because each split reduces training data for
subsequent splits
c) Poor if dataset contains many relevant variables
d) not providing accurate results
Ans: b
Explanation:
19) Suppose your model is underfitting ,which of the following is not a valid
way to try and reduce the underfitting
a) Increase the amount of training data set
b) Improve the optimization algorithm being used for error
minimization
c) Decreased the model complexity
d) Reduce the noise of training data
Ans: a
Explanation:
20) Suppose your model is overfitting ,which of the following is not a valid way
to try and reduce the overfitting
a) Increase the amount of training data set
b) Improve the optimization algorithm being used for error
minimization
c) Decreased the model complexity
d) Reduce the noise of training data
Ans: b
Explanation:

DA Question Bank UNIT IV

Uploaded by

Copyright:

Available Formats

You might also like

DA Question Bank UNIT IV

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

DA Question Bank UNIT IV

Uploaded by

Copyright:

Available Formats

ZEAL EDUCATION SOCIETY’S

ZEAL COLLEGE OF ENGINEERING AND RESEARCH

Name of the Teacher: Ms. P. S. Patil

Class: BE Subject: DATA ANALYTICS

5) What is unsupervised learning ?

9) Choose from the following that are Decision Tree nodes?

a) Sensitive to large variations in the training data

You might also like