Classification and Clustering (HAREN SHARMA (03529802018))
Q1) Which of the following refers to the problem of finding abstracted patterns (or structures) in the unlabeled data?
A Supervised learning
B Unsupervised learning
C Hybrid learning
D Reinforcement learning
Ans B
Q2) Which one of the following refers to querying the unstructured textual data?
A Information access
B Information update
C Information retrieval
D Information manipulation
Ans C
Q3) A telecommunication company wants to segment its customers into distinct groups in order to send appropriate
subscription offers. This is an example of
A Supervised learning
B Data extraction
C Serration
D Unsupervised learning
Ans D
Ans B
A. Unsupervised learning
B. Supervised learning
C. Reinforcement learning
D. Missing data imputation
Ans A
Q6) In the example of predicting the number of babies based on the storks' population size, the number of babies is the
A. outcome
B. feature
C. attribute
D. observation
Ans A
Q7) For what purpose do analysis tools pre-compute summaries of huge amounts of data?
Ans D
Ans A
A The choice of an appropriate metric will influence the shape of the clusters
B Hierarchical clustering is also called HCA
C In general, the merges and splits are determined in a greedy manner
D All of the mentioned
Ans D
Ans B
Ans D
Q12) Point out the wrong statement.
Ans C
Ans D
Q14) Hierarchical clustering should be primarily used for exploration.
a) True
b) False
Ans A
Ans A
a) Partitional
b) Hierarchical
c) Naive Bayes
d) None of the mentioned
Ans C
Q17) K-means is not deterministic and it also consists of a number of iterations.
a) True
b) False
Ans A
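Both parts of this statement can be seen in a small sketch of Lloyd's algorithm on 1-D data. The names `kmeans_1d` and `random_init` and the sample points are hypothetical, chosen for this illustration. The algorithm itself repeats an assignment step and an update step until the centroids stop moving; the non-determinism comes from the initial centroids, which are usually drawn at random, so two runs can converge to different clusterings.

```python
import random


def kmeans_1d(points, init_centroids, max_iter=100):
    """Lloyd's algorithm on 1-D data; returns (centroids, iterations_used)."""
    centroids = list(init_centroids)
    for iteration in range(1, max_iter + 1):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        new_centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
        if new_centroids == centroids:  # converged
            return centroids, iteration
        centroids = new_centroids
    return centroids, max_iter


def random_init(points, k, seed):
    """Pick k distinct data points as starting centroids (assumed strategy)."""
    return random.Random(seed).sample(points, k)


points = [0, 1, 9, 10, 20, 21]

# Two fixed starting positions that end in different clusterings.
print(kmeans_1d(points, [0, 1]))
print(kmeans_1d(points, [20, 21]))

# Random starts: the result depends on which seed is used.
for seed in range(3):
    print(seed, kmeans_1d(points, random_init(points, 2, seed)))
```

On this data, starting from `[0, 1]` groups the points as {0, 1} and {9, 10, 20, 21}, while starting from `[20, 21]` groups them as {0, 1, 9, 10} and {20, 21}. This is why practical implementations often run K-means several times and keep the best result.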
a) Classification
b) Clustering
c) Reinforcement Learning
d) Regression
Ans A and B
A. True
B. False
Ans B
Q20) Which of the following is the most appropriate strategy for data cleaning before performing clustering analysis,
given a less than desirable number of data points: 1) Capping and flooring of variables 2) Removal of outliers
A. 1 only
B. 2 only
C. 1 and 2
D. None of the above
Ans A
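Capping and flooring (option 1) limits extreme values to chosen bounds instead of dropping rows, so the number of data points stays the same. A minimal sketch follows, assuming percentile-based bounds computed with the nearest-rank method; the function names, the 10th/90th percentile defaults, and the sample values are assumptions for illustration only.

```python
import math


def percentile_nearest_rank(sorted_values, pct):
    """Nearest-rank percentile of an already sorted, non-empty list."""
    rank = max(1, math.ceil(pct * len(sorted_values) / 100))
    return sorted_values[rank - 1]


def cap_and_floor(values, lower_pct=10, upper_pct=90):
    """Clip values to the [lower_pct, upper_pct] percentile range."""
    ordered = sorted(values)
    lower = percentile_nearest_rank(ordered, lower_pct)  # floor bound
    upper = percentile_nearest_rank(ordered, upper_pct)  # cap bound
    # Values outside the bounds are pulled to the nearest bound;
    # nothing is removed, so len(result) == len(values).
    return [min(max(v, lower), upper) for v in values]


values = [2, 3, 4, 5, 6, 7, 8, 9, 10, 100]
print(cap_and_floor(values))
```

Here the extreme value 100 is pulled down to the upper bound of 10, while all ten observations remain available for clustering.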