Download as txt, pdf, or txt
Download as txt, pdf, or txt
You are on page 1of 2

Question 1: Which of the following methods is used to find the best fit line for

data in Linear Regression?


Answer:Least Square Method

Question 2:What is the minimum number of variables or features required to perform


clustering?
Answer: 1

Question 3:What does R-square measure?


Answer: The explained sum of squares as a proportion of the total sum of squares

Question 4:Which of the following options is true?


Answer: Linear regression error values have to be normally distributed, but it is
not the case in logistic regression

Question 5:Imagine, you are solving a classification problem with a highly


imbalanced class. The majority of the class is observed 99% of the time in the
training data. Your model has 99% accuracy after taking the predictions on test
data. Which of the following is true? 1. Accuracy metric is not good for an
imbalanced class problem. 2. Accuracy metric is good for an imbalanced class
problem. 3. Precision and recall metrics are good for an imbalanced class problem.
4. Precision and recall metrics are not good for an imbalanced class problem.
Answer:1 and 3

Question 6:The technique of detecting hidden structure in unlabeled data is known


as:
Answer:Unsupervised learning

Question 7:What is the minimum number of variables/features required to perform


clustering?
Answer:1

Question 8:In time-series analysis, which source of variation can be estimated by


the ratio-to-trend method?
Answer:Seasonal

Question 9:Which of the following is true about averaging ensemble?


Answer:It can be used in both classification as well as regression problem

Question 10:To test the linear relationship of y (dependent) and x (independent)


continuous variables, which of the following plots is best suited?
Answer:Scatter Plot

Question 11:Advantages of SVM are:


Answer:All of the above options

Question 12:Which of the following is true about Naive Bayes?


Answer:Both A and B

Question 13:When is Ridge regression favorable over Lasso regression?


Answer:Both A and B

Question 14:Which of the following is true for white noise?


Answer:Zero autocovariances, except at lag zero

Question 15:In Ensemble learning, majority vote is used for:


Answer:Classification
Question 16:Which of the following is the correct technique to preprocess data
before performing regression or classification?
Answer:Normalize the data > PCA > training

Question 17:Which of the following machine learning algorithms is based on the


principle of bagging and is extensively used and effective?
Answer:Random Forest

Question 18:In machine learning, an estimator is a(n):


Answer:Instance of the model

Question 19:In machine learning, for which problem type can the KNN algorithm be
used?
Answer:Both classification and regression

Question 20:What is the drawback of decision trees?


Answer:Decision trees are prone to be overfit

Question 21:A _________ is a decision support tool that uses a tree-like graph or
model of decisions and their possible consequences, including chance event
outcomes, resource costs, and utility.
Answer:Decision tree

Question 22:In Logistic Regression, which of the following approaches do we apply


to best fit the data?
Answer:Maximum Likelihood

Question 23:The log of odds function (written as l(x)) is the logit function. What
is the logit function's range in the domain x=[0,1]?
Answer:(?,?,?)

Question 24:What are the features of a random forest algorithm? a) Runs efficiently
on larger databases b) Handles thousands of input variables without variable
deletion c) Provides an estimate of what variables are important in the
classification d) Uses the squared proportion of the classes.
Answer:a,b,c

Question 25:The process of developing a mathematical model or function that can be


used to forecast or predict one variable based on another variable is called:
Answer:Regression

You might also like