Professional Documents
Culture Documents
QUIZ Data
QUIZ Data
QUIZ Data
Quiz # 2
1. What is difference between predictive and actual value and write the
formula for finding predictive and actual value!
Ans: In statistics, the actual value is the value that is obtained by observation or
by measuring the available data. It is also called the observed value. The predicted
value is the value of the variable predicted based on the regression analysis.
The difference between the actual value or observed value and the predicted
value is called the residual in regression analysis.
We use formula for finding predictive and actual value by using this formula
Sum = .
2. Can Logistic regression used more than two class?
Ans: No, logistic regression is a binary classifier, so it cannot be applied to more
than 2 classes. For multi-class classification, we can use decision tree algorithms,
Naive Bayes algorithms are well suited
3.Write down the loss function and cost function!
Ans: Loss function: is capture the difference between actual and predictive value
in single training record.
Cost function: Used to refer to an average of the loss functions over an entire
training dataset.
4. How do you make sure which Machine Learning Algorithm to use?
Ans: It completely depends on the dataset we have. If the data is discrete, we use
SVM. If the dataset is continuous, we use linear regression. So, there is no specific
way that lets us know which ML algorithm to use, it all depends on the
exploratory data analysis (EDA). EDA is like “interviewing” the dataset; As part of
our interview, we do the following: Classify our variables as continuous,
categorical, and so forth. Summarize our variables using descriptive statistics.
Visualize our variables using charts. Based on the above observations select one
best-fit algorithm for a particular dataset
5. How to Tackle Overfitting and Underfitting?
Ans: Overfitting means the model fitted to training data too well, in this case, we
need to resample the data and estimate the model accuracy using techniques like
k-fold cross-validation. Whereas for the Underfitting case we are not able to
understand or capture the patterns from the data, in this case, we need to change
the algorithms, or we need to feed more data points to the model
6. Is E-mail spam is applicable in Supervised Machine Learning?
Ans: yes, widely used supervised machine learning techniques namely C 4.5
Decision tree classifier, Multilayer Perceptron, Naïve Bayes Classifier are used for
learning the features of spam emails and the model is built by training with
known spam emails and legitimate emails.
7. How will you know which machine learning algorithm to choose for your
classification problem?
Ans : While there is no fixed rule to choose an algorithm for a classification
problem, you can follow these guidelines:
Quiz #3
1. How does Machine Learning Differ from Deep Learning?
Ans: Machine Learning means computers learning from data using algorithms to
perform a task without being explicitly programmed. Deep Learning uses a
complex structure of algorithms modelled on the human brain. This enables the
processing of unstructured data such as documents, images, and text
2. Write down the three steps in hypothesis in Machine Learning model!
• Assume a null hypothesis, usually in machine learning algorithms we
consider that there is no anomaly between the target and independent
variable.
• Collect a sample.
• Calculate test statistics.
3. Describe the precision and Recall!
Ans: Precision and recall are two evaluation metrics used to measure the
performance of a classifier in binary and multiclass classification problems.
Precision measures the accuracy of positive predictions, while recall measures the
completeness of positive predictions
4. Why deep learning need high- end machine?
ANS: Deep learning systems require powerful hardware because they have a large
amount of data being processed and involves several complex mathematical
calculations. Even with such advanced hardware, however, training a neural
network can take weeks. Therefore, we need to use high-end machine.
5. Write down the sentiment analysis as a application of supervised machine
learning!
Ans: sentiment analysis using machine learning can help any business analysis
public opinion, improve customer support, and automate tasks with fast
turnarounds. Not only saving you time, but also money.
6. What is Ensemble Learning?
Ans: Ensemble learning is a method that combines multiple machine learning
models to create more powerful models.
There are many reasons for a model to be different. Few reasons are:
⦁ Different Population
⦁ Different Hypothesis
⦁ Different modeling techniques
When working with the model’s training and testing data, we will experience an
error. This error might be bias, variance, and irreducible error. Now the model
should always have a balance between bias and variance, which we call a bias-
variance trade-off. This ensemble learning is a way to perform this trade-off.
There are many ensemble techniques available but when aggregating multiple
models there are two general methods:
⦁ Bagging, a native method: take the training set and generate new training
sets from it.
⦁ Boosting, a more elegant method: similar to bagging, boosting is used to
optimize the best weighting scheme for a training set.
7. How does you sure the bias is occurring in Machine Learning?
Ans: Labeling bias occurs when your data has incorrect or inconsistent labels or
annotations, leading to misleading or confusing information. Lastly, algorithmic
bias can occur when your data is affected by the assumptions, preferences, or
limitations of the ML algorithm or technique, resulting in unfair or biased
outcomes.
8. How to Handle Outlier values?
• . Set up a filter in your testing tool. Even though this has a little cost,
filtering out outliers is worth it. ...
• Remove or change outliers during post-test analysis. ...
• Change the value of outliers. ...
• Consider the underlying distribution. ...
• Consider the value of mild outliers
9. What are reasons in machine learning model for making a model
unsupervised?
Ans: The main goal of unsupervised learning is to discover hidden and interesting
patterns in unlabelled data. Unlike supervised learning, unsupervised learning
methods cannot be directly applied to a regression or a classification problem as
one has no idea what the values for the output might be.
10. What is meant by local Minima?
Ans: local minimum (plural local minimums or local minima) (mathematics) A
point on a graph (or its associated function) whose value is less than all other
points near it.
Let us take the points in the immediate neighbourhood of x = 1. The points are {0,
2}. The derivative of the function is negative towards the left of x = 1, and is
positive towards the right. Hence x = 1 is the local minima.
Quiz # 4
1. What is Activation Function?
Ans: An Activation Function decides whether a neuron should be activated or
not. This means that it will decide whether the neuron’s input to the network is
operations.
The role of the Activation Function is to derive output from a set of input values
Ans: For a better predictive model, the categorical variable can be considered as
a continuous variable only when the variable is ordinal in nature.