Machine Learning II Mid Term

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 3

Engineering College Jhalawar

Computer Science and Engineering


B. Tech Third Year, VI Semester
Subject:- 6CS4-02: Machine Learning
Maximum Marks: 50 Date: 15/04/2020 Time: 10:00 AM - 11:30 AM

Instructions:
* All the questions are compulsory
* Q.1, Q. 2 & Q. 4 carries 12 marks (each), Q. 3 carries 10 marks and Q. 5 carries 04
Marks
* Submit your Answer sheet in PDF format. PDF name should be “CollegeID YourName”
* If distance measure is not given in question then use ‘Euclidean Distance’

Q.1. Answer any THREE of the following [12 Marks]

I. The following training dataset is given to you:

Example Instances Classification feature 1 feature 2


1 + CSE CSE
2 + CSE CSE
3 - CSE IT
4 + IT IT
5 - IT CSE
6 - IT CSE

Answer the following:

i. Find out the entropy of this group of example instances with respect to the classi-
fication?

ii. Find out the information gain of feature 2 relative to these example instances?
II. In estimating the price of a used car, rather than estimating the absolute price it makes
more sense to estimate the percent depreciation (reduction in the value of an asset over
time) over the original price. Propose an predictive estimation model for estimating
price of a use car.
III. Why a linear regression is sensitive to outliers in case we fit the line considering
Mean Squared Error (MSE)?
IV. Among Naive Bayes and K-NN classifier which one is Lazy and why?

Q.2. You have been given a collection of 08 vectors or points: X1 =[0.1,0.6], X2 = [0.15,
0.71], X3 = [0.08, 0.9], X4 = [0.16, 0.85], X5 = [0.2, 0.3], X6 = [0.25, 0.5], X7 =
[0.24, 0.1], X8 = [0.3, 0.2]. Assume that we are performing k - means clustering of these
points with initial centroid’s or means C1 = X1 and C2 = X8 [12 Marks]
Answer the following questions
I. Which Cluster does X6 belongs to? (Correspondingly the C1 belongs to Cluster 1 and
C2 belongs to Cluster 2)
II. What is the population of the cluster C1 ?
III. What is the updated value of C1
IV. What is the updated value of C2

Q.3. Consider, yo have given 04 places in a town. The following matrix define the distance be-
tween in places (Use agglomerative clustering) [10 Marks]

Lalbagh Village Chandloi ECJ Purvaj Restaurant


Lalbagh 0 10 12 20
Village Chandloi 10 0 5 3
ECJ 12 5 0 4
Purvaj Restaurant 20 3 4 0

Construct the dendrogram using the following:

1. Single linkage clustering


2. Complete linkage clustering

Q.4. The transaction data-set is given to you as follows. You have to generate the association rules
using an apriori algorithm. Consider the values of Support = 50% and Confidence =
75% [12 Marks]

Transaction ID Items (Books) Purchased


1 ML, AI, CC, ISS
2 ML, AI, ISS
3 ML, DIP, CAO
4 ML, ISS, DIP
5 AI, ISS, DIP

Q.5. Answer any TWO of the following [04 Marks]

I. Draw a SVM plots which shows the Margin, Support Vectors, Hyperplane, Mis-
classified data points for below two cases:
i. Linealy Seperabale Data
ii. Linearly Inseparable Data
II. Which of the following offsets, do we use in case of least square linear regression fit? Sup-
pose the horizontal axis is the independent variable and the vertical axis is the dependent
variable.
III. The values on the graph show the residuals for each prediction value. Compute the
Sum of Squared Error (SSE) with the given information.

********************************** ALL THE BEST**************************************

You might also like