Download as pdf or txt
Download as pdf or txt
You are on page 1of 2

1.

Last year, 10 randomly selected students took a placement test before they began their technical
interview, details are given in the below table. The placement cell has two questions.

test (xi) 95 85 80 70 82 75 69 81 84 60
interview (yi) 85 95 70 65 89 79 79 89 72 70

a) What linear regression equation best predicts interview performance, based on written test
scores?
b) If a student made an 89 on the test, what grade would we expect her/his to make the score in an
interview?

c) Find the mean square error .

2).Implement Decision tree classifier for breast cancer Wisconsin


dataset (load_breast_cancer) and evaluate the algorithm with precision, recall
sensitivity and specificity. Use 75% of samples as training data size.

https://www.kaggle.com/uciml/breast-cancer-wisconsin-data

3.. Breast cancer dataset (load_breast_cancer) summary is:

Classes 2

Samples per class 212(M),357(B)

Samples total 569

Dimensionality 30

Features real, positive

Ten real-valued features are computed for each cell nucleus:

a) radius (mean of distances from center to points on the perimeter)


b) texture (standard deviation of gray-scale values)
c) perimeter
d) area
e) smoothness (local variation in radius lengths)
f) compactness (perimeter^2 / area - 1.0)
g) concavity (severity of concave portions of the contour)
h) concave points (number of concave portions of the contour)
i) symmetry
j) fractal dimension ("coastline approximation" - 1)

Dataset link:

https://www.kaggle.com/uciml/breast-cancer-wisconsin-data

Implement k-NN classifier and evaluate the algorithm with simple accuracy, precision,
and recall. Use 72% of samples as training data size.

4. Write a program to demonstrate the working of the simple linear regression. Use the data set
given below link for building the best line fit and compute SSE to classify a new sample.

https://www.kaggle.com/jemishdonda/headbrain

5. Apply naïve bayseian classification for the following dataset. And evaluate confusion matrix,
accuracy, precision, recall, f1-score.

https://www.kaggle.com/usman27/buy-computer/version/1

You might also like