Professional Documents
Culture Documents
Bahria University, Islamabad Campus: Department of Computer Science
Bahria University, Islamabad Campus: Department of Computer Science
Bahria University, Islamabad Campus: Department of Computer Science
INSTRUCTIONS:
QUESTIONS:
2. Find certain names which are more prevalent in certain US locati ons? Why
is it that we can’t write an SQL query for this business questi on? How it helps for
the case of Data Mining? (Answer in your own words) [5
Points]
3. Give an overview of the Knowledge discovery process (in your own words)?
What is the signifi cance of diff erent stages? [5 Points]
Page 1 of 3
Enrollment Number: ____________________________
4. What is market basket data? What kind of analysis is done with the market
basket data? [5 Points]
5. A new coach has been working with the Long Jump team this month, and the
athletes' performance has changed. Augustus can now jump 0.15m further, June
and Carol can jump 0.06m further.
Augustus: +0.15m
Tom: +0.11m
June: +0.06m
Carol: +0.06m
Tom: +0.14m
Bob: +0.12m
Sam: +1.56m
How would you work with the above situati on, i.e. as a Data Scienti st do you
noti ce anything unusual, if yes, what would you do about it? [5
Points]
Hint: Computi ng the fi ve-number summary could be the fi rst step. You should
consider fi nding an outlier, if any?
6. Consider the following grouped data. You are required to compute the
Median? [5 Points]
Page 2 of 3
Enrollment Number: ____________________________
7. Consider the following dataset: 33, 25, 26, 36, 19, 30, 40, 51, 42, 32, 35, 35, 35, 45, 20, 23,
13, 15, 25, 25, 25, 26, 40, 25, 26, 22, 10. Answer the following:
a. What could be the pre-processing step for this data and why? [5 Points]
Good Luck 😊
Page 3 of 3