Download as doc, pdf, or txt
Download as doc, pdf, or txt
You are on page 1of 2

King Saud University College of Computer and Information Sciences Department of Information Technology IT434 Data Mining and

Data Warehouse 1433/1434 H Lab 2 First Semester

Please solve and submit your answers for the following questions:
1) Suppose that data about number of people and their ages are available. The values for are grouped into

intervals. The intervals and corresponding frequencies are as follows. --------------------------------Age frequency --------------------0-10 500 10-20 300 20-30 200 30-40 1100 40-50 800 50-60 100 -------------------------------a) Compute an approximate median value for the data. b) Draw the histogram for the data. 2) Suppose that the data for analysis includes the attribute grade. The grade values for the data tuples are: 4, 5, 9, 11, 12, 13, 13, 13, 13, 14, 15, 15, 16, 17, 18, 18, 19, 20 (a) What is the mean of the data? What is the median? (b) What is the mode of the data? Comment on the data's modality (i.e., bimodal, trimodal, etc.). (c) What is the midrange of the data? (d) Can you find (roughly) the first quartile (Q1) and the third quartile (Q3) of the data? (e) Give the five-number summary of the data. (f) Show a boxplot of the data. (g) Draw the scatter plot for the data. 3) Suppose that the data for analysis includes the attribute age. The age values for the data tuples are 13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25, 30, 33, 33, 35, 35, 35, 35, 36, 40, 45,46, 52, 70. (a) What is the mean of the data? What is the median? (b) What is the mode of the data? Comment on the data's modality (i.e., bimodal, trimodal, etc.). (c) What is the midrange of the data? (d) Can you find (roughly) the first quartile (Q1) and the third quartile (Q3) of the data? (e) Give the five-number summary of the data. (f) Show a boxplot of the data. (g) Draw the scatter plot for the data

4) Suppose a manager tested the age and brain score data for 18 randomly selected adults with the following result Age 23 Score 9.5 Age 52 Score 34.6 23 26.5 54 42.5 27 7.8 54 28.8 27 17.8 56 33.4 39 31.4 57 30.2 41 25.9 58 34.1 47 27.4 58 32.9 49 27.2 60 41.2 50 31.2 61 35.7

(a) Calculate the mean, median and standard deviation of age and score. (b) Draw the box-plots for age and score. (c) Draw a scatter plot and a q-q plot based on these two variables.

You might also like