CS3352 iat qb

CS3352 – FOUNDATIONS OF DATA SCIENCE

INTERNAL ASSESSMENT TEST – I

QUESTION BANK

PART – A

1. What is data science?
2. List the benefits and uses of data science.
3. Define statistics and its types.
4. Differentiate between Data Mining and Data Warehousing.
5. Define Percentile Rank.
6. What is a z-score?
7. What are degrees of freedom (df)?
8. Differentiate between Descriptive and Inferential Statistics.
9. Define Interquartile Range (IQR).
10. What is an outlier?
11. What are the types of Frequency Distribution?
12. Determine the range for the following sets of data.
13. What are degrees of freedom (df)?
14. Indicate whether the following statements suggest a positive or negative relationship:
(a) More densely populated areas have higher crime rates.
(b) Schoolchildren who often watch TV perform more poorly on academic achievement
tests.
(c) Heavier automobiles yield poorer gas mileage.
(d) Better-educated people have higher incomes.
15. Define Correlation Coefficient.

PART – B

1. Briefly explain the steps in the Data Science process with a diagram.
2. Briefly explain the architecture of Data Warehousing.
3. How do you perform Data Preparation, Data Cleaning, Data Modelling, and Presentation in the Data
Science Life Cycle?
4. Explain briefly the KDD steps in Data Mining.
5. Explain briefly the facets of data.
6. Illustrate the graphs for quantitative data: Histogram, Frequency Polygon, Stem and Leaf, Scatterplot,
and Boxplot.
7. Find the mean, median, and mode. (Refer example in Textbook and Note)
8. Illustrate the different types of Frequency Distribution.
9. Calculate the sum of squares (SS), the population standard deviation (σ), and the sample standard
deviation (s) for the scores. (Refer example in Textbook and Note)
10. Calculate the correlation coefficient. (Refer example in Textbook and Note)
11. Using the standard normal table, find the proportion of the total area identified with the following statements:
(a) above a z score of 1.80
(b) between the mean and a z score of -0.43
(c) below a z score of -3.70
(d) between the mean and a z score of 1.65
12. Construct the frequency table and draw the bar graph and stem and leaf display for the following data:
(Refer example in Textbook and Note)
13. Discuss the approaches for combining different tables.
14. Describe the approaches for data exploration with suitable examples. (Refer example in
Textbook and Note)
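
For self-checking the Part B question on areas under the normal curve (above z = 1.80, between the mean and z = -0.43, below z = -3.70, between the mean and z = 1.65), the table lookups can be reproduced numerically. This is only a minimal sketch, not part of the prescribed method: it assumes the table in question is the standard normal table and uses `NormalDist` from Python's standard `statistics` module. The helper names (`area_above`, `area_below`, `area_mean_to`) are made up for illustration.

```python
from statistics import NormalDist

# Standard normal distribution: mean 0, standard deviation 1 (assumed to match the z table).
std_normal = NormalDist()

def area_above(z):
    """Proportion of the total area to the right of z."""
    return 1.0 - std_normal.cdf(z)

def area_below(z):
    """Proportion of the total area to the left of z."""
    return std_normal.cdf(z)

def area_mean_to(z):
    """Proportion of the total area between the mean (z = 0) and z."""
    return abs(std_normal.cdf(z) - 0.5)

# The four cases listed in the question.
answers = {
    "above z = 1.80": area_above(1.80),
    "mean to z = -0.43": area_mean_to(-0.43),
    "below z = -3.70": area_below(-3.70),
    "mean to z = 1.65": area_mean_to(1.65),
}

for label, value in answers.items():
    print(f"{label}: {value:.4f}")
```

The printed values are rounded to four decimal places, the precision typical of printed z tables, so they can be compared directly with the table lookup. In the exam itself, the answers should still be read from the table and the working shown.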