Professional Documents
Culture Documents
Model - 2 QP Format - V Year BDA
Model - 2 QP Format - V Year BDA
Model - 2 QP Format - V Year BDA
Reg.No: 5 1 2 2
Answer ALL questions
PART – A (10 X 2 = 20 Marks)
1. What are the challenges of conventional system
2. Discuss the types of data analytics
3. Can you Pick K in a K-Means Algorithm?
4. Define Bayes Theorem
5. Define apriori algorithm
6. What is Prune
7. How are moments estimated?
8. Analyze the term filtering a data stream.
9. Summarize the features of Hive
10. What is NoSQL database
PART – B (5 X 13 = 65 Marks)
11(a) i. What are the best practices in Big Data analytics? (13)
ii. Explain the techniques used in Big Data Analytics.
(Or)
11(b) I. Generalize the list of tools related to Hadoop. (13)
ii. How does Hadoop work?
12(a) i. Given a one dimensional dataset {1, 5, 8, 10, 2} use the (13)
agglomerative clustering algorithms with the complete link with
Euclidean distance to establish a hierarchical grouping relationship.
By using the maximal lifetime as the cutting
threshold, how many clusters are there? What is their
membership in each cluster?
(Or)
12(b) I. Describe about Market-Basket model. (13)
13(a) (13)
I. Explain the apriori algorithm for mining frequent item sets with an
example.
14(a) (13)
i. List some common online tools used to perform sentiment
analysis.(6)
ii. What do you understand by sentiment analysis?(7)
(Or)
14(b) I. Describe about Stream clustering and parallel clustering. (13)
(Or)
15(b) (13)
i. What is the purpose of sharding?
ii. Explain the process of sharding in MongoDB.