Professional Documents
Culture Documents
DS Prelim QP
DS Prelim QP
Instructions:
1. All Questions are compulsory
2. Figures to right indicates CO, RBT Level and Marks
3. The below-given data is a hypothetical dataset of transactions, each letter representing an item
Transaction ID Items
T1 E,K,M,N,O,Y
T2 D,E,K,N,O,Y
T3 A,E,K,M
T4 C,K,M,U,Y
T5 C,E,I,K,O,O
5. What is TFIDF? Calculate TFIDF for each word of all the documents mentioned in the example
below CO2 , L4 [8]
CO6 :Design and implement Big Databases using the Hadoop ecosystem
6. Explain HDFS and Mapreduce wrt Hadoop architecture. Explain with wordcount example.CO6 , L3
[12]
Answer Key:
Page 2