Important Questions-Bigdata
File.txt
This is a hadoop class
hadoop is a bigdata technology
37. Consider the student data file (st.txt), with data in the following format: Name, District, Age,
Gender. [Unit 5 -10M]
i. Write a PIG Script to display Names of all male students.
ii. Write a PIG Script to find the number of students from Ghaziabad district.
iii. Write a PIG Script to display district wise count of all female students.
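The three sub-questions in Q37 come down to a filter, a filtered count, and a grouped count. The question asks for Pig scripts; the following is only a minimal Python sketch of the same logic, assuming comma-separated fields and the gender values "Male" and "Female". The sample rows and function names are hypothetical and are not taken from st.txt.

```python
from collections import Counter

# Hypothetical sample rows in the assumed format: Name,District,Age,Gender
SAMPLE = """\
Amit,Ghaziabad,21,Male
Priya,Ghaziabad,20,Female
Rohan,Meerut,22,Male
Sneha,Meerut,19,Female
Kavya,Ghaziabad,23,Female
"""

def parse(text):
    """Split each non-empty line into a (name, district, age, gender) tuple."""
    rows = []
    for line in text.splitlines():
        if not line.strip():
            continue
        name, district, age, gender = [field.strip() for field in line.split(",")]
        rows.append((name, district, int(age), gender))
    return rows

def male_names(rows):
    # (i) keep only male students, then project the name field
    return [name for name, _, _, gender in rows if gender == "Male"]

def count_from_district(rows, district):
    # (ii) keep only rows from the given district, then count them
    return sum(1 for _, d, _, _ in rows if d == district)

def female_count_by_district(rows):
    # (iii) keep only female students, group by district, count each group
    return dict(Counter(d for _, d, _, gender in rows if gender == "Female"))

students = parse(SAMPLE)
print(male_names(students))
print(count_from_district(students, "Ghaziabad"))
print(female_count_by_district(students))
```

In Pig Latin the same steps would typically be expressed with LOAD, FILTER, GROUP, and COUNT.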
38. Using the information given below, analyse the Twitter data and explain the steps involved in
finding how many tweets are created by a user, using Pig Latin. [Unit 5- 10M]
User table
UId Name
1 Ram
2 Rahul
3 Rita
4 Neha
Tweet table
UId Tweet
1 Google…
2 Politics…
3 Olympiad
1 Politics
2 Tennis…
4 Google…
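The steps for Q38 (load both tables, join them on UId, group by user, count the tweets in each group) can be sketched in Python using the rows from the two tables above. This is an illustrative sketch rather than a Pig Latin answer; the function name is hypothetical, and the truncated tweet texts are kept exactly as given.

```python
from collections import Counter

# User table: UId -> Name
users = {1: "Ram", 2: "Rahul", 3: "Rita", 4: "Neha"}

# Tweet table: (UId, Tweet), texts kept as they appear in the question
tweets = [
    (1, "Google…"),
    (2, "Politics…"),
    (3, "Olympiad"),
    (1, "Politics"),
    (2, "Tennis…"),
    (4, "Google…"),
]

def tweets_per_user(users, tweets):
    # Group the tweets by UId and count the size of each group
    counts = Counter(uid for uid, _ in tweets)
    # Inner join with the user table on UId to attach the user name
    return {users[uid]: counts[uid] for uid in users if uid in counts}

print(tweets_per_user(users, tweets))
```

For the data given, Ram and Rahul have two tweets each, and Rita and Neha have one each.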
39. Define HBase. Explain the HBase architectural components with a suitable diagram. [Unit
5-10M]
40. Define Hive. Design and explain the detailed architecture of Hive. [Unit 5-10M]
41. Explore various execution modes of PIG. [Unit 5- 5M]
42. Write in detail about the HBase data model and the Pig data model. [Unit 5-5M]
43. Differentiate between HBase and RDBMS. [Unit 5 -5M]
44. Differentiate between Hive and RDBMS. [Unit 5 -5M]
45. Differentiate between Pig and RDBMS. [Unit 5 -5M]
46. Differentiate between HBase, Hive, Pig, and RDBMS. [Unit 5 -5M]
47. Write down the Hive queries for natural join and outer join. Give examples. [Unit 5 -5M]
48. Provide an overview of the HBase data model. [Unit 5 -5M]
49. Why is Hive preferred over Pig Latin? [Unit 5- 2M]
50. How are the date and time data types used in Hive? [Unit 5- 2M]
51. Mention the usage of Grunt. [Unit 5- 2M]
52. Discuss the queries involved in Hive definition. [Unit 5- 10M]
53. Explore various execution modes of Pig. [Unit 5- 10M]
54. Design and explain the detailed architecture of Hive. [Unit 5- 10M]
55. Differentiate between MapReduce, Pig, and Hive. [Unit 5- 10M]
56. Compare and contrast NoSQL and relational databases. [Unit 4- 2M]
57. Does MongoDB support ACID properties? Justify your answer. [Unit 4-2M]
59. With the help of a suitable example, explain how CRUD operations are performed in MongoDB.
[Unit 4- 10M]
60. Classify and detail the different types of NoSQL databases. [Unit 4- 10M]
61. Summarize the role of indexing in MongoDB using an example. [Unit 4 -10M]
62. Discuss the limitations of MapReduce. Explain how Apache Spark works to
overcome these limitations. [Unit 4-10M]
63. Explain the types of inheritance supported in Scala. [Unit 4-10M]
64. Is multiple inheritance implemented in Scala? Justify your answer. If not, explain with
an example how multiple inheritance can be achieved in Scala. [Unit 4-10M]
65. Explain HDFS Federation with a neat diagram. [Unit 4-10M]
66. Explain in brief the working architecture of Spark. Assume a Spark cluster of 10 nodes,
each with 16 cores and 64 GB RAM, and 5 cores per executor. Find:
[Unit 4-10M]
i. Total number of cores in the cluster
ii. Number of executors
iii. Number of executors per node
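The arithmetic in Q66 can be checked with a short Python sketch. The function and its two reservation parameters are hypothetical. A commonly used sizing convention, which the question does not state and which is treated here as an assumption, sets aside one core per node for the operating system and Hadoop daemons and one executor for the YARN application master. Both variants are shown.

```python
def size_executors(nodes, cores_per_node, cores_per_executor,
                   reserved_cores_per_node=0, reserved_executors=0):
    """Return (total cores, number of executors, executors per node)."""
    # (i) total cores is simply nodes times cores per node
    total_cores = nodes * cores_per_node
    # Cores left on each node after the optional reservation (an assumption)
    usable_cores = cores_per_node - reserved_cores_per_node
    # (iii) whole executors that fit on one node
    executors_per_node = usable_cores // cores_per_executor
    # (ii) executors across the cluster, minus any reserved (an assumption)
    executors = nodes * executors_per_node - reserved_executors
    return total_cores, executors, executors_per_node

# Using only the figures given in the question
print(size_executors(10, 16, 5))  # (160, 30, 3)

# With the assumed reservations: 1 core per node, 1 executor for the application master
print(size_executors(10, 16, 5, reserved_cores_per_node=1, reserved_executors=1))  # (160, 29, 3)
```

Either way the cluster has 160 cores and 3 executors per node; the total number of executors is 30 without the reservation and 29 with it.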