Download as pdf or txt
Download as pdf or txt
You are on page 1of 9

1. What is the default replication factor for data blocks in HDFS?

A) one
B) two
C) three
D) four

2. Which of the following is NOT a type of RAID supported by cloud


storage providers?
A) RAID 1
B) RAID 6
C) RAID 5
D) RAID 0

3. Which component of HDFS stores file metadata and regulates data


access?
A) NameNode
B) Secondary NameNode
C) DataNodes
D) Client
4. Which module of Apache Hadoop provides high-throughput access to
application data?
A) Hadoop YARN
B) HDFS
C) Hadoop Common
D) Hadoop MapReduce

5. Which of the following is a feature of Apache Pig scripts?


A) They are written in Java
B) They define a Directed Acyclic Graph (DAG)
C) They support SQL syntax
D) They simplify MapReduce tasks

6. Which of the following is NOT a component of the YARN architecture


in Hadoop?
A) ResourceManager
B) NodeManager
C) ApplicationMaster
D) JobTracker
7. Which type of offloading shifts part of an application's computation to
the cloud dynamically based on runtime conditions?
A) Serverless computing
B) Dynamic offloading
C) Static offloading
D) Edge computing

8. Which of the following best describes a data lake?


A) It stores raw, unprocessed data in a centralized repository with
flexible storage options.
B) It stores only structured data
C) It is suited for operational reporting
D) It requires schema design before data loading

9. Which of the following is NOT a module in the Apache Hadoop


ecosystem?
A) MapReduce
B) YARN
C) HDFS
D) Apache Spark
10. Which of the following best describes the purpose of Apache Hive?
A) It provides high-throughput access to application data
B) It turns SQL queries into MapReduce programs
C) It processes structured data in large datasets and runs HiveQL
queries
D) It facilitates job scheduling and resource management

11. Which of the following best describes the 'compute over data'
paradigm?
A) It involves dynamic provisioning of resources
B) It facilitates distributed data storage and access
C) It emphasizes security and privacy of data
D) It is a computing paradigm where data processing occurs near the
location of the data

12. Which component of HDFS stores file system metadata?


A) DataNodes
B) JobTracker
C) NameNode
D) Client
13. Which type of NoSQL database is BigTable?
A) Graph
B) Document
C) Wide-row
D) Wide-column and key-value

14. Which of the following best describes ZooKeeper?


A) It processes streaming data
B) It is a SQL-based data warehouse
C) It is a distributed coordination service exposing primitives like
synchronization, configuration, and naming
D) It provides high-throughput storage

15. Which of the following best describes a characteristic of effective


data pipelines?
A) High streaming write performance
B) Scalable in memory processing
C) Supports SQL queries
D) Support for both batch processing and data stream processing
16. Which type of cloud storage allows for automatic growth and
shrinkage of storage capacity based on data additions and removals?
A) Amazon EBS
B) Amazon EFS
C) Network Attached Storage
D) Amazon S3

17. Which of the following is an advantage of cloud storage over


physical storage?
A) It facilitates job scheduling
B) It is cheaper than buying physical storage
C) It supports SQL queries
D) It provides high streaming write performance

18. Which component of YARN is responsible for negotiating resources


with the ResourceManager and tracking status/progress of a single
application?
A) ApplicationMaster
B) Client
C) NodeManager
D) ResourceManager
19. Which of the following best describes edge computing?
A) It involves offline data processing
B) It extends cloud capabilities closer to data sources by enabling
computing at the edge of networks
C) It processes data on centralized cloud servers
D) It emphasizes data security and privacy

20. Which of the following is an advantage of using Apache Spark over


MapReduce?
A) It provides high-throughput storage
B) It supports interactive queries and stream processing in addition to
batch jobs
C) It is more fault tolerant
D) It simplifies programming model

21. Which of the following best describes static offloading in mobile


cloud computing?
A) It emphasizes privacy and security
B) Decisions on offloading are made dynamically at runtime
C) Specific parts of a mobile app are always executed on the cloud
D) It extends cloud computing to network edges
22. Which of the following best describes the purpose of ZooKeeper?
A) It provides data storage services
B) It exposes primitives for synchronization, configuration maintenance,
and groups/naming
C) It schedules jobs on Hadoop clusters
D) It processes streaming data inputs

23. Which component of HDFS is responsible for storing and serving


data blocks?
A) Client
B) DataNodes
C) Secondary NameNode
D) NameNode
Answer Sheet
1. C
2. D
3. A
4. B
5. B
6. D
7. B
8. A
9. D
10. C
11. D
12. C
13. D
14. C
15. D
16. B
17. B
18. A
19. B
20. B
21. C
22. B
23. B

You might also like