Download as pdf
Download as pdf
You are on page 1of 18
Big Data Questions Latest Big Data MCQ Objective Questions & CE ABR ee lial Start Complete Exam Preparation Cee Rr roe leet oes es Loerie rs jownload App Ex.) Question 1: View this Question Online > In reference to Big data, consider the following database: (A) Memeached (8) Couch DB (Q Infinite graph Choose the most appropriate answer from the options given below: 4. (A) and (B) only 2. (8) and (Q) only, ~’ and (A)only 4, (A), (B) and (C) ‘Answer (Detailed Solution Below) Option 2: (8) and (C) only coaching India’s Super Teachers for all govt. exams Under One Roof BD ooo ae isin Big Data Question 1 Detailed Solution Memcached It is @ high-performance, user-friendly in-memory data Store, It provides a mature, scalable, open- source solution for sub-millisecond resp ds,,which makes it suitable as a cache or session store. Memeached is widely used to poy time;Web, Mobile Apps, Gaming, AdTech, and E- Commerce systems. \ Couch DB The prominent big data analytics tools thet use non-relational databases are MongoDB, Cassandra, Oracle No-SQL, and Apache CouchDB. CouchDB is a NoSQL database for document storage. It provides the ability to store documents with unique names, as well as an API is known as RESTful HTTP API for reading and modifying (edding, editing. and deleting) database documents. Documents are the fundamental unit of data in CouchDB, and they also contain metadata. Infinite graph InfiniteGraph is a distributed graph database developed in Java and C++. It belongs to a class of Not Only SQL" (NOSQL) database systems that emphasize graph data structures. InfiniteGraph is used by developers to discover useful and frequently hidden relationships in highly connected, complex big data sets Hi option 2) is the most appropriate answer eae Rac Teoual Cee eRe ee Start Complete Exam Preparation POTS cscs Dac laa cold Clee imoog Exeter) Vv D> Download App Question 2: View this Question Online > 4, Mapreduce Answer (Detailed Solution Below) Option 2: HDFS Big Data Question 2 Detailed Solution Explanation HDFS is a system that allows you to distribute big data storage across a + Italso keeps redundant copies of data. So, if one of your comp flames or if some technical issues arise, «DFS can recover by creating a backup from a co} wae automatically saved, and you won't even know what happened. 5 YARN: It comes next in the Hadoop ecosystem Where Hadoop's data processing is put ‘esource Negotiator). It is the location + The system that controls, ourceson your computing cluster is called YARN. + It is the one that choos 4s to perform the duties, as well as when, which nodes are open for more work, and whi Nodes are not. Mapreduce: It is another part of the Hadoop ecosystem called MapReduce. + It is essentially a programming model that lets you process data across an entire cluster. + It mainly comprises Mappers and Reducers, which are several scripts or functions that you might write when creating a MapReduce programme. Hadoop: It is a distributed, open-source, multidimensional, scalable NoSQL database. Based on Java, HBase runs on HDFS and gives Hadoop capabilities and functionalities akin to those of Google Bigtable, PDE see Berle) Start Complete Exam Preparation RCC oS Descent y researc gd Px) ake he aces Download App Question 3: View this Question Online > Which of the following statement(s) is/are corfect regarding On-Line Transaction Processing (OLTP)? |. Responses to the user inquiry are immediate. II. The associated cost is economical with efficient utilization of resources. Ill. The databaseiis always up-to-date. zo 2. Land Ill 3. land Il 4. Only Ill Answer (Detailed Solution Below) Option 2: land Ill Big Data Question 3 Detailed Solution Concept: OLTP stands for online transaction processing. Ibifvolves tfansaction-oriented applications. Itis based on query processing and total number of transactions processed in a second. Characteristics of OLTP. ~ + Transaction involves small amount of data. + Response time is very short or user inquiry are immediate It uses fully normalized schema and removes inconsistencies It strictly performs the predefined operations on small number of records. Indexed data ic acceceed eacily using OITD. + The database is always up-to-date. + It supports complex data models. Hence Option 2 is correct Ee seer RSet Start Complete Exam Preparation Ree RRs Gree frees ead oes chen ereseocs Bre Download App Question 4: View this Question Online > The data node and name node in HADOOP are 1. Worker Node and Master Node respectively 2. Master Node and Worker Node respectively »...... 4, Both Master Nodes 3. Be Answer (Detailed Solution Below) Option 1 : Worker Node and Master Node respectively Big Data Question 4 Detailed Solution The correct answer is option 4 © key Points The main difference betweempigmeNode ‘and DataNode in Hadoop is that the NameNode is the master node jn Hadoon 0; Be cmaeeue HI) hei shat eucerei ieee baila carne wictogiaes while the DataNode is a slavel™@tie in Hadoop distributed file system that stores the actual data as instructed by the NameNode § Additional Information + NameNode stores the metadata of all files in HDFS. Metadata includes file permission, names, and location of each block. Namenode maps these blocks to DataNodes. + The deta nodes store and retrieve blocks as instructed by the NameNode. Eee een) > ileal ITS Start Complete Exam Preparation Cli foe ee eas hold Cresta Download App Ram eae Question 5: View this Question Online > Point out the wrong statement : 1. Non-gelational databases require that schemas be defined before you can add data. 2. NoSQL databases are built to allow the insertion of data without a predefined schema. 3. NewSQL databases are built to allow the insertion of data without a predefined schema. 4. All of the options. Answer (Detailed Solution Below) Option 1: Non-Relational databases require that schemas)be defined before you can add data. Big Data Question 5 Detailed Solution The correct answer is /~ © Key Points In fact, one key characteristic of a non-relational (NoSQL) database is that they do not require a Tixed schema betore you start tilling them with data. [his texibility works well with unstructured data and is often used in big data applications 4 cs Tee ane ca) Start Complete Exam Preparation Rem ee ost ) ally tive or aac iets Eran era God ect Download App Question 6 View this Question Online > The data node and name node in HADOOP are 1. Worker Node and Master Node respectively 2. Master Node and Warker Node respectively 3. a Nodes 4. Both Master Nodes Answer (Detailed Solution Below) Option 1 : Worker Node and Master Node respectively Big Data Question 6 Detailed Solution The correct answer is option 1 © Key Points The main difference between NameNode and DataNode in Hadoop is that the NameNode is the master node in Hedoop DistliBred File System (HDFS) that manages the file system metadata while the DataNode Is a sla\ ie in Hadoop distributed file system that stores the actual data as SER ST ge, oa ee ne ee &- Additional Information + NameNode stores the metadata of all files in HDFS, Metadata includes file permission, names, and location of each block. Namenode maps these blocks to DataNodes. + The data nodes store and retrieve blocks as instructed by the NameNode. a CER ABR Reta) See Ld Start Complete Exam Preparation [ rea one) Teac Lose cl ars eet Download App Question 7 Wesinecas Which of the following is component of Hadoop? 1. YARN 2. HDES 3. Map) A 4. All of the options Answer (Detailed Solution Below) Option 4 : All of the options Big Data Question 7 Detailed Solution YARN, HDFS, and Map-reduce are the component of Hadoop © Key Points Hadoop ge Se manage Big Data + It is the most commonly used software to handle Big Data. i i caeeiemmteatmieatiaintaieins:“- siiemiaienal “imemmiaceaaieal :\almimmesiaeniich Coaediiennaciabenienabass There are three components of Hadoop. 1, Hadoop HDFS - Hadoop Distributed File System (HDFS) is the storage unit of Hadoop. 2. Hadoop MapReduce - Hadoop MapReduce is the processing unit of Hadoop. 3. Hadoop YARN - Hadoop YARN is a resource management unit of Hadoop eee see acta ORD eee PS ela meu) CM Clima leche cela) Peay e) Practice Mock Tests MasterCl Question Bank Exec 7) Download App Question 8 View this Question Online > Which of the following statement/s is/are true? (i) Facebook has the world's largest Hadoop cluster. (ji) Hadoop 2.0 allows live stream processing of real time data 1. Neither (i) for Gi) 2 AA. (i 3. () only 4. Gi) only Answer (Detailed Solution Below) Option 2: Both (i) and (ii) Big Data Question 8 Detailed Solution ( Facebook has the worlds largest Hadoop cluster. ‘This statement |s correct. Madoop clusters helps in organizing and analyse the data ina computational environment. It boosts the speed for data analysis. These clusters helps in increasing the throughput performance. They also resistant to any failure and data loss as data backup is maintained on the clusters to support redundaney.-As per the research study 2013, Facebook has become the world’s largest Hadoop cluster and storing more than hundreds of millions of gigabytes. Hadoop provides a common infrastructure for Facebook with efficiency and reliability. Hadoop is empowering this jorking platform in each possible way such as searching, log processing, recommendation to video and image analysis. (i) Hadoop 2.0 allows live stream processing of real time data Given statement is correct. Hadoop 2.0 has many advantages over previous version. These are: + Hadoop 2.0 allows live stream processing of real time data + Its ability to process Terabytes and petabytes of data available in HDFS using MPI. + It enables multi-tenancy support in Hadoop. + It provides horizontal scalability of nemenode. oa Ss rN India’s #1 Learning Platform Pela meri) M Cima lle h iced) Rea es Capo ocd Gees Us} Mastercl Coa iets Download App Peete Question 9 View this Question Online > In reference to Big data, consider the following database: (A) Memcached (B) Couch DB (Q Infinite graph Choose the most appropriate answer from the options given below: 1. (A) end @)only 2 AX. only 3. (Cand (A) only 4. (A), (8) and (C) Answer (Detailed Solution Below) Option 2: (8) and (C) only Big Data Question 9 Detailed Solution Memcached It is @ high-performance, user-friendly in-memory data store. it provides a mature, scalable, open- source solution for sub-millisecond response speeds, which makes it suitable'as a cache or session store. Memcached is widely used to power real-time Web, Mabile Apps, Gaming, AdTech, and E- Commerce systems. Couch DB The prominent big data analytics tools tht Use noftelational databases are MongoDB, Cassandra, Oracle No-SQL, and Apache C: CouchDB is a NoSQL database foR@etument storage. It provides the ability to store documents with unique names, as well as an AP] is@f@wn as RESTful HTTP API for reading and modifying (adding, editing, and deleting) database documents. Documents are the fundamental unit of data in CouchDB, and they also contain metadata. Infinite graph InfiniteGraph is a distributed graph database developed in Java and C++. It belongs to a class of “Not Only SQL" (NOSQL) database systems that emphasize graph data structures. InfiniteGraph is used by developers to discover useful and frequently hidden relationships in highly connected, complex big data sets. Hence, option 2) is the most appropriate answer. eee Sea Start Complete Exam Preparation Gees pee ies oes teeny Cresent Peete Download App ‘Question 10 g “View this Question Online > Which of the following statement(s) is/are correct regarding On-Line Transaction Processing (OLTP)? 1. Responses to the user inquiry ere immediate, Il. The associated cost is economical with efficient utilization of resources. ger: vias: gicesaatasia tas caus Mee ‘7! 2. Land Jil 3. land Il - Only tll Answer (Detailed Solution Below) Option 2: | and Ill Big Data Question 10 Detailed Solution Concept: OLTP stands for online transaction processing. Itifivolves ffansaction-oriented applications. Itis based on query processing and total number of transactions processed in a second. Characteristics of OLTP. ~» + Transaction involves small nt of data. + Response time is very short or user inquiry are immediate + It uses fully normalized schema and removes inconsistencies + It strictly performs the predefined operations on small number of records. + Indexed data is accessed easily using OLTP + The database is always up-to-date. + It supports complex data models. Hence Option 2 is correct Pn eee ene eto Pe Ta meV CMcuimee- yell) Cee ar te ice Raed eh ioc Question Bank Pris B Download App ‘Question 11 Point A... statement : 1, Non-Relational databases require that schemas be defined before you can add data. View this Question Online > 2. NoSQL databases are built to allow the insertion of data without 2 predefined schema. 3. NewSQL databases are built to allow the insertion of data without a predefined schema. 4. All of the options. Answer (Detailed Solution Bolow) Option 1: Non-Relational databases require that schemas be defined before you can add date. Big Data Question 11 Detailed Solution The correct answer is oa oO Key Points In fact, one key characteristic of a non-relational (NoSQL) database is that they do not require a fixed schema before you start filling them with data. This flexibility works well with unstructured data and is often used in big data applications ee India’s #1 Learning Platform Rear Start Complete Exam Preparation aS rea aa Gites Coe aa Ga TT entities antaacaae als os ol Question 12: View this Question Online > Hadoop (a big data tool) works with number of related tools. Choose from the following, the common tools included into Hadoop: 1 mm API and Map reduce 2. Map reduce, Scala and hummer 3. Map reduce, H base and Hive 4, Map reduce, hummer and Heron Answer (Detailed Solution Below) Option 3: Map reduce, H base and Hive Big Data Question 12 Detailed Solution + Hadoop is an open source platform providing highly reliable, scalable) distributed processing of large data sets using simple programming models. + Hadoop provides a reliable shared storage and analysis system. In'this, storage is provided by HDFS (Hadoop distributed file system) and analysis by MapReduce. + HDFS breaks a file into chunks and distributed them across the nodes of the cluster. It stores large amount of data on local disks across a distributed cluster of computers. It is written in java. . + Hadoop work with the tools: Map Reduce, H base, hive. Hbase: It is a distributed, column- oriented database. HBase uses HDFS for its underlying storage and supports both batch style computations using Map Reduce and point queries. Map Reduce: A distributed data processing model and execution environment that runs on large clusters of commodity machines. Hive: Itis a distributed data warehouse. Hive manages data stored in HDFS and provides a query language based on SQL. Lar rors ened Mock Tests resco Cresta Exeter) Download App Question 13: View this Question Online > The data node and name node in HADOOP are 1. Worker Node and Master Node respectively 2. Master Node and Worker Node respectively 3. A». Nodes 4. Both Master Nodes Answer (Detailed Solution Below) Option 1: Worker Node and Master Node respectively Big Data Question 13 Detailed Solution The correct answer is option 1 © Key Points master node in Hadoop Distliblked File System (HDFS) that manages the file system metadata while the DataNode is a slaveim@de in Hadoop distributed file system that stores the actual data as instructed by the NameNode, & Additional Information The main difference betwe Si and DataNode in Hadoop is that the NameNode is the + NameNode stores the metadata of all files in HDFS. Metadata includes file permission, names, and location of each block. Namenode maps these blocks to DataNodes. + The data nodes store and retrieve blocks as instructed by the NameNode. India’s #1 Learning Platform Start Complete Exam Preparation RCC R eosin Dos caper alee ob Pears Question Bank a eer Download App Question 14: View this Question Online > Which of the following is component of Hadoop? 1. YARN 3. Map reduce 4. All of the options Answer (Detailed Solution Below) Option 4: All of the options Big Data Question 14 Detailed Solution YARN, HDFS, and Map-reduce are the component of Hadoop © Key Points Hadoop + Hadoop is a framews + uses distributed storage and parallel processing to store and manage Big Data + It is the most commonly used software to handle Big Data. There are three components of Hadoop. 1, Hadoop HDFS - Hadoop Distributed File System (HDFS) is the storage unit of Hadoop. 2. Hadoop MapReduce - Hadoop MapReduce is the processing unit of Hadoop. 3 Hadoon YARN - Hadoon YARN is a resource manaaement unit of Hadoon Pg wary Te eae oitay cela mee) CM climes lec) Weaucs ara acd ceo iene Recaro mo Download App Question 15: View this Question Online > Which of the following statement/s is/are true? () Facebook has the world’s largest Hadoop cluster. (ji) Hadoop 2.0 allows live stream processing of real time data 1. Neither () nor (iy 2 A. (o) 3. @ only 4. Gi only Answer (Detailed Solution Below) Option 2 : Both (i) and (ii) Big Data Question 15 Detailed Solution (9 Facebook has the worlds largest Hadoop cluster. This statement is correct. Hadoop clusters helps in organizing and analyse the data ina computational environment. it boosts the speed for data analysis. These clusters helps in increasing the throughput performance. They also resistant to any failure end data loss as data backup is maintained on the clusters to support.redundaney."s per the research study 2013, Facebook has become the world's largest Hadoop cluster and storing more than hundreds of millions of gigabytes. Hadoop provides mon infrastructure for Facebook with efficiency and reliability. Hadoop is empowering this, ” jorking platform in each possible way such as searching, log proceceina recommerdati. to video and imace analvcic (i) Hadoop 2.0 allows live stream processing of real time data Given statement is correct. Hadoop 2.0 has many advantages over previous version. These are: + Hadoop 20 allows live stream processing of real time data + Its ability to process Terabytes and petabytes of data available in HDFS using MPI. + It enables multi-tenancy support in Hadoop. + It provides horizontal scalability of namenode.

You might also like