Big Data 2018

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 6

Sp 1

(6 pages) Reg. No. : ........................................

Code No. : 9186 Sub. Code : PCSE 32

M.Sc. (CBCS) DEGREE EXAMINATION,


NOVEMBER 2018.

Third Semester

Computer Science

Elective — BIG DATA ANALYTICS

(For those who joined in July 2017 onwards)

Time : Three hours Maximum : 75 marks

PART A — (10  1 = 10 marks)

Answer ALL questions.

1. The —————— architecture provides a general


purpose approach to implementing an arbitary
function on an arbitary dataset and having the
function return its results with low latency

(a) Lambda (b) DBMS

(c) Database (d) None


Sp 1

2. —————— is the ability to maintain in the face


of increasing data or load by adding resources to
the system
(a) Generalization (b) Scalability
(c) Extensibility (d) None

3. The requiste “support for parallel processing” is


for operators ——————
(a) Write (b) Read
(c) Both (a) and (b) (d) None

4. Say true of false:


In distributed file system they spread their
storage across a cluster of computers”
(a) True (b) False

5. —————— are tightly integrated to form a


framework for storing and processing large
amounts of data
(a) Hadoop HDFS (b) Map reduce
(c) Both (a) and (b) (d) None

6. —————— is a thin abstraction over files and


folders from the dfs —————— datastores
library
(a) Object (b) Pail
(c) Both (a) and (b) (d) None

Page 2 Code No. : 9186


Sp 1

7. The —————— view is mapping from a domain


to the number of visits and the number of bounces
for that domain
(a) Interface analysis
(b) Hybrid Analysis
(c) Bounce-rate analysis
(d) None
8. Elephant DB as an example of a ——————
database
(a) Batch layer (b) Speed layer
(c) Serving layer (d) None
9. —————— to support incremental algorithms, it
must also be possible to modify a real time view
with low latency
(a) Scalability (b) Fault tolerance
(c) Random reads (d) Random writes
10. —————— provides highly reliable distributed
coordination
(a) Apache zookeeper
(b) Nimbus
(c) Both (a) and (b)
(d) None

Page 3 Code No. : 9186


Sp 1

PART B — (5  5 = 25 marks)

Answer ALL questions choosing either (a) or (b).

Each answer should not exceed 250 words.

11. (a) What are the desired properties of a Big-data


system?

Or

(b) What are the limitations of serialization


framework?

12. (a) What is vertical partitioning?

Or

(b) How does distributed file system work?

13. (a) What is scalability in the batch layer?

Or

(b) What is fault tolerance?

14. (a) What are the serving layer solutions


available for normalization and
denormalization?

Or

(b) What is uniques over time?

Page 4 Code No. : 9186


[P.T.O.]
Sp 1

15. (a) What is query layer?

Or
(b) Differentiate Asynchronous versus
synchronous updates.

PART C — (5  8 = 40 marks)

Answer ALL questions choosing either (a) or (b).

Each answer should not exceed 600 words.

16. (a) What are the different layers available in


lambda Architecture?

Or
(b) Give a brief note on ‘Apache Thrift’.

17. (a) How do you choose storage solution for the


batch layer?

Or
(b) Explain low-level nature of distributed file
systems.

18. (a) Write short note on storing the master


dataset for superweb analysis .com.

Or
(b) Give a brief note on map reduce: a paradigm
for big data computing.

Page 5 Code No. : 9186


Sp 1

19. (a) How do you design a serving rayer for super


web analysis .com?

Or
(b) List out and explain the basics for elephant
DB.

20. (a) Write short note on Batch and serving


layers.

Or
(b) How do define topologies with apache storm?
———————

Page 6 Code No. : 9186

You might also like