Download as doc, pdf, or txt
Download as doc, pdf, or txt
You are on page 1of 10

Course Code: 20ITEL801

Course Name: Free and Open Source Software Tools


Year / Sem : IV/VIII
Specify any Data book / Sheet / Graph / Plots / has to be provided

UNIT - I

Unit - I / Part - A / 2 Marks


Mark
S.No Questions Split K – Level CO
up
1. List few Big Data Applications 2 K1 CO1
2. State the data types of Big Data. 2 K1 CO1
3. Define Big Data. 2 K1 CO1
4. Summarize web analytics. 2 K1 CO1
5. List the risks of Big Data 2 K1 CO1
6. What are the five V’s of Big Data? 2 K1 CO1
7. List the promotion values of big data. 2 K1 CO1
8. Write down the characteristics of Big Data Applications. 2 K1 CO1
2 K1 CO1
9. List the main features of a Big Data Analytics.
10. Is big data good for marketers? Why? 2 K1 CO1
2 K1 CO1
11. Why Big data analytics technologies are necessary?
What is the recommended best practice for managing big data 2 K1 CO1
12. analytics programs?
2 K1 CO1
13. What is Unstructured data?
2 K1 CO1
14. Why is unstructured text data important in decision making?
2 K1 CO1
15. What are some common types of unstructured data?

Unit - I / Part - B / 13 Marks


Marks
S.No Questions K – Level CO
Splitup
1. Define Big Data and the Vs of Big Data. 13 K1 CO1
i) State about the challenges of conventional system. 7 K1 CO1
2. ii) List big data use cases in detail. 6
(i) State in detail about the characteristics of Big Data Analytics. 7 K1 CO1
5. (ii) Outline about Big Data Use cases. 6
(i) Describe the Evolution of Big Data. 7 K1 CO1
6. (ii) How big data differs from relational database. 6
(i)Describe Big Data in Algorithmic Trading. 7 K1 CO1
7. (ii) Describe bi versus data science, 6
13 K1 CO1
8. Discuss best practices for big data analytics.
9. (i)Discuss drivers of big data. 7 K1 CO1
(ii)List the main features of a big data in detail. 6
13 K1 CO1
10. Define Big Data and explain applications of Big Data:
(i)How can Big Data add value in Marketing? 7 K1 CO1
11. (ii) List the risks of Big Data 6
(i) How can Big Data add value in business? 7 K1 CO1
12. (ii)What is Algorithmic Trading? 6
What Is Big Data and how can Big Data add value in 13 K1 CO1
13. Healthcare?
13 K1
14. Describe advertising and Big Data. CO1
13 K1
15. Discuss Big Data Technologies. CO1

Unit - I / Part - C / 15 Marks


Marks
K–
S.No Questions Split- CO
Level
up
(i)Discuss various challenges of Conventional Systems. 8 K1 CO1
1. (ii) List the characteristics of Big Data Applications. 7
Discuss emerging big data ecosystem and new approach to 15 K1 CO1
2. Analytics.
Discuss characteristics of Big Data Applications and various 15 K1 CO1
3. challenges.
D Describe Web Analytics and how does web analytics collect 15 K1 CO1
4. data?
Outline the steps to be followed to deploy a Big Data solution. 15 K1 CO1
5.

UNIT - II

Unit - II / Part - A / 2 Marks


Mark
K–
S.No Questions Split CO
Level
up
1. Define Hadoop 2 K2 CO2
2 K2 CO2
2. What is meant by Streaming in hadoop
2 K2 CO2
3. Summarize Scaling Out in Hadoop
2 K2 CO2
4. Define flume
2 K2 CO2
5. Define Scoop
2 K2 CO2
6. What is Compression in Hadoop
2 K2 CO2
7. What is an Avro. Write the uses of Avro
2 K2 CO2
8. How file structure maintain in Hadoop
2 K2 CO2
9. Draw the Structure of Command Line Interface
2 K2 CO2
10. What is data Digest in Hadoop
2 K2 CO2
11. What are the Components of Hadoop
2 K2 CO2
12. Write the Hadoop Archives Operations
2 K2 CO2
13. Give two Example Commands used in Hadoop
2 K2 CO2
14. Write the Hadoop Command for creating of file
2 K2 CO2
15. What are the different version of Hadoop

Unit - II / Part - B/ 13 Marks


Marks K –
S.No Questions CO
Splitup Level
1. Explain briefly about Hadoop Distributed File System 13 K2 CO2
How To Analyze the Data with Hadoop- 7 K2 CO2
i)Scaling Out 6
2. ii)Hadoop Streaming
13 K2 CO2
Discuss about Components of Hadoop and Hadoop Archives
3.
13 K2 CO2
4. Explain and Draw different design approaches in Hadoop
13 K2 CO2
5. Explain briefly about Data Ingest with Flume and Scoop
13 K2 CO2
6. Write the functions of Hadoop I/O- Compression
Discuss about Hadoop I/O- Compression- Serialization- 13 K2 CO2
7. Avro and File-Based Data structures.
13 K2 CO2
8. Explain briefly about Serialization
Outline the steps to be followed by the File Based Data 13 K2 CO2
9. Structure
13 K2 CO2
10. How you going to analyse the Data with Hadoop

Unit - II / Part - C / 15 Marks


Marks K –
S.No Questions CO
Splitup Level
15 K2 CO2
1. Write the Application of Hadoop file System
8 K2 CO2
i).Highlight the Features of Hadoop 7
2. ii)Functionalities of Hadoop Cluster
i)Write a note on data integrity 8 K2 CO2
ii)Explain Briefly about Hadoop input and output Opeartion 7
3.
4. Generalize the list of tool Related to Hadoop and How Does 15 K2 CO2
Hadoop work
15 K2 CO2
5. Define HDFS.Explain Hadoop in detail

UNIT - III

Unit - III / Part - A / 2 Marks


Mark K –
S.No Questions Split Leve CO
up l
1. Define Map – Reduce. 2 K1 CO3
2. List the advantages of Map – Reduce. 2 K2 CO3
3. What are the features of Hadoop? 2 K1 CO3
4. What are the features of Map Reduce? 2 K1 CO3
5. What are the components of Map-Reduce Architecture? 2 K1 CO3
6. List the Phases of Map-Reduce. 2 K2 CO3
7. What is the use of Org.apcahe.hadoop.io.package? 2 K2 CO3
8. List some of the practical applications of Hadoop Map-Reduce. 2 K2 CO3
2 K1 CO3
9. What are all the benefits of Hadoop MapReduce?
10. State one Map-Side tuning property and describe it. 2 K1 CO3
2 K1 CO3
11. Define Hadoop Streaming.
2 K1 CO3
12. State the General form of Map and Reduce functions.
2 K1 CO3
13. What is YARN?
2 K1 CO3
14. Outline Apache Hadoop Yarn Architecture.
2 K2 CO3
15. List 5 steps in submitting an application in YARN.
2 K1 CO3
16. What are the three main components of YARN?
2 K1 CO3
17. What are the responsibilities of NODE Manager in YARN?
2 K1 CO3
18. Define Streaming and Pipes.
2 K2 CO3
19. Compare Hadoop environment with and without YARN
What are the runtime failure modes to be considered in classic 2 K1 CO3
20. Mapreduce?

Unit - III / Part - B / 13 Marks


Marks K –
S.No Questions CO
Splitup Level
1. (a) List the main feature of Map Reduce. 5 K1 CO3
(b) Explain working of the following phases of Map Reduce 8
with one common example
(i) Map Phase
(ii) Shuffle and sort phase
(iii) Reducer Phase
13 K1 CO3
2. Explain about failures in Classic Map Reduce and YARN.
Discuss the following terms K1 CO3
a. Streaming information access. 4
b. Low latency information access. 3
c. Rest and thrift 3
3. d. Org.apcahe.hadoop.io.package 3
Interpret Job Scheduling, task execution in YARN. 7 K1 CO3
4. 6
Explain various types of Input and Output Formats in Map 7 K2 CO3
5. ReduceTechniques 6
13 K1 CO3
6. Discuss about the MapReduce Types and formats in detail.
Explain in details about the map reduce features. 7 K1 CO3
7. 6
What is YARN? Discuss about YARN job scheduling types in 13 K1 CO3
8. detail.
What is YARN in Hadoop? Why is YARN in Hadoop used? 7 K1 CO3
9. Outline and explain Hadoop YARN architecture. 6
Discuss about developing a MapReduce application and how 7 K1 CO3
10. job runs on MapReduce 6
Discuss about the Shuffling and Sorting process in Map 13 K1 CO3
11. Reduce in detail with neat sketches.
Enumerate the process to install MapReduce in Hadoop frame 13 K2 CO3
12. work.
Define Inverse Document Frequency and discuss about the 13 K1 CO3
13. anatomy of MapReduce

Unit - III / Part - C / 15 Marks


Marks K –
S.No Questions CO
Splitup Level
Count the frequency of each word in a given input text using Map 15 K2 CO3
Reduce. The input text is, “BIG DATA COMES IN VARIOUS
FORMATS. THIS DATA CAN BE STORED IN MULTIPLE
1. DATASERVERS.”
Count the frequency of each word in a given input textusing Map 15 K2 CO3
Reduce. The input text is, “SRI SAI RAM ENGINEERING
COLLEGE IS THE BEST ENGINEERING COLLEGE IN
2. CHENNAI.”
Describe in detail about the MapReduce Input and Output 15 K1 CO3
3. formats.
15 K1 CO3
How Map Reduce works and describe the anatomy of Map
4. Reduce Job working with suitable example.
Compare and contrast the Hadoop architecture with and without 15 K2 CO3
YARN and list the features of YARN
5.
UNIT - IV

Unit - IV / Part - A / 2 Marks


Mark K –
S.No Questions Split Leve CO
up l
2 K1 CO4
1. Define PIG.
2 K2 CO4
2. List the Features of PIG.
2 K1 CO4
3. What are the major component in the Apache Pig framework?
2 K1 CO4
4. Sketch Pig Latin Data model
2 K1 CO4
5. What are the execution modes of PIG?
2 K1 CO4
6. Define HIVE.
2 K2 CO4
7. State the need for HIVE in Facebook.
Who developed Apache PIG and the reason for which it is 2 K2 CO4
8. developed?
2 K2 CO4
9. List few HDFS commands that is used in Pig Grunt.
2 K1 CO4
10. Define NULL value in PIG LATIN.
2 K1 CO4
11. Define CASE operator in PIG LATIN.
2 K2 CO4
12. List types of User defined functions in JAVA.
2 K2 CO4
13. List the steps to write a User defined functions in JAVA.
2 K1 CO4
14. Define Hive QL.
2 K2 CO4
15. List the features of HIVE
2 K1 CO4
16. Define Zookeeper in HBase?
2 K2 CO4
17. List the Major components of Apache HIVE.
2 K2 CO4
18. List three types of HIVE Clients.
Differentiate between Internal and External tables in HiveQL 2 K2 CO4
19.

Unit - IV / Part - B / 13 Marks


Marks K –
S.No Questions CO
Splitup Level
(a)Discuss the following terminologies in pig with example 10 K1 CO4
1. Atom
2. Tuple
3. Bag
4. Map CO4
5. Relation
1. (b) Discuss about the features of Apache PIG. 4 K1
Explain in details about Apache PIG architecture and PIG 13 K1 CO4
2. Latin data model with necessary diagrams.
08 K1 CO4
Explain in detail about Apache Pig Execution Modes
05 K1 CO4
3.
Brief the following with necessary diagrams: 13 K1 CO4
1. HIVE
2. PIG
4. 3. SQL
Compare and Contrast HIVE, PIG, SQL, also list their benefits 13 K2 CO4
5. in detail.
Discuss in detail about the operators in PIG Latin with suitable 13 K1 CO4
6. examples.
Explain in detail about Apache PIG Grunt Shell with examples 13 K1 CO4
7. wherever needed.
13 K2 CO4
8. Compare and Contrast Pig, Hive and SQL
6 K1 CO4
Discuss about the following: 7
1. HIVE Clients
9. 2. HIVE Services
13 K1 CO4
10. Discuss in detail about the user defined functions in HIVE.
State the limitation of Hadoop that led to the invention of 13 K2 CO4
11. HBASE.
13 K1 CO4
12. Explain with neat diagram the HBase architecture.
Brief the following 6 K1 CO4
1. Column Oriented Database 7
2. Row Oriented Database
13. Compare and contrast the same.
Discuss the following in details. K1 CO4
1. HIVE Metastore 3
2. HIVE QL 5
14. 3. HIVE Tables 5

Unit - IV / Part - C / 15 Marks


S.No Questions Marks K– CO
Splitup Level
15 K2 CO4
Discuss in detail about the Steps to write a User defined
function in java with suitable example.
1.
15 K1 CO4
Compare and Contrast the following:
RDBMS and HIVE
HIVEQL and SQL
2.
15 K2 CO4
Discuss about the HIVEQL commands and HIVE Table
operations in detail.
3.
15 K2 CO4
Compare and Contrast the following:
1. HBase and HDFS
2. HBase and RDBMS
4.
15 K1 CO4
Explain in detail about the HBASE Shell commands in detail
with proper syntax.
5.

UNIT - V

Unit - V / Part - A / 2 Marks


Mark K –
S.No Questions Split Leve CO
up l
1. Differentiate GPU with CPU. 2 K1 CO6
2. What is the salient feature of GPU? 2 K1 CO6
3. List out the applications of GPU. 2 K2 CO6
4. What is CUDA? Why is it used? 2 K1 CO6
5. Write the syntax to write a cuda function and function call. 2 K1 CO6
6. Give the common workflow of CUDA programs. 2 K1 CO6
7. Write a CUDA program to perform addition of two numbers. 2 K2 CO6
8. Why is the cudaMemcpy() function used ? 2 K1 CO6
9. Define Spark and API used. 2 K1 CO5
10. Differentiate Spark with Hadoop. 2 K1 CO5
11. What is the role of driver and executor in Spark architecture? 2 K1 CO5
12. Define CUDA Memory model 2 K1 CO4
13. How is the Spark shell used?Give an example. 2 K1 CO5
14. List the features of Spark. 2 K1 CO5
15. Write a Spark code to read a text file from HDFS 2 K2 CO5

Unit - V / Part - B / 13 Marks


Marks K –
S.No Questions CO
Splitup Level
1. Explain in detail about GPU architecture and features. 13 K1 CO6
2. Illustrate in detail about CUDA Programming Model. 13 K1 CO6
a) Write a CUDA program to perform addition of two matrices. 08 K2 CO6
b) Write a CUDA Program to perform basic multiplication
3. operations in vector variables. 05 K2 CO6
Explain in detail about the steps of converting the Vector 13 K1 CO6
4. addition program to CUDA.
Explain in detail about Device Management in CUDA with 13 K1 CO6
5. necessary functions with examples.
Write a C program to swap two numbers and use this host 13 K2 CO6
variable in device variable in CUDA program for swapping of
6. two numbers. Write the steps of conversion.
Explain in detail about SPARK API,its applications,features 13 K1 CO5
7. and characteristics.
Explain in detail about SPARK Architecture and describe its 13 K1 CO5
8. components.
a)Write Spark API to count the number of words in the given 07 K2 CO5
text file.
b)Write a Spark API Program to display the error message
9. from the log file. 06 K2 CO5
Write the Steps of writing a Spark API program in Scala or 13 K1 CO5
10. Java.
Write a CUDA program to perform basic mathematical 13 K2 CO6
11. operations using vectors.
Describe in detail about how GPU is better than CPU with 13 K2 CO6
12. comparative study.
Write a SPARK API Program to display the highest occurrence 13 K2 CO5
13. of a file.

Unit - V / Part - C / 15 Marks


Marks K –
S.No Questions CO
Splitup Level
1. Explain in detail about CUDA Programming Model. 15 K1 CO6
2. Write a CUDA program to perform Matrix Multiplication. 15 K2 CO6
Explain Spark architecture,Write a simple API application with 15 K1 CO5
3. steps of compilation.
Explain in detail about GPU Architecture.List out the features 15 K1 CO6
4. GPU better than CPU.
Explain in detail about how data analysis is done with spark 15 K1 CO5
5. shells.
Course Outcomes:

CO Outcomes K level
CO1 Illustrate various big data concepts and its use cases in various application
K1
domains.
CO2 Understand the Hadoop distributed file systems on different applications. K2
CO3 Infer the working of Hadoop architecture and Map reduce Framework. K2
CO4 Articulate the different Hadoop ecosystem components. K3
CO5 Demonstrate the big data solutions using Spark Programming. K3
CO6 Solve the various distributed applications using the Big data technologies. K3

Exam / CO CO1 CO2 CO3 CO4 CO5 CO6


CAT 1 52 48 .... .... .... ....
CAT 2 .... ... 48 52 .... ....
CAT 3 52 48
END SEM 17 17 17 17 17 15

Course Moderator: Mr.R.Sampath -AP/IT/SSIT

Course Coordinators: Mr.R.Sampath-AP/IT/SSIT


Mrs.P.Leela Jancy -AP/IT/SSIT
Mrs.Thamizhselvi – AP /IT/SSEC

You might also like