Professional Documents
Culture Documents
Practice Set Data Analytics
Practice Set Data Analytics
Practice Set Data Analytics
1. How many types of data are available in data analytics, and what are
they?
2. Types of Data Analytics: An Overview and Description of Each.
3. Describe EDA and its various forms.
4. Talk about the features of big data.
5. Explain what a data warehouse is and how business intelligence relates
to it.
6. Discuss the characteristics of data extraction.
7. Talk about the Data Stack.
8. Provide an instance of how data analytics is used.
9. Talk about the application of big data.
10.Give examples of the tools used in big data.
11.See how Hadoop makes sure that data storage is fault-tolerant.
12.Differentiate Between Cloud Technology and Big Data.
13.Talk about cloud computing instead of big data.
14.Talk about HDFS and its main function within the Hadoop ecosystem.
15.Give a concrete instance of MapReduce's data processing capabilities in
the context of Hadoop.
16.Describe the benefits of Hadoop.
17.Examine how business intelligence helps companies make data-driven
decisions.
18.List the Drawbacks of Hadoop.
19.Give an example of data deserialization in big data and explain why it's a
necessary step in the data processing process.
20.Describe the role of data mining in Business Intelligence.
21.Give the definition of business intelligence (BI) in relation to data
analytics.
22.Describe business intelligence's primary objective in terms of data
analytics.
23.Talk about a few popular data sources that are analyzed by BI tools.
24.Describe the distinctions between business intelligence and traditional
reporting.
25.Identify a crucial part of a business intelligence system that is used for
reporting and data visualization.
26.Describe the advantages of self-service BI tools for businesses.
Data Analytics And Reporting(PCC-CSD503)
27.In the context of business intelligence, describe the idea of OLAP (Online
Analytical Processing).
28.Describe Hadoop and the issues it resolves with processing large
amounts of data.
29.What is the main programming model that Hadoop uses to process
data?
30.Describe the two main Hadoop components.
31.Explain the functions of the Hadoop Distributed File System's NameNode
and DataNode (HDFS).
32.Explain the importance of Hadoop's MapReduce process' "shuffle and
sort" phase.
33.Describe the Hadoop concept of data locality.
34.List a few Hadoop substitutes in the big data ecosystem.
35.What role does Hadoop play in the scalability of big data processing?
36.Give an example of a few common Hadoop use cases for big data apps.
37.In the context of big data, distinguish between structured and
unstructured data.
38.What part does data preprocessing play in big data analytics?
39.Explain the data measurement scale.
40.Talk about Big Data Types.
41.Explain the Different Data Analytics Stages.
42.What is data analytics? Describe the significance of data analytics.
43.Describe the Procedures for Exploratory Data Analysis (EDA).
44.What does business intelligence mean? Give examples of business
intelligence's advantages.
45.Describe Hadoop. Analyze the Hadoop components.
46.Establish the Hadoop Ecosystem.
47.Talk about data serialization in relation to large data. Why does big data
processing require data serialization?
48.Name the main benefits of processing large amounts of data with
Hadoop and MapReduce.
49.Talk about the functions of the reducer and mapper in map reduce.
50.Analyze how data analytics are used in a variety of sectors, including e-
commerce, finance, and healthcare. How have operations and decision-
making in these sectors been changed by data analytics?
Data Analytics And Reporting(PCC-CSD503)
1. What is data analytics? a) Storing data for future use b) Analyzing data to
extract meaningful insights c) Creating data visualizations d) Data
collection and reporting
2. Which of the following is not a common data analytics technique? a)
Regression analysis b) Machine learning c) Descriptive statistics d)
Database management
3. What is the primary goal of data preprocessing in data analytics? a)
Finding hidden patterns in data b) Cleaning and transforming data for
analysis c) Generating data visualizations d) Collecting data from various
sources
4. Which statistical measure is used to describe the spread or dispersion of
data? a) Mean b) Median c) Variance d) Mode
5. Which of the following is an example of supervised learning in machine
learning? a) Clustering b) Regression c) Anomaly detection d) Principal
component analysis
6. What is the purpose of data normalization in data analytics? a) Reducing
the dimensionality of data b) Scaling data to a common range c) Filling
missing data with zeros d) Creating data visualizations
7. Which data visualization type is best suited for showing the distribution
of a single variable? a) Line chart b) Scatter plot c) Histogram d) Pie chart
8. What is the main difference between structured and unstructured data?
a) Structured data is stored in databases, while unstructured data is not.
b) Structured data is easy to analyze, while unstructured data is not. c)
Structured data is text-based, while unstructured data is numeric. d)
Structured data has a defined format, while unstructured data does not.
9. Which statistical test is commonly used to determine if there is a
significant difference between two or more groups? a) T-test b) Chi-
squared test c) ANOVA (Analysis of Variance) d) Pearson correlation
Data Analytics And Reporting(PCC-CSD503)
10.In data analytics, what is the term used to describe the process of
combining data from multiple sources to create a single, unified dataset?
a) Data visualization b) Data exploration c) Data integration d) Data
aggregation
11.What is the primary goal of data analytics?
A) Data collection
B) Data storage
C) Data visualization
D) Extracting valuable insights from data
18.What is the term for the process of finding patterns and relationships in
data?
A) Data cleaning
B) Data visualization
C) Data exploration
D) Data modeling
34.In Hadoop, what is the role of the Map phase in the MapReduce
framework?
A. Data splitting and sorting
B. Data aggregation
C. Data storage in HDFS
D. Data visualization
Data Analytics And Reporting(PCC-CSD503)
45.What is the primary purpose of a data lake in the context of Big Data?
A. Data storage for structured data
B. Data storage for unstructured and semi-structured data
C. Real-time data processing
D. Data warehousing for historical data
48.What is the primary goal of data preprocessing in the context of Big Data
analytics?
A. Reducing data volume
B. Ensuring data is clean and ready for analysis
C. Aggregating data into a single repository
D. Applying machine learning algorithms
Data Analytics And Reporting(PCC-CSD503)
49.What is the primary challenge associated with data integration in a Big
Data environment?
A. Data duplication
B. Data privacy concerns
C. Data loss during transfer
D. Lack of data variety