University Institute of Computing: Big Data Analytics 21CAH-782

You might also like

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 13

UNIVERSITY INSTITUTE OF COMPUTING

MASTER OF COMPUTER APPLICATIONS


BIG DATA ANALYTICS
21CAH-782

DISCOVER . LEARN . EMPOWER


1
OBJECTIVES
• To explore the fundamental concepts of big data analytics
• To learn to analyze the big data using intelligent techniques.
• To understand the various search methods and visualization techniques.
• To learn to use various techniques for mining data stream.
• To understand the applications using Map Reduce Concepts
What Is Digital Data?
Digital data is the electronic representation of information in a format or language
that machines can read and understand. In more technical terms, digital data is a
binary format of information that's converted into a machine-readable digital
format. The power of digital data is that any analog inputs, from very simple text
documents to genome sequencing results, can be represented with the binary
system.
Big Data:Definition
Big Data is a collection of data that is huge in volume, yet growing exponentially
with time. It is a data with so large size and complexity that none of traditional data
management tools can store it or process it efficiently. Big data is also a data but
with huge size.
• As Gartner defines it – “Big Data are high volume, high velocity, or high-variety information
assets that require new forms of processing to enable enhanced decision making, insight discovery,
and process optimization.”
3
Big Data:Definition
• The term ‘big data’ is self-explanatory − a collection of huge data sets that normal
computing techniques cannot process.
• The term not only refers to the data, but also to the various frameworks, tools, and
techniques involved.
• Technological advancement and the advent of new channels of communication (like social
networking) and new, stronger devices have presented a challenge to industry players in
the sense that they have to find other ways to handle the data.
• Big data is an all-inclusive term, representing the enormous volume of complex data sets
that
• companies and governments generate in the present-day digital environment.
• Big data, typically measured in petabytes or terabytes, materializes from three major
sources— transactional data, machine data, and social data.

4
Types of Big Data

1. Structured
2. Unstructured
3. Semi-structured

5
Types of Big Data

1. Structured Data:
Structured data is created using a fixed schema and is maintained in tabular format. The
elements in structured data are addressable for effective analysis. It contains all the data which can
be stored in the SQL Database in a tabular format. Today, most of the data is developed and
processed in the simplest way to manage information.
Examples –
Relational Data, Geo-location, credit card numbers, addresses, etc.
• Consider an example for Relational Data like you have to maintain a record of students for a
university like the name of the student, ID of a student, address, and Email of the student. To store
the record of students used the following relational schema and table for the same.
S_ID S_Name S_Address S_Email

1001 A Delhi A@gmail.com

1002 B Mumbai B@gmail.co

6
Types of Big Data

2. Unstructured Data:
• It is defined as the data in which is not follow a pre-defined standard
or you can say that any does not follow any organized format. This
kind of data is also not fit for the relational database because in the
relational database you will see a pre-defined manner or you can say
organized way of data. Unstructured data is also very important for the
big data domain and To manage and store Unstructured data there are
many platforms to handle it like No-SQL Database.
• Examples –
• Word, PDF, text, media logs, etc.

7
Types of Big Data
• 3. Semi-Structured/Hybrid Data :
Semi-structured data is information that does not reside in a relational database but that
have some organizational properties that make it easier to analyse. With some process, you
can store them in a relational database but is very hard for some kind of semi-structured
data, but semi-structured exist to ease space.
Example – XML data
• Features of Data Classification:
• The main goal of the organization of data is to arrange the data in such a form that it becomes
fairly available to the users. So it’s basic features as following.
• Homogeneity – The data items in a particular group should be similar to each other.
• Clarity – There must be no confusion in the positioning of any data item in a particular group.
• Stability – The data item set must be stable i.e. any investigation should not affect the same set of
classification.
• Elastic – One should be able to change the basis of classification as the purpose of classification
changes. 8
Traditional Data Vs Big Data

9
Big Data Analytics
Big data analytics examines large amounts of data to uncover hidden patterns,
correlations and other insights. With today’s technology, it’s possible to analyse
your data and get answers from it almost immediately – an effort that’s slower and
less efficient with more traditional business intelligence solutions.

10
Big Data Analytics
• Big data analytics is a method to uncover the hidden designs in large
data, to extract useful information that can be divided into two major
sub-systems: data management and analysis.
• Big data analytics is a process of inspecting, differentiating and
transforming big data with the goal of identifying useful information,
suggesting conclusion and helping to take accurate decisions.
• Analytics include both data mining and communication or guide
decision making.

11
Importance of Big Data Analytics
• Big data analytics through specialized systems and software can lead
to positive business-related outcomes
• New revenue opportunities
• More effective marketing
• Better customer service
• Improved operational efficiency
• Competitive advantages over rivals

12
THANK YOU

13

You might also like