Professional Documents
Culture Documents
CH 01 A Introduction To Big Data
CH 01 A Introduction To Big Data
ANALYTICS
5 Concept of Hadoop
No single definition
2 Image
3 Audio
4 Video
14
©"2015"IBM"Corporation
Source: Josh James, Domosphere
Mobile devices
(tracking all objects all the
time)
Social media and Scientific instruments
networks (collecting all sorts of
(all of us are generating data)
data)
Sensor technology and
networks
(measuring all kinds of data)
•Big :
• It is doubling every two years, and changing the way we live.
• Something that is Large
• Is it Kbyte , M byte, G byte ??
Acquistion Storage
Data writing and Traditional devices
reading speeds are are not able to store
different in the this huge amount of
devices. data.
Searching Visualization
Traditional data Traditional tools of
bases can not be data visualization
used to search a are failing with large
particular data data set
from large chunk
of data.
Analytics
Sharing Due to large size of data
Large BW it is difficult to analyse
the data with traditional
algorithms..
Exponential increase in
collected/generated data
compiled by Dr. Rohini Temkar 29
Characteristics of Big Data
2-Complexity (Varity)
■ Various formats, types, and
structures
■ Text, numerical, images, audio,
video, sequences, time series,
social media data, multi-dim
arrays, etc…
■ Static data vs. streaming data
■ A single application can be
generating/collecting many
types of data
Bankin
Social g
Media Financ
e
Our
Gamin
g Custom Known
Histor
y
er
Entertai Purcha
n se
Audio, Call center data, Customer call logs , Voice Transcriptions, Voice
Video & mails, phone logs, Videos, Surveillance images, Medical images
Images
• textual data
files with a
discernible
pattern that
enables
parsing.
• for ex, extensible
markup language
[xml] data files that
are self- describing
and defined by an
xml schema