Big Data
BLACK BOOK
Unit – I
8 Hours
Unit V
• NoSQL: Introduction to NoSQL: Why NoSQL,
Characteristics of NoSQL, History of NoSQL,
Types of NoSQL Data Models: Key-Value Data
Model, Column-Oriented Data Model,
Document Data Model, Graph Databases,
Schemaless Databases, Materialized views,
Distribution Models: CAP Theorem, Sharding
8 Hours
BOOKS
• Text Book:
• DT Editorial Services, "Big Data: Black Book, Comprehensive Problem
Solver", Dreamtech Press, 2016 Edition [Chapters 1, 2, 4, 5, 11, 12, 13, 15]
• Reference Book:
• Paul C. Zikopoulos, Chris Eaton, Dirk deRoos, Thomas Deutsch, George
Lapis, "Understanding Big Data – Analytics for Enterprise Class Hadoop
and Streaming Data", McGraw Hill, 2012.
• P. J. Sadalage and M. Fowler, "NoSQL Distilled: A Brief Guide to the
Emerging World of Polyglot Persistence", Addison-Wesley Professional, 2012.
• Tom White, "Hadoop: The Definitive Guide", Third Edition, O'Reilly, 2012.
What is Data
“Collection of raw facts from which conclusions may be drawn”
– Individuals
– Businesses
A Vocabulary for Measuring Information
If a Grain of Sand were One Byte of Information . . .
• 1 Megabyte = 1 million bytes: a tablespoon of sand
• 1 Gigabyte = 1 billion bytes: a patch of sand, 9" square, 1' deep
• 1 Terabyte = 1 trillion bytes: a sandbox, 24' square, 1' deep
• 1 Petabyte = 1,000 terabytes: a mile-long beach, 100' wide, 1' deep
A New Vocabulary for Measuring Information
If a Grain of Sand were One Byte of Information . . .
• 1 Exabyte = 1,000 petabytes: the same beach, from Maine to North Carolina
• 1 Zettabyte = 1,000 exabytes: the same beach, along the entire US coast
• 1 Yottabyte = 1,000 zettabytes: enough info to bury the entire US under 296 feet of sand
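The unit ladder above is plain arithmetic: each named unit is 1,000 times the previous one. A minimal Python sketch of that arithmetic follows. The names `UNITS`, `SYMBOLS`, `bytes_in`, and `humanize` are hypothetical helpers introduced here for illustration, and the kilobyte rung is added as an assumption to complete the ladder; the slides start at the megabyte.

```python
# Decimal byte units: each step up the ladder is a factor of 1,000.
# "kilobyte" is not on the slides; it is included here only to complete the ladder.
UNITS = ["byte", "kilobyte", "megabyte", "gigabyte", "terabyte",
         "petabyte", "exabyte", "zettabyte", "yottabyte"]
SYMBOLS = ["B", "kB", "MB", "GB", "TB", "PB", "EB", "ZB", "YB"]


def bytes_in(unit: str) -> int:
    """Return the number of bytes in one decimal `unit` (hypothetical helper)."""
    return 1000 ** UNITS.index(unit)


def humanize(n_bytes: int) -> str:
    """Express a byte count in the largest unit that keeps the value >= 1."""
    idx = 0
    # Integer comparisons avoid floating-point drift for very large counts.
    while idx < len(UNITS) - 1 and n_bytes >= 1000 ** (idx + 1):
        idx += 1
    value = n_bytes / 1000 ** idx
    return f"{value:g} {SYMBOLS[idx]}"


if __name__ == "__main__":
    for unit in UNITS[2:]:
        print(f"1 {unit} = {bytes_in(unit):,} bytes")
    print(humanize(2_500_000_000))
```

Note that this sketch uses the decimal (power-of-1,000) definitions given on the slides, not the binary (power-of-1,024) units sometimes used for memory sizes.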
Define Information
• What do individuals/businesses do
with the data they collect?
– They turn it into “information”
– “Information is the intelligence
– For example:
• Buying habits and patterns of customers
• Health history of patients
[Figure: centralized information storage and processing, with networks for uploading and accessing information; labels include "trends" and "Demand for more information"]