Introduction To Big Data and Hadoop


Big data is the term for a collection of data sets so large and complex that it becomes difficult to process them using on-hand database management tools or traditional data processing applications. The challenges include capture, storage, search, sharing, transfer, analysis and visualization.

Current social and economic changes create big data. Sharing data spontaneously, instantaneously and constantly through social networking lets us connect across boundaries, and more and more applications are used by individuals and organizations to extract value from data in pursuit of personal and professional goals.

Big data is any data attribute that challenges the constraints of a system's capability or a business need. A simple example: a 10 MB presentation that cannot be shared with our team via email is big data for us.

Some examples of scale:

Google processes 20 PB a day (2008)
Facebook has 2.5 PB of user data + 15 TB/day (4/2009)
eBay has 6.5 PB of user data + 50 TB/day (5/2009)
CERN's Large Hadron Collider (LHC) generates 15 PB a year
By 2050, the data generated will be 50 times the current volume

Big data can be divided into three types:

Structured data
Transaction details, system logs, etc.

Unstructured data
Social networking data, weather data, etc.

Semi-structured data
XML files
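The three types above can be illustrated with small samples. The records below are made-up examples, not from any real system; the point is how addressable the data is in each case.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Structured: fixed schema, every record has the same named fields
# (e.g. transaction details in a table or CSV export)
structured = io.StringIO("txn_id,amount\n1001,250.00\n1002,99.95\n")
rows = list(csv.DictReader(structured))
print(rows[0]["amount"])  # fields are addressable by name

# Semi-structured: self-describing tags, but no rigid table schema (e.g. XML)
xml_doc = "<orders><order id='1001'><amount>250.00</amount></order></orders>"
root = ET.fromstring(xml_doc)
print(root.find("order/amount").text)  # navigable, but shape may vary per record

# Unstructured: free text with no schema at all (e.g. a social media post)
post = "Loving the weather today! #sunny"
print(len(post.split()))  # only crude processing, like tokenizing, applies directly
```
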

Apache Hadoop is an open-source software framework for storage and large-scale processing of data sets on clusters of commodity hardware. Hadoop was created by Doug Cutting and Mike Cafarella in 2005. Cutting, who was working at Yahoo at the time, named it after his son's toy elephant. Its key properties:

Scalable: It can reliably store and process petabytes.
Economical: It distributes the data and processing across clusters of commonly available computers (in the thousands).
Efficient: By distributing the data, it can process it in parallel on the nodes where the data is located.
Reliable: It automatically maintains multiple copies of data and automatically redeploys computing tasks on failure.
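The "Reliable" property can be sketched in miniature: if each block of a file is replicated across several nodes (HDFS uses a default replication factor of 3), losing a single node loses no data. The node and block names below are invented for illustration; this is a toy model of the placement idea, not the actual HDFS placement policy.

```python
REPLICATION_FACTOR = 3

def place_blocks(blocks, nodes, rf=REPLICATION_FACTOR):
    """Assign each block to rf distinct nodes, round-robin style."""
    placement = {}
    for i, block in enumerate(blocks):
        placement[block] = [nodes[(i + k) % len(nodes)] for k in range(rf)]
    return placement

def surviving_blocks(placement, failed_node):
    """Blocks still readable after one node fails (at least one copy remains)."""
    return {block for block, replicas in placement.items()
            if any(node != failed_node for node in replicas)}

nodes = ["node1", "node2", "node3", "node4"]
placement = place_blocks(["blk_a", "blk_b", "blk_c"], nodes)
# With 3 copies per block, any single node failure leaves every block readable.
print(surviving_blocks(placement, "node2"))
```
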

GOOGLE                   APACHE HADOOP
Google MapReduce         Hadoop MapReduce
BigTable                 HBase
Google File System       Hadoop Distributed File System (HDFS)
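The MapReduce model that Hadoop borrows from Google can be sketched in plain Python: a map phase emits key-value pairs, a shuffle groups pairs by key, and a reduce phase aggregates each group. This is a single-process word-count illustration of the model, not the Hadoop API.

```python
from collections import defaultdict

def map_phase(line):
    # Emit a (word, 1) pair for every word in an input line
    for word in line.split():
        yield (word.lower(), 1)

def shuffle(pairs):
    # Group values by key, as the framework does between map and reduce
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    # Aggregate all counts emitted for one word
    return (key, sum(values))

lines = ["big data big hadoop", "hadoop processes big data"]
pairs = [kv for line in lines for kv in map_phase(line)]
counts = dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
print(counts["big"])  # → 3
```

In a real Hadoop job, map and reduce tasks run in parallel on the nodes that hold the data, and the shuffle moves grouped pairs across the network between them.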
