Hands-On Hadoop Tutorial
Chris Sosa
Wolfgang Richter
May 23, 2008
General Information
• Hadoop uses HDFS, a distributed file system based on GFS, as its shared filesystem
• The HDFS architecture divides files into large chunks (~64 MB) distributed across data servers
• HDFS has a global namespace
General Information (cont'd)
• A setup script is provided for your convenience
– Run source /localtmp/hadoop/setupVars from centurion064
– Changes all uses of {somePath}/command to just command
• Go to http://www.cs.virginia.edu/~cbs6n/hadoop for web access. These slides and more information are also available there.
• Once you use the DFS (put something in it), relative paths are resolved from /usr/{your usr id}; e.g., if your id is tb28, your "home dir" is /usr/tb28
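For example (a sketch; tb28 is the slide's example user id, and the file name is made up):

  source /localtmp/hadoop/setupVars      # run once per shell, on centurion064
  hadoop dfs -put notes.txt notes.txt    # run as tb28, this lands in /usr/tb28/notes.txt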
Master Node
• Hadoop is currently configured with centurion064 as the master node
• The master node
– Keeps track of the namespace and metadata about items
– Keeps track of MapReduce jobs in the system
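A quick way to check that the master is up (the port numbers are the stock info-server defaults for this generation of Hadoop, so treat them as assumptions if the site config overrides them):

  hadoop dfs -ls /    # the namenode on centurion064 answers namespace queries
  # or browse http://centurion064:50070 (DFS status) and http://centurion064:50030 (job status)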
Slave Nodes
• centurion064 also acts as a slave node
• Slave nodes
– Manage blocks of data sent from the master node
– In terms of GFS, these are the chunkservers
• Currently centurion060 is also a slave node
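To see which roles a machine is currently playing, one option (assuming a JDK with jps on the PATH) is to list its Java daemons:

  jps
  # on centurion064 (master and slave) expect roughly:
  #   NameNode, JobTracker    master-side daemons
  #   DataNode, TaskTracker   slave-side daemons (all that centurion060 should show)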
Hadoop Paths
• Hadoop is locally "installed" on each machine
– Installed location is /localtmp/hadoop/hadoop-0.15.3
– Slave nodes store their data in /localtmp/hadoop/hadoop-dfs (this is created automatically by the DFS)
– /localtmp/hadoop is owned by group gbg (someone in this group, or a CS admin, must administer it)
• Files are divided into 64 MB chunks (this is configurable)
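For example, the chunk size can be overridden in conf/hadoop-site.xml; a sketch, assuming the 0.15-era property name dfs.block.size (value in bytes, affecting only files written after the change):

  <property>
    <name>dfs.block.size</name>
    <!-- 134217728 bytes = 128 MB instead of the default 64 MB -->
    <value>134217728</value>
  </property>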
Starting / Stopping Hadoop
• For the purposes of this tutorial, we assume you have run the setupVars script from earlier
• start-all.sh – starts all slave nodes and the master node
• stop-all.sh – stops all slave nodes and the master node
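A minimal session (a sketch, assuming setupVars has been sourced so the scripts are on the PATH):

  start-all.sh        # brings up the master daemons, then every slave listed in conf/slaves
  hadoop dfs -ls      # quick check that the DFS is answering
  stop-all.sh         # shuts the whole cluster back down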
Using HDFS (1/2)
• hadoop dfs (a worked example session follows this list)
– [-ls <path>]
– [-du <path>]
– [-cp <src> <dst>]
– [-rm <path>]
– [-put <localsrc> <dst>]
– [-copyFromLocal <localsrc> <dst>]
– [-moveFromLocal <localsrc> <dst>]
– [-get [-crc] <src> <localdst>]
– [-cat <src>]
– [-copyToLocal [-crc] <src> <localdst>]
– [-moveToLocal [-crc] <src> <localdst>]
– [-mkdir <path>]
– [-touchz <path>]
– [-test -[ezd] <path>]
– [-stat [format] <path>]
– [-help [cmd]]
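A short example session using a few of these (a sketch; the file and directory names are made up, and relative paths resolve under /usr/{your usr id} as described earlier):

  hadoop dfs -mkdir input                         # created as /usr/{your usr id}/input
  hadoop dfs -put notes.txt input/notes.txt       # copy a local file into the DFS
  hadoop dfs -ls input                            # list the new directory
  hadoop dfs -cat input/notes.txt                 # print the file straight from the DFS
  hadoop dfs -get input/notes.txt /tmp/notes.txt  # copy it back out to the local disk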
Using HDFS (2/2)
• Want to reformat the DFS?
• Easy
– hadoop namenode -format
• Basically, most commands look similar
– hadoop "some command" options
– If you just type hadoop you get all possible commands (including undocumented ones – hooray)
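A typical reformat cycle (a sketch; note that formatting wipes everything currently stored in the DFS):

  stop-all.sh               # stop the daemons first
  hadoop namenode -format   # re-initialize the namespace; all DFS contents are lost
  start-all.sh              # bring the empty DFS back up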
To Add Another Slave
• This adds another data node / job execution site to the pool (see the sketch after this list)
– Hadoop dynamically uses the filesystem underneath it
– If more space is available on the HDD, HDFS will try to use it when it needs to
• Modify the slaves file
– In centurion064:/localtmp/hadoop/hadoop-0.15.3/conf
– Copy the code installation dir to newMachine:/localtmp/hadoop/hadoop-0.15.3 (very small)
– Restart Hadoop
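A sketch of those three steps, run from centurion064 (newMachine is the slide's placeholder hostname; passwordless ssh to it is assumed, since the start/stop scripts need that anyway):

  echo newMachine >> /localtmp/hadoop/hadoop-0.15.3/conf/slaves
  scp -r /localtmp/hadoop/hadoop-0.15.3 newMachine:/localtmp/hadoop/
  stop-all.sh && start-all.sh    # restart so the master picks up the new slave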
Configure Hadoop
• Can configure in {$installation dir}/conf
– hadoop-default.xml for global settings
– hadoop-site.xml for site-specific settings (overrides global)
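A sketch of a minimal hadoop-site.xml; the hostname comes from these slides, but the port numbers are illustrative assumptions, not values from the tutorial:

  <configuration>
    <property>
      <name>fs.default.name</name>
      <value>centurion064:9000</value>    <!-- DFS master; port is an assumed example -->
    </property>
    <property>
      <name>mapred.job.tracker</name>
      <value>centurion064:9001</value>    <!-- MapReduce master; port is an assumed example -->
    </property>
  </configuration>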
That’s it for Configuration!
Real-time Access
