Big Data Ex Hdfs
– LIMOS
BIG DATA
EXERCISES – HDFS
Laurent d’Orazio
2015/2016
Installation
This section explains how to install Hadoop. The first step is to download the Hadoop binary distribution,
which is available at the following address:
http://hadoop.apache.org
Then set the JAVA_HOME and HADOOP_HOME environment variables using the following commands:
export JAVA_HOME=<path_to_java_home>
export HADOOP_HOME=<path_to_hadoop>
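Before going further, it can help to confirm that both variables point to existing directories. The following is a minimal sketch of such a check; the helper name check_dir_var is hypothetical and is not part of Hadoop.

```shell
#!/bin/sh
# check_dir_var NAME: succeed if the variable NAME is set and its value
# is an existing directory (hypothetical helper, not a Hadoop tool).
check_dir_var() {
    name=$1
    # Indirect lookup: copy the value of the variable called $name into $value.
    eval "value=\${$name:-}"
    if [ -z "$value" ]; then
        echo "$name is not set"
        return 1
    fi
    if [ ! -d "$value" ]; then
        echo "$name points to a missing directory: $value"
        return 1
    fi
    echo "$name=$value"
}

# Usage: check_dir_var JAVA_HOME && check_dir_var HADOOP_HOME
```

Each call prints the variable and its value on success, or a short diagnostic and a non-zero status otherwise.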
Pseudo-distributed mode
This part uses HDFS in pseudo-distributed mode, in which each Hadoop daemon runs as a separate
process on a single node, simulating the communications of a cluster.
Configuration
Edit the following files:
etc/hadoop/hadoop-env.sh
...
export JAVA_HOME=<path_to_java_home>
...
etc/hadoop/core-site.xml
<configuration>
  <property>
    <name>hadoop.tmp.dir</name>
    <value>/home/oth/Documents/work/data/hadoop/tmp</value>
  </property>
  <property>
    <name>fs.defaultFS</name>
    <value>hdfs://localhost:9000</value>
  </property>
</configuration>
etc/hadoop/hdfs-site.xml
<configuration>
  <property>
    <name>dfs.replication</name>
    <value>1</value>
  </property>
</configuration>
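Once the files are edited, the values Hadoop actually reads can be printed back with hdfs getconf -confKey, which does not require the daemons to be running. The sketch below assumes HADOOP_HOME is set as in the Installation section; the HDFS_CMD variable and the show_conf helper are hypothetical names introduced for illustration.

```shell
#!/bin/sh
# Path to the hdfs launcher (assumes HADOOP_HOME is set; HDFS_CMD is a
# hypothetical override, handy when hdfs is installed elsewhere).
HDFS_CMD="${HDFS_CMD:-$HADOOP_HOME/bin/hdfs}"

# show_conf KEY...: print "KEY = value" for each configuration key,
# asking Hadoop itself through "hdfs getconf -confKey".
show_conf() {
    for key in "$@"; do
        printf '%s = %s\n' "$key" "$("$HDFS_CMD" getconf -confKey "$key")"
    done
}

# Usage: show_conf fs.defaultFS dfs.replication hadoop.tmp.dir
```

With the configuration above, fs.defaultFS should be reported as hdfs://localhost:9000 and dfs.replication as 1.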
Execution
Format the file system
bin/hdfs namenode -format
Start the nodes
sbin/start-all.sh
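After sbin/start-all.sh has run, the JDK tool jps lists the running Java processes, which gives a quick way to see whether the HDFS daemons came up. The following sketch filters that listing; the helper name missing_daemons is hypothetical.

```shell
#!/bin/sh
# missing_daemons NAME...: read "jps" output on standard input and print
# each expected daemon name that does not appear in it
# (hypothetical helper, not a Hadoop tool).
missing_daemons() {
    listing=$(cat)
    for daemon in "$@"; do
        # jps prints one "<pid> <class name>" pair per line; the leading
        # space keeps "NameNode" from matching "SecondaryNameNode".
        if ! printf '%s\n' "$listing" | grep -q " $daemon\$"; then
            echo "$daemon"
        fi
    done
}

# Usage: jps | missing_daemons NameNode DataNode SecondaryNameNode
```

An empty result means every listed daemon was found; any name printed points to a daemon whose log file is worth inspecting.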
Note
Shut down the nodes
sbin/stop-all.sh
File management
Make some HDFS directories
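Directories are created with the hdfs dfs -mkdir command and can be inspected with hdfs dfs -ls. The sketch below wraps both; it assumes HADOOP_HOME is set and the nodes are running, and the names HDFS_CMD and make_hdfs_dirs are hypothetical.

```shell
#!/bin/sh
# Path to the hdfs launcher (assumes HADOOP_HOME is set; HDFS_CMD is a
# hypothetical override).
HDFS_CMD="${HDFS_CMD:-$HADOOP_HOME/bin/hdfs}"

# make_hdfs_dirs DIR...: create each HDFS directory, including missing
# parents (-p), then list the root of the file system.
make_hdfs_dirs() {
    for dir in "$@"; do
        "$HDFS_CMD" dfs -mkdir -p "$dir" || return 1
    done
    "$HDFS_CMD" dfs -ls /
}

# Usage: make_hdfs_dirs "/user/$USER"
```

Creating /user/<username> first is a common choice because relative HDFS paths are resolved against the user's home directory.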
Bibliography
http://hadoop.apache.org