Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 3

WORD COUNT

OPEN VMWARE
CLICK ON POWER ON VIRTUAL MEACHINE
OPEN ECLIPSE
FILE> NEW FILE>JAVAPROGECT
Name: WordCount
GOTOLIBRARIES> EXTERNALJARS
SELECT FILE SYSTEM>usr>lib>hadoop>
SELECTA JAR FILES
CLICK OK
ADD EXTERNAL JAR FILES
GOTO CLIENT>SELECT ALL JAR FILES
CLICK OK
FINISH
FILE> NEW> CLASS>CLASSNAME:-WordCount
FINISH
COPY THE CODE from ONLINE PLACE THE CODE

https://hadoop.apache.org/docs/stable/hadoop-mapreduce-
client/hadoop-mapreduce-client-core/
MapReduceTutorial.html#:~:text=WordCount%20is%20a
%20simple%20application,installation%20(Single%20Node
%20Setup).

Goto File click on EXPORTS


Select Java>Jarfile
select on checkbox of WordCount
Below set export destination:
Browse>desktop
Above change untitled.jar to WordCount.jar
Click ok
WORD COUNT

Click finish

Open command prompt on virtual machine on desktop


pwd TO KNOW PRESE/NT WORKING DIRECTORY

file create:
cat>/home/cloudera/file2.txt
NEXT ENTER FILE DATA
a
b
c
AFTER COMPLETION ENTER CTRL+z
TO SEE FILE DETAILS
cat /home/cloudera/file2.txt
a
b
c
NEXT TAKE LOCAL TO HDFS
CREATE A FOLDER
hdfs dfs -mkdir /inputfolder2
WORD COUNT

LOCAL DATA IS GOING TO STORE IN HDFS FOLDER


hdfs dfs -put /home/cloudera/file2.txt /inputfolder2
NOW RUN THE FILE PROCESS
hadoop jar /home/cloudera/Desktop/WORDCOUNT.jar
WordCount /inputfolder2/file2.txt /out2
WAIT UNTIL EXECUTION COMPLETES
hdfs dfs -ls /out2
O/P:-rw-r--r-- 1 cloudera supergroup 0 2022-05-17
03:14 /out2/_SUCCESS
-rw-r--r-- 1 cloudera supergroup 12 2022-05-17 03:14
/out2/part-r-00000
TO SEE OUTPUT:COPY ABOVE PATH AS BELOW
hdfs dfs -cat /out2/part-r-00000
OUTPUT:
a 1
b 1
c 1

You might also like