Professional Documents
Culture Documents
14 Oozie
14 Oozie
14 Oozie
and Analytics
Oozie
Agenda
• Overview
• What is Oozie?
• Features of Oozie
• Oozie architecture
• Workflows
• Coordinators
• Submitting Monitoring and Managing
Oozie Jobs.
• Oozie allows a user to create Directed Acyclic Graphs of workflows and these can
be ran in parallel and sequen)al in Hadoop
• Oozie can also run plain java classes, Pig workflows, and interact with the HDFS –
Nice if you need to delete or move files before a job runs
• Oozie can run job’s sequentially (one after the other) and in parallel
(multiple at a time)
• Java Client API / Command Line Interface – Launch, control, and monitor
jobs from your Java Apps
• Run Periodic jobs – Have jobs that you need to run every hour, day, week?
Have Oozie run the jobs for you
• Action Nodes:
• A variety of actions are supported
freq=1440
startTime=2012-07-12T14:00Z
endTime=2012-08-01T14:00Z
timezone=UTC
workflowPath=/user/hduser/oozieWF
jobtracker=localhost:54311
namenode=localhost:54310
PREFIX=hdfs://localhost:54310/user
<kill name="fail">
<message>Bummer, error message[${wf:errorMessage()}]
</message>
</kill>
<end name="end"/>
</workflow-app>
Step 7: Copy the below code into job.properties file #JobTracker and
NodeName
jobtracker=localhost:54311
namenode=localhost:54310
#prefix of the HDFS path for input and output, adapt!
PREFIX=hdfs://localhost:54310/user
#HDFS path where you need to copy workflow.xml and lib/*.jar to
oozie.wf.application.path=hdfs://localhost:54310/user/hduser/Oozi
eWFonHDFS/
#one of the values from Hadoop mapred.queue.names
queueName=default