Oozie
Oozie is a workflow scheduler system that manages Apache Hadoop jobs. Oozie workflow jobs are Directed Acyclic Graphs (DAGs) of actions.
Oozie runs workflows of dependent jobs and lets users define those workflows as Directed Acyclic Graphs (DAGs). The actions in these DAGs can run in parallel or sequentially in Hadoop.
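To illustrate how a DAG of dependent jobs allows both parallel and sequential execution, here is a minimal Python sketch. It is not Oozie code and does not use any Oozie API: the job names and the execution_waves helper are hypothetical, and the standard-library graphlib module (Python 3.9 or later) stands in for the scheduler. Jobs grouped in the same wave have no unmet dependencies and could run in parallel, while the waves themselves run one after another.

```python
from graphlib import TopologicalSorter

# Hypothetical jobs: each key depends on the jobs listed in its value set.
dependencies = {
    "ingest": set(),
    "clean_logs": {"ingest"},
    "clean_users": {"ingest"},
    "join": {"clean_logs", "clean_users"},
    "report": {"join"},
}


def execution_waves(deps):
    """Group jobs into waves; jobs in one wave have no unmet dependencies."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()  # raises graphlib.CycleError if the graph is not acyclic
    waves = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # every job that is runnable now
        waves.append(ready)
        sorter.done(*ready)  # mark the wave finished, unlocking successors
    return waves


waves = execution_waves(dependencies)
for number, wave in enumerate(waves, start=1):
    print(number, wave)
```

In this sketch the two cleaning jobs land in the same wave because both depend only on the ingest job, whereas the join job must wait until both have finished.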
This workflow scheduler system consists of two parts:
Oozie runs as a service in a Hadoop cluster, with clients submitting workflow definitions for immediate or delayed processing.
An Oozie workflow consists of action nodes and control-flow nodes.
An action node represents a workflow task, such as moving files into HDFS. A control-flow node controls workflow execution between actions through constructs such as conditional logic, so that different actions can follow depending on the results of earlier action nodes.
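The distinction between the two node types can be sketched with a toy model in Python. This is an illustration only, not how Oozie is implemented or configured: the Action and Decision classes, the run_workflow function, the node names, and the "end" and "fail" terminal names are all hypothetical. An action node does work and then follows its success or failure transition, while a decision node does no work and only picks the next node based on what earlier actions recorded.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple, Union

Context = Dict[str, int]  # values recorded by earlier action nodes


@dataclass
class Action:
    """Action node: performs a task, then follows its ok or error transition."""
    run: Callable[[Context], None]  # does the work; raising signals failure
    ok: str                         # next node on success
    error: str                      # next node on failure


@dataclass
class Decision:
    """Control-flow node: does no work, only chooses the next node."""
    choose: Callable[[Context], str]


Node = Union[Action, Decision]


def run_workflow(nodes: Dict[str, Node], start: str,
                 terminals: Tuple[str, ...] = ("end", "fail")) -> List[str]:
    """Walk the workflow from start and return the list of visited nodes."""
    context: Context = {}
    path: List[str] = []
    current = start
    while current not in terminals:
        path.append(current)
        node = nodes[current]
        if isinstance(node, Action):
            try:
                node.run(context)
                current = node.ok
            except Exception:
                current = node.error
        else:
            current = node.choose(context)
    path.append(current)
    return path


def move_files(ctx: Context) -> None:
    ctx["files_moved"] = 3  # stand-in for moving files into HDFS


def process(ctx: Context) -> None:
    ctx["processed"] = ctx["files_moved"]  # stand-in for a processing job


nodes: Dict[str, Node] = {
    "move_files": Action(move_files, ok="any_new_files", error="fail"),
    "any_new_files": Decision(
        lambda ctx: "process" if ctx["files_moved"] > 0 else "end"),
    "process": Action(process, ok="end", error="fail"),
}

path = run_workflow(nodes, "move_files")
print(path)  # ['move_files', 'any_new_files', 'process', 'end']
```

If the first action moved no files, the decision node would route straight to the terminal node and the processing action would be skipped, which is the kind of conditional branching the control-flow nodes provide.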
Oozie integrates with the rest of the Hadoop stack and supports several types of Hadoop jobs out of the box.
Features of Oozie include: