(following up on "Why does the oozie launcher consume 2 yarn containers?")
I have a cluster with 1900 cores and 11 TB of RAM, and my Oozie workflow has the following structure:
The question is: is it possible to run these subworkflows in a single container (the application master)? By default, Oozie/YARN uses two cores for each subworkflow: one for the AM and one for the map-reduce task (the controller). This is the bottleneck: a third of all the cores in my cluster are used only for control, not for computation.
I think you can use uber mode in Oozie to save the container that launches the Oozie action job. With uber mode enabled, the AM launches the action itself instead of doing it from a separate container.
Add the following property to oozie-site.xml:
<property>
  <name>oozie.action.launcher.mapreduce.job.ubertask.enable</name>
  <value>true</value>
</property>
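If changing oozie-site.xml for the whole server is not an option, the same switch can usually be set per action in workflow.xml. This works because Oozie passes properties prefixed with `oozie.launcher.` to the launcher job. The fragment below is only a sketch. It assumes an Oozie version in which the launcher runs as a MapReduce job, and the action name, main class, and transition targets are hypothetical placeholders.

```xml
<!-- Sketch only: "example-action", com.example.Main, "end" and "fail"
     are hypothetical placeholders, not names from the original workflow. -->
<action name="example-action">
  <java>
    <job-tracker>${jobTracker}</job-tracker>
    <name-node>${nameNode}</name-node>
    <configuration>
      <!-- Ask the launcher job to run in uber mode, i.e. inside the AM container -->
      <property>
        <name>oozie.launcher.mapreduce.job.ubertask.enable</name>
        <value>true</value>
      </property>
    </configuration>
    <main-class>com.example.Main</main-class>
  </java>
  <ok to="end"/>
  <error to="fail"/>
</action>
```

With uber mode active, each action's launcher should occupy one container instead of two. Any jobs that the action itself submits still get their own containers.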