Multiple Input Paths configuration in OOZIE
Problem description
I am trying to configure a MapReduce job in Oozie. The job has two different input formats and two input data folders. Following the post "How to configure oozie workflow for multi-input path with multiple mappers", I added these properties to my workflow.xml:
<property>
<name>mapred.input.dir.formats</name>
<value>folder/data/*;org.apache.hadoop.mapred.SequenceFileInputFormat\,data/*;org.apache.hadoop.mapred.TextInputFormat</value>
</property>
<property>
<name>mapred.input.dir.mappers</name>
<value>folder/data/*;....PublicMapper\,data/*;....PublicMapper</value>
</property>
But when the job is launched, I get the following error: "No input paths specified in job".
Can anyone help me?
Thanks.
Solution
You need to set some additional properties so that the job actually uses the per-path input formats and mappers declared above:
<property>
<name>mapreduce.inputformat.class</name>
<value>org.apache.hadoop.mapreduce.lib.input.DelegatingInputFormat</value>
</property>
<property>
<name>mapreduce.map.class</name>
<value>org.apache.hadoop.mapreduce.lib.input.DelegatingMapper</value>
</property>
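Putting the properties from the question and the answer together, the configuration of the map-reduce action might look roughly like the sketch below. This is not a tested configuration, and it rests on several assumptions that are not in the original thread: that the entries in each list are separated by a plain comma (the separator MultipleInputs itself writes) rather than "\,"; that the new-API input formats under org.apache.hadoop.mapreduce.lib.input are used, since the delegating classes come from the new API; and that Oozie has to be told to use the new API through the two new-api flags. The "...." in the mapper class names is elided in the question and is left as it is; it stands for your own package name.

```xml
<configuration>
    <!-- Assumption: switch the action to the new (mapreduce) API -->
    <property>
        <name>mapred.mapper.new-api</name>
        <value>true</value>
    </property>
    <property>
        <name>mapred.reducer.new-api</name>
        <value>true</value>
    </property>

    <!-- From the answer: delegate to the per-path input format and mapper -->
    <property>
        <name>mapreduce.inputformat.class</name>
        <value>org.apache.hadoop.mapreduce.lib.input.DelegatingInputFormat</value>
    </property>
    <property>
        <name>mapreduce.map.class</name>
        <value>org.apache.hadoop.mapreduce.lib.input.DelegatingMapper</value>
    </property>

    <!-- "path;InputFormat" pairs. Assumption: plain comma separator and new-API format classes -->
    <property>
        <name>mapred.input.dir.formats</name>
        <value>folder/data/*;org.apache.hadoop.mapreduce.lib.input.SequenceFileInputFormat,data/*;org.apache.hadoop.mapreduce.lib.input.TextInputFormat</value>
    </property>

    <!-- "path;Mapper" pairs. "...." is elided in the question: replace it with the real package -->
    <property>
        <name>mapred.input.dir.mappers</name>
        <value>folder/data/*;....PublicMapper,data/*;....PublicMapper</value>
    </property>
</configuration>
```

If the job still reports that no input paths are specified, it may be worth checking in the job configuration of the launched Hadoop job whether the two delegating classes were actually picked up.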