hadoop中的MultipleOutputFormat [英] MultipleOutputFormat in hadoop

查看：24 发布时间：2021/12/15 18:25:03 java hadoop mapreduce

本文介绍了hadoop中的MultipleOutputFormat的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我是 Hadoop 的新手.我正在试用 Wordcount 程序.

I'm a newbie in Hadoop. I'm trying out the Wordcount program.

现在要尝试多个输出文件，我使用 MultipleOutputFormat.这个链接帮助我做到了.http://hadoop.apache.org/common/docs/r0.19.0/api/org/apache/hadoop/mapred/lib/MultipleOutputs.html

Now to try out multiple output files, i use MultipleOutputFormat. this link helped me in doing it. http://hadoop.apache.org/common/docs/r0.19.0/api/org/apache/hadoop/mapred/lib/MultipleOutputs.html

在我的司机课上

    MultipleOutputs.addNamedOutput(conf, "even",
            org.apache.hadoop.mapred.TextOutputFormat.class, Text.class,
            IntWritable.class);

    MultipleOutputs.addNamedOutput(conf, "odd",
            org.apache.hadoop.mapred.TextOutputFormat.class, Text.class,
            IntWritable.class);`

我的reduce类变成了这个

and my reduce class became this

public static class Reduce extends MapReduceBase implements
        Reducer<Text, IntWritable, Text, IntWritable> {
    MultipleOutputs mos = null;

    public void configure(JobConf job) {
        mos = new MultipleOutputs(job);
    }

    public void reduce(Text key, Iterator<IntWritable> values,
            OutputCollector<Text, IntWritable> output, Reporter reporter)
            throws IOException {
        int sum = 0;
        while (values.hasNext()) {
            sum += values.next().get();
        }
        if (sum % 2 == 0) {
            mos.getCollector("even", reporter).collect(key, new IntWritable(sum));
        }else {
            mos.getCollector("odd", reporter).collect(key, new IntWritable(sum));
        }
        //output.collect(key, new IntWritable(sum));
    }
    @Override
    public void close() throws IOException {
        // TODO Auto-generated method stub
    mos.close();
    }
}

一切正常，但我得到了很多文件，(每个 map-reduce 一个奇数和一个偶数)

Things worked , but i get LOT of files, (one odd and one even for every map-reduce)

问题是:我怎样才能只有 2 个输出文件(奇数和偶数)，以便每个 map-reduce 的每个奇数输出都写入该奇数文件，而偶数也是如此.

Question is : How can i have just 2 output files (odd & even) so that every odd output of every map-reduce gets written into that odd file, and same for even.

hadoop中的MultipleOutputFormat [英] MultipleOutputFormat in hadoop

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录关闭

hadoop中的MultipleOutputFormat [英] MultipleOutputFormat in hadoop

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录 关闭

登录关闭