Why is the setMapOutputKeyClass method necessary in a MapReduce job?

Views: 516 Published: 2020/5/5 15:38:05 Tags: types mapreduce


Problem description

When I write a MapReduce program, I often write code like this:

 job1.setMapOutputKeyClass(Text.class);

But why do we have to specify the map output key class explicitly? We have already specified it in the mapper class, for example:

public static class MyMapper extends
        Mapper<LongWritable, Text, Text, Text>

In the book Hadoop: The Definitive Guide, there is a table showing that the setMapOutputKeyClass method is optional (it is listed under "Properties for configuring types"). When I tested it, however, I found that it is necessary; otherwise the Eclipse console shows:

Type mismatch in key from map: expected org.apache.hadoop.io.LongWritable, received org.apache.hadoop.io.Text

Can someone tell me the reason for this?

The book says:

"The settings that have to be compatible with the MapReduce types are listed in the lower part of Table 8-1." Does this mean we have to set the properties in the lower part, but do not have to set the ones in the upper part?

The contents of the table are as follows:

Properties for configuring types:
mapreduce.job.inputformat.class  
mapreduce.map.output.key.class  
mapreduce.map.output.value.class  
mapreduce.job.output.key.class  
mapreduce.job.output.value.class 

Properties that must be consistent with the types:
mapreduce.job.map.class   
mapreduce.job.combine.class  
mapreduce.job.partitioner.class  
mapreduce.job.output.key.comparator.class 
mapreduce.job.output.group.comparator.class  
mapreduce.job.reduce.class  
mapreduce.job.outputformat.class
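
A minimal sketch of why the error message mentions LongWritable, assuming the framework resolves the map output key class by reading mapreduce.map.output.key.class, then falling back to mapreduce.job.output.key.class, and finally to LongWritable when neither is set. The class TypeFallbackSketch and its methods are hypothetical plain-Java stand-ins, not Hadoop's own code, and class names are handled as strings so the example runs without Hadoop on the classpath.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical model of the type lookup; not part of Hadoop.
public class TypeFallbackSketch {
    // Property names taken from the table above.
    public static final String MAP_OUTPUT_KEY = "mapreduce.map.output.key.class";
    public static final String JOB_OUTPUT_KEY = "mapreduce.job.output.key.class";
    // Assumed default when no key class is configured at all.
    public static final String DEFAULT_KEY = "org.apache.hadoop.io.LongWritable";

    // Map-specific setting first, then the job output key class, then the default.
    public static String resolveMapOutputKeyClass(Map<String, String> conf) {
        String mapKey = conf.get(MAP_OUTPUT_KEY);
        if (mapKey != null) {
            return mapKey;
        }
        return conf.getOrDefault(JOB_OUTPUT_KEY, DEFAULT_KEY);
    }

    // Compares the configured key class with the class the mapper actually emits.
    public static String check(Map<String, String> conf, String emittedKeyClass) {
        String expected = resolveMapOutputKeyClass(conf);
        if (!expected.equals(emittedKeyClass)) {
            return "Type mismatch in key from map: expected " + expected
                    + ", received " + emittedKeyClass;
        }
        return "ok";
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        String text = "org.apache.hadoop.io.Text";

        // Nothing configured: the lookup falls through to the assumed default.
        System.out.println(check(conf, text));

        // Equivalent in spirit to job1.setMapOutputKeyClass(Text.class).
        conf.put(MAP_OUTPUT_KEY, text);
        System.out.println(check(conf, text));
    }
}
```

Under this assumption, the generic parameters on MyMapper are never consulted at runtime; only the configured properties are. Setting just the job output key class to Text would also pass the check when the map and final output key types are the same, which would be consistent with the book listing the map-specific setting as optional.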
