为什么我们需要在Hadoop程序中显式地设置输出键/值类？ [英] Why do we need to set the output key/value class explicitly in the Hadoop program?

查看：101 发布时间：2016/11/23 16:51:00 class input hadoop

本文介绍了为什么我们需要在Hadoop程序中显式地设置输出键/值类？的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

在Hadoop：The Definitive Guide一书中，有一个包含以下代码的示例程序。

In the "Hadoop : The Definitive Guide" book, there is a sample program with the below code.

JobConf conf = new JobConf(MaxTemperature.class);  
conf.setJobName("Max temperature");  
FileInputFormat.addInputPath(conf, new Path(args[0]));  
FileOutputFormat.setOutputPath(conf, new Path(args[1]));  
conf.setMapperClass(MaxTemperatureMapper.class);  
conf.setReducerClass(MaxTemperatureReducer.class);  
conf.setOutputKeyClass(Text.class);  
conf.setOutputValueClass(IntWritable.class);

MR框架应该能够从Mapper和Reduce中找出输出键和值类在JobConf类中设置的函数。为什么我们需要在JobConf类上显式设置输出键和值类？此外，输入键/值对没有类似的API。

The MR framework should be able to figure out the output key and value class from the Mapper and the Reduce functions which are being set on the JobConf class. Why do we need to explicitly set the output key and value class on the JobConf class? Also, there is no similar API for the input key/value pair.

为什么我们需要在Hadoop程序中显式地设置输出键/值类？ [英] Why do we need to set the output key/value class explicitly in the Hadoop program?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

为什么我们需要在Hadoop程序中显式地设置输出键/值类？ [英] Why do we need to set the output key/value class explicitly in the Hadoop program?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭