两次调用MapReduce [英] Calling MapReduce Twice

查看：219 发布时间：2020/11/22 3:01:55 java hadoop

本文介绍了两次调用MapReduce的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我在这里关注字数统计教程:

I'm following the word count tutorial here: https://hadoop.apache.org/docs/stable/hadoop-mapreduce-client/hadoop-mapreduce-client-core/MapReduceTutorial.html#Example:_WordCount_v1.0

然后我可以得出一个单词以这种格式出现的频率:

and I can produce how often a word appears in this format:

word frequency
1    1
2    2
3    3
4    1
5    2
6    1

但是，现在我需要对频率进行分组:

However, now I need to group the frequency like this:

frequency   count
1           3
2           2
3           1

基本上，对于每个频率，找出出现频率.我将如何修改代码以显示此信息?我觉得我必须修改IntSumReducer，但是我从未真正使用过Hadoop.

Basically, for each frequency, find out how often that appeared. How would I modify the code to show this? I feel like I have to modify IntSumReducer but I've never really worked with Hadoop.

两次调用MapReduce [英] Calling MapReduce Twice

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录关闭

两次调用MapReduce [英] Calling MapReduce Twice

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录 关闭

登录关闭