洗牌阶段实际上是做什么的? [英] What does the shuffling phase actually do?

查看：190 发布时间：2020/5/5 15:37:49 hadoop mapreduce shuffle mapper reducers

本文介绍了洗牌阶段实际上是做什么的?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

洗牌阶段实际上是做什么的?

What does the shuffling phase actually do?

由于改组是将映射器o/p引入化简器o/p的过程，因此它只是基于分区器中编写的代码将映射器的特定键引入特定的化简器中

As shuffling is the process of bringing the mapper o/p to the reducer o/p, it just brings the specific keys from the mappers to the particular reducers based on the code written in partitioner

例如映射器1的o/p为{a，1} {b，1}

eg. the o/p of mapper 1 is {a,1} {b,1}

映射器2的o/p为{a，1} {b，1}

the o/p of mapper 2 is {a,1} {b,1}

在我的分区程序中，我写了所有以'a'开头的键都将进入化简器1，而所有以'b'开头的键都将去化简器2，因此o/p为:

and in my partitioner, I have written that all keys starting with 'a' will go to reducer 1 and all keys starting with 'b will go to reducer 2 so the o/p would be:

减速器1:{a，1} {a，1}

reducer 1: {a,1}{a,1}

减速器2:{b，1} {b，1}

reducer 2: {b,1}{b,1}

可能性-B

或者与上述过程一起，它还会对键进行分组吗?

Possibility - B

Or along with he above process, does it also groups the keys:

因此，o/p为:

减速器1:{a，[1,1]}

reducer 1: {a,[1,1]}

减速器2:{b，[1,1]}

reducer 2: {b,[1,1]}

我认为应该是A，因为键的分组必须在排序后进行，因为排序仅是为了使reducer可以轻松地指出一个键结束而另一个键正在启动.如果是，请何时真正进行密钥分组.

In my opinion I think it should be A because grouping of keys must take place after sorting because sorting is only done so that reducer can easily point out when one key is ending and the other key is starting. If yes, when does grouping of keys actually happen, please elaborate.

洗牌阶段实际上是做什么的? [英] What does the shuffling phase actually do?

问题描述

可能性-B

Possibility - B

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

洗牌阶段实际上是做什么的? [英] What does the shuffling phase actually do?

问题描述

可能性-B

Possibility - B

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭