MapReduce 排序算法是如何工作的? [英] How does the MapReduce sort algorithm work?

查看：29 发布时间：2021/12/15 18:36:41 algorithm sorting parallel-processing hadoop mapreduce

本文介绍了MapReduce 排序算法是如何工作的?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

用于展示 MapReduce 功能的主要示例之一是 Terasort 基准测试.我无法理解 MapReduce 环境中使用的排序算法的基础知识.

One of the main examples that is used in demonstrating the power of MapReduce is the Terasort benchmark. I'm having trouble understanding the basics of the sorting algorithm used in the MapReduce environment.

对我来说，排序只是确定一个元素相对于所有其他元素的相对位置.所以排序涉及将一切"与一切"进行比较.您的平均排序算法(快速、冒泡、...)只是以一种聪明的方式做到了这一点.

To me sorting simply involves determining the relative position of an element in relationship to all other elements. So sorting involves comparing "everything" with "everything". Your average sorting algorithm (quick, bubble, ...) simply does this in a smart way.

在我看来，将数据集分成许多部分意味着您可以对单个部分进行排序，然后您仍然必须将这些部分整合到完整"的完全排序的数据集中.鉴于分布在数千个系统上的 TB 级数据集，我预计这是一项艰巨的任务.

In my mind splitting the dataset into many pieces means you can sort a single piece and then you still have to integrate these pieces into the 'complete' fully sorted dataset. Given the terabyte dataset distributed over thousands of systems I expect this to be a huge task.

那么这到底是怎么做的呢?这个 MapReduce 排序算法是如何工作的?

So how is this really done? How does this MapReduce sorting algorithm work?

谢谢你帮助我理解.

MapReduce 排序算法是如何工作的? [英] How does the MapReduce sort algorithm work?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

MapReduce 排序算法是如何工作的? [英] How does the MapReduce sort algorithm work?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭