MapReduce替代品 [英] MapReduce alternatives
问题描述
例如,查看批量同步并行。 Map / Reduce实际上是一个非常有限的减少问题的方式,但是限制使得它在像Hadoop这样的框架中可以管理。问题是如果您的问题进入地图/缩小设置不太麻烦,或者更容易创建特定于域的并行化方案,并且必须自行处理所有实施细节。实际上,猪只是Hadoop之上的一个抽象层,它将许多标准的问题转换从不是Map-Reduce-y自动化到Map-Reduce兼容。
编辑26.1.13:在这里找到一个很好的最新概述/ a>
Are there any alternative paradigms to MapReduce (Google, Hadoop)? Is there any other reasonable way how to split & merge big problems?
Definitively. Check out, for example, Bulk Synchronous Parallel. Map/Reduce is in fact a very restricted way of reducing problems, however that restriction makes it manageable in a framework like Hadoop. The question is if it is less trouble to press your problem into a Map/Reduce setting, or if its easier to create a domain-specific parallelization scheme and having to take care of all the implementation details yourself. Pig, in fact, is only an abstraction layer on top of Hadoop which automates many standard problem transformations from not-Map-Reduce-y to Map-Reduce-compatible.
Edit 26.1.13: Found a nice up-to-date overview here
这篇关于MapReduce替代品的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!