Hadoop gen1与Hadoop gen2 [英] Hadoop gen1 vs Hadoop gen2
问题描述
Hadoop-1.x中的守护进程为 namenode,datanode,jobtracker,taskracker和secondarynamenode
$ b Hadoop-2.x中的守护进程
namenode,datanode,resourcemanager,applicationmaster,secondarynamenode。
这意味着 up:resourcemanager和applicationmaster
那么 tasktracker
?
在YARN(Hadoop 2中的新执行框架)中,MapReduce并不像以前那样存在。 b
$ b YARN是在集群上分配资源的更通用的方法。 ResourceManager,ApplicationMaster和NodeManager现在由新的YARN执行框架组成。 NodeManager是每个节点上的守护进程,所以我想你可以说取代了TaskTracker。但现在它只提供进程,而不仅仅是map任务和减少任务。
MapReduce仍然存在,但它现在是YARN的应用程序。
以下是对YARN的介绍,它将深入到更深层次: http://hortonworks.com/blog/introducing-apache-hadoop-yarn/
I am a bit confused about place of tasktracker in Hadoop-2.x.
Daemons in Hadoop-1.x are namenode, datanode, jobtracker, taskracker and secondarynamenode
Daemons in Hadoop-2.x are namenode, datanode, resourcemanager, applicationmaster, secondarynamenode.
This means Jobtracker has split up into: resourcemanager and applicationmaster
So where is tasktracker
?
解决方案 In YARN (the new execution framework in Hadoop 2), MapReduce doesn't exist in the way it did before.
YARN is a more general purpose way to allocate resources on the cluster. ResourceManager, ApplicationMaster, and NodeManager now consist of the new YARN execution framework. The NodeManager is the daemon on every node, so I guess you could say that replaced the TaskTracker. But now it just gives processes instead of just map tasks and reduce tasks.
MapReduce is still there, but it is now an "application" of YARN.
Here is an introduction to YARN, which will go into much more depth: http://hortonworks.com/blog/introducing-apache-hadoop-yarn/
这篇关于Hadoop gen1与Hadoop gen2的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!