Why are we configuring mapred.job.tracker in YARN?
Question
What I know is that YARN was introduced and that it replaced the JobTracker and TaskTracker.
I have seen some Hadoop 2.6.0/2.7.0 installation tutorials that configure mapreduce.framework.name as yarn and the mapred.job.tracker property as local or host:port.
The description of the mapred.job.tracker property is:
"The host and port that the MapReduce job tracker runs at. If "local", then jobs are run in-process as a single map and reduce task."
My doubt is: why are we configuring it if we are using YARN? I mean, the JobTracker shouldn't be running, right?
Forgive me if my question is dumb.
These are the tutorials I was talking about:
http://chaalpritam.blogspot.in/2015/01/hadoop-260-multi-node-cluster-setup-on.html
http://pingax.com/install-apache-hadoop-ubuntu-cluster-setup/
https://chawlasumit.wordpress.com/2015/03/09/install-a-multi-node-hadoop-cluster-on-ubuntu-14-04/
Recommended answer
This is just a guess, but either the tutorials that configure the JobTracker under YARN were written by people who don't know what YARN is, or they set it in case you decide to stop using YARN someday. You are right: the JobTracker and TaskTracker do not exist in YARN. You can add the properties if you want, but they will be ignored. YARN added new properties for each of the components that replace the JobTracker and the TaskTracker, such as yarn.resourcemanager.address, which takes the place of the JobTracker address (mapred.job.tracker, renamed mapreduce.jobtracker.address in Hadoop 2).
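As a rough illustration of the point above, a YARN setup should only need the YARN-side properties, with no JobTracker address at all. The fragments below are a minimal sketch, not a complete cluster configuration: the host name rm.example.com is a placeholder assumption, and 8032 is assumed to be the ResourceManager's default client port.

```xml
<?xml version="1.0"?>
<!-- mapred-site.xml: tell MapReduce to submit jobs to YARN -->
<configuration>
  <property>
    <name>mapreduce.framework.name</name>
    <value>yarn</value>
  </property>
  <!-- No mapred.job.tracker entry: there is no JobTracker to point at under YARN -->
</configuration>
```

```xml
<?xml version="1.0"?>
<!-- yarn-site.xml: where clients find the ResourceManager -->
<configuration>
  <property>
    <name>yarn.resourcemanager.address</name>
    <!-- rm.example.com is a hypothetical host name; 8032 is assumed to be the default port -->
    <value>rm.example.com:8032</value>
  </property>
  <property>
    <!-- Auxiliary shuffle service that MapReduce jobs need on each NodeManager -->
    <name>yarn.nodemanager.aux-services</name>
    <value>mapreduce_shuffle</value>
  </property>
</configuration>
```

If a tutorial's mapred-site.xml also carries a mapred.job.tracker entry, removing it should not change how jobs run while mapreduce.framework.name is yarn.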
If you list your Java processes when running Hadoop under YARN, you see no JobTracker or TaskTracker:
10561 Jps
20605 NameNode
17176 DataNode
18521 ResourceManager
19625 NodeManager
18424 JobHistoryServer
You can read about how YARN works here.