YARN UNHEALTHY节点 [英] YARN UNHEALTHY nodes

查看:348
本文介绍了YARN UNHEALTHY节点的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

在我们的YARN集群中,有80%已满,我们看到一些纱线节点管理员被标记为不健康。在挖掘日志之后,我发现它是因为磁盘空间占用了数据目录的90%的空间。出现以下错误:

  2015-02-21 08:33:51,590 INFO org.apache.hadoop.yarn.server.resourcemanager。 rmnode.RMNodeImpl:节点hdp009.abc.com:8041报告不健康与详细信息:4/4本地目录很差:/ data3 / yarn / nm,/ data2 / yarn / nm,/ data4 / yarn / nm,/ data1 /纱/纳米; 
2015-02-21 08:33:51,590 INFO org.apache.hadoop.yarn.server.resourcemanager.rmnode.RMNodeImpl:hdp009.abc.com:8041节点从RUNNING转换为UNHEALTHY
code>

我试图理解纱线标记节点是不是健康&有什么方法可以改变门槛吗?

谢谢 尝试添加属性 yarn.nodemanager.disk-health-checker.max-disk-utilization-per-disk-percentage 到yarn-site.xml。此属性指定磁盘被标记为坏之后所允许的最大磁盘空间利用率百分比。值可以从0.0到100.0。



yarn-default.xml
$ b

force 健康状态$例如:b

 <?xml version =1.0?> 
<配置>
<属性>
< name> yarn.nodemanager.disk-health-checker.min-healthy-disks< / name>
<值> 0.0< /值>
< / property>
<属性>
< name> yarn.nodemanager.disk-health-checker.max -disk-utilization-per-disk-percentage< / name>
<值> 100.0< /值>
< / property>
< / configuration>


In our YARN cluster which is 80% full, we are seeing some of the yarn nodemanager's are marked as UNHEALTHY. after digging into logs I found its because disk space is 90% full for data dir. With following error

2015-02-21 08:33:51,590 INFO org.apache.hadoop.yarn.server.resourcemanager.rmnode.RMNodeImpl: Node hdp009.abc.com:8041 reported UNHEALTHY with details: 4/4 local-dirs are bad: /data3/yarn/nm,/data2/yarn/nm,/data4/yarn/nm,/data1/yarn/nm;
2015-02-21 08:33:51,590 INFO org.apache.hadoop.yarn.server.resourcemanager.rmnode.RMNodeImpl: hdp009.abc.com:8041 Node Transitioned from RUNNING to UNHEALTHY

I am trying to understand how yarn marks node Unhealthy & is there any way to change the threshold ?

Thanks

解决方案

try adding the property yarn.nodemanager.disk-health-checker.max-disk-utilization-per-disk-percentage to yarn-site.xml. This property specifies the maximum percentage of disk space utilization allowed after which a disk is marked as bad. Values can range from 0.0 to 100.0.

yarn-default.xml

force to health state e.g.:

<?xml version="1.0"?>
<configuration>    
  <property>
     <name>yarn.nodemanager.disk-health-checker.min-healthy-disks</name>
     <value>0.0</value>
  </property>
  <property>
     <name>yarn.nodemanager.disk-health-checker.max-disk-utilization-per-disk-percentage</name>
     <value>100.0</value>
  </property>
</configuration>

这篇关于YARN UNHEALTHY节点的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆