将Apache Spark中的所有资源与Yarn一起使用 [英] Using all resources in Apache Spark with Yarn

查看：97 发布时间：2021/4/8 20:07:28 apache-spark yarn

本文介绍了将Apache Spark中的所有资源与Yarn一起使用的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在将Apache Spark与Yarn客户端一起使用.我的Spark集群中有4台工作PC，每台PC都有8个vcpus和30 GB的内存.我将执行程序的内存设置为2G，将实例数设置为33.我的工作需要10个小时才能运行，所有机器都闲置了80％.

I am using Apache Spark with Yarn client. I have 4 worker PCs with 8 vcpus each and 30 GB of ram in my spark cluster. Im set my executor memory to 2G and number of instances to 33. My job is taking 10 hours to run and all machines are about 80% idle.

我不了解执行程序内存和执行程序实例之间的相关性.每个Vcpu是否应该有一个实例?我应该将执行程序的内存设置为每台机器上的机器/#executor的内存吗?

I dont understand the correlation between executor memory and executor instances. Should I have an instance per Vcpu? Should I set the executor memory to be memory of machine/#executors per machine?

将Apache Spark中的所有资源与Yarn一起使用 [英] Using all resources in Apache Spark with Yarn

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

将Apache Spark中的所有资源与Yarn一起使用 [英] Using all resources in Apache Spark with Yarn

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭