Enable GPU resources (CUDA) on DC/OS
Question
I have a cluster with NVIDIA GPU nodes running DC/OS 1.8. I would like to schedule jobs (batch and Spark) on the GPU nodes using GPU isolation. DC/OS 1.8 is based on Mesos 1.0.1, which supports GPU isolation.
Recommended answer
Unfortunately, DC/OS 1.8 doesn't officially support GPUs. Experimental GPU support is coming in the next release, as mentioned here: https://github.com/dcos/dcos/pull/766
In that next release, only Marathon will officially be able to launch GPU services; Metronome (i.e. batch jobs) will not.
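For illustration, a Marathon app that requests a GPU might look roughly like the sketch below. It assumes a DC/OS release in which Marathon's GPU support is enabled and relies on the `gpus` field of the Marathon app definition. The app id, command, and resource sizes are made-up placeholders, not values from the pull request.

```python
import json


def build_gpu_app(app_id, gpus=1):
    """Build a Marathon app definition (as a dict) that requests GPUs.

    Assumption: the target Marathon accepts a top-level "gpus" field.
    No container section is given, so the task would run under the
    default Mesos containerizer rather than the Docker containerizer.
    """
    return {
        "id": app_id,          # hypothetical app id
        "cmd": "nvidia-smi",   # placeholder command; assumes NVIDIA tools on the agent
        "cpus": 1,
        "mem": 128,
        "gpus": gpus,          # number of GPUs requested for each instance
        "instances": 1,
    }


# Print the definition as JSON so it can be saved to a file.
print(json.dumps(build_gpu_app("/gpu-test"), indent=2))
```

The printed JSON could then be saved to a file and submitted with `dcos marathon app add <file>`, assuming the cluster actually advertises GPU resources.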
Regarding Spark, the Spark version bundled with Universe probably doesn't have GPU support for Mesos built in yet. Spark itself has it coming soon, though: https://github.com/apache/spark/pull/14644