Container launched with nvidia-docker-compose exits soon after starting
Problem description
My docker-compose.yml file:
version: '2'
services:
  zl:
    image: zl/caffe-torch-gpu:12.27
    ports:
      - "8801:8888"
      - "6001:6008"
    devices:
      - /dev/nvidia0
    volumes:
      - ~/dl-data:/root/dl-data
After running nvidia-docker-compose up -d, the container launched but exited soon afterwards.
But when I launch a container with nvidia-docker directly, it works well:
nvidia-docker run -itd -p 6008:6006 -p 8808:8888 -v `pwd`:/root/dl-data --name zl_test
Recommended answer
You don't have to use nvidia-docker-compose. If the nvidia-docker plugin is configured correctly, you can just use docker-compose.
The following steps come from the nvidia-docker git repo (I can confirm they work for me):
Step 1
Figure out the NVIDIA driver version (it matters). Run:
nvidia-smi
Output:
+---------------------------------------------------------------+
| NVIDIA-SMI 367.57                 Driver Version: 367.57      |
|-------------------------------+--------+----------------------+
Step 2
Create a Docker volume that uses the nvidia-docker plugin. This must be done outside of compose:
docker volume create --name=nvidia_driver_367.57 -d nvidia-docker
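The volume name has to match the installed driver version, so it can help to derive it from the nvidia-smi output instead of typing it by hand. The following is only a minimal sketch under that assumption: the function names (driver_version_from_smi, volume_name_for, volume_create_command) are hypothetical helpers, not part of nvidia-docker, and the script only prints the command rather than running it.

```shell
#!/bin/sh
# Hypothetical helpers (not part of nvidia-docker): derive the volume
# name from the driver version reported by nvidia-smi.

# Read nvidia-smi output on stdin and print the first driver version found.
driver_version_from_smi() {
    sed -n 's/.*Driver Version: *\([0-9][0-9.]*\).*/\1/p' | head -n 1
}

# Build the volume name used in this answer: nvidia_driver_<version>.
volume_name_for() {
    printf 'nvidia_driver_%s\n' "$1"
}

# Print (do not run) the volume-creation command for a given version.
volume_create_command() {
    printf 'docker volume create --name=%s -d nvidia-docker\n' "$(volume_name_for "$1")"
}
```

On a GPU host, something like volume_create_command "$(nvidia-smi | driver_version_from_smi)" should print the command shown above with the local driver version filled in; check the printed line before running it.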
Step 3
In the docker-compose.yml file:
version: '2'
volumes:
  nvidia_driver_367.57: # same name as one created above
    external: true # this will use the volume we created above
services:
  cuda:
    command: nvidia-smi
    devices: # this is required
      - /dev/nvidiactl
      - /dev/nvidia-uvm
      - /dev/nvidia0 # in general: /dev/nvidia# where # depends on which gpu card is wanted to be used
    image: nvidia/cuda
    volumes:
      - nvidia_driver_367.57:/usr/local/nvidia/:ro
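
Because the /dev/nvidia# entry depends on which GPU card is wanted, the devices list changes from machine to machine. As a rough sketch only, a small helper can print that list for a chosen set of GPU indices. The function name compose_devices is hypothetical, and the indentation assumes the entries sit under a service as in the file above.

```shell
#!/bin/sh
# Hypothetical helper: print the compose "devices:" entries for the
# given GPU indices. /dev/nvidiactl and /dev/nvidia-uvm are always
# listed, matching the compose file in this answer.
compose_devices() {
    printf '    devices:\n'
    printf '      - /dev/nvidiactl\n'
    printf '      - /dev/nvidia-uvm\n'
    for idx in "$@"; do
        printf '      - /dev/nvidia%s\n' "$idx"
    done
}
```

For example, compose_devices 0 1 prints entries for /dev/nvidia0 and /dev/nvidia1, which can be pasted under the service definition.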