GCP(AI平台笔记本)上的“服务器连接错误" [英] 'Server Connection Error' on GCP (AI Platform Notebook)
问题描述
我在使用GCP和AI平台(Jupyterlab)时遇到了一些问题似乎我无法长时间与服务器保持稳定的连接.我不断收到"服务器连接错误"消息.从那里有两种可能性:
I am facing some issues with GCP and the AI Platform (Jupyterlab) It seems that I am unable to maintain a stable connection with the server for a long time. I keep getting those 'server connection error' message. From there two possibilities:
- 什么也没发生,我的手机一直在运转或
- 单元已停止运行,我可以在笔记本右上角看到状态"无内核!".每当我再次选择一个内核(python 3)时,根据运气我可以继续工作,或者该单元格将显示运行状态(其左侧带有*),但左下方的内核状态将保持不变.:已连接"(而不是忙碌").对于后者,我需要重新启动内核并再次运行所有单元,这可能会很长.
- either nothing happens and my cell keeps running or
- the cells have stopped running and I can see the status 'No Kernel!' on the top right of the notebook. Whenever I select a kernel (python 3) again, depending on my luck I can either keep working, or the cell will display the running status (with the * on the left of it) but the kernel status on the bottom left will stay on : 'connected' (instead of 'busy'). For the latter, I need to restart the kernel and run all the cells again, which can be very long.
有时候,这发生在我(重新)启动实例之后运行第一个单元时,有时会稍后.我能够在笔记本上正常工作的最长稳定时间是20、30 ish分钟,这很烦人.
Sometimes this happens as soon as I run the first cell after (re)starting the instance, sometimes a bit later. The longest stable period I was able to work on the notebook without any issue was 20, 30-ish minutes, which is quite annoying.
我的主要实例的配置:-16个CPU-60GB RAM-P100 NVIDIA GPU
Configuration of my main instance : - 16 CPUs - 60gb of RAM - A P100 NVIDIA GPU
我尝试了不同类型的实例,但仍然遇到相同的问题,家里的网络很稳定.
I have tried different types of instance and I keep having the same problem, network at home is stable.
推荐答案
I had a similar issue today: according to the google docs the cause for this is that the docker/ Jupyter service is not starting.
在我们的特定情况下无法启动这些服务的原因是磁盘已满.
The cause why these services couldn't be started in our specific case was a full disk.
这篇关于GCP(AI平台笔记本)上的“服务器连接错误"的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!