Spark on Google's Dataproc failed due to java.io.FileNotFoundException: /hadoop/yarn/nm-local-dir/usercache/root/appcache/


Problem description

I've been using Spark/Hadoop on Dataproc for months, both via Zeppelin and the Dataproc console, but just recently I got the following error.

Caused by: java.io.FileNotFoundException: /hadoop/yarn/nm-local-dir/usercache/root/appcache/application_1530998908050_0001/blockmgr-9d6a2308-0d52-40f5-8ef3-0abce2083a9c/21/temp_shuffle_3f65e1ca-ba48-4cb0-a2ae-7a81dcdcf466 (No such file or directory)
at java.io.FileOutputStream.open0(Native Method)
at java.io.FileOutputStream.open(FileOutputStream.java:270)
at java.io.FileOutputStream.<init>(FileOutputStream.java:213)
at org.apache.spark.storage.DiskBlockObjectWriter.initialize(DiskBlockObjectWriter.scala:103)
at org.apache.spark.storage.DiskBlockObjectWriter.open(DiskBlockObjectWriter.scala:116)
at org.apache.spark.storage.DiskBlockObjectWriter.write(DiskBlockObjectWriter.scala:237)
at org.apache.spark.shuffle.sort.BypassMergeSortShuffleWriter.write(BypassMergeSortShuffleWriter.java:151)
at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:96)
at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:53)
at org.apache.spark.scheduler.Task.run(Task.scala:108)
at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:338)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)

First, I got this type of error in a Zeppelin notebook and thought it was a Zeppelin issue. The error, however, seems to occur randomly. I suspect it has something to do with one of the Spark workers not being able to write to that path. So I googled and was advised to delete the files under /hadoop/yarn/nm-local-dir/usercache/ on each Spark worker and to check that there is disk space available on each worker. After doing so, I still sometimes got this error. I also ran a Spark job directly on Dataproc, and a similar error occurred. I'm on Dataproc image version 1.2.
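For concreteness, the disk-space check mentioned above can be scripted roughly as follows (a minimal Python 3 sketch to run on each worker node; the path and the 1 GiB threshold are illustrative assumptions, not values taken from the cluster):

# Minimal sketch: report free space under the YARN NodeManager local dir on a worker.
# Requires Python 3.3+ for shutil.disk_usage; path and threshold are illustrative.
import shutil

NM_LOCAL_DIR = "/hadoop/yarn/nm-local-dir"
MIN_FREE_BYTES = 1 << 30  # 1 GiB, arbitrary warning threshold

usage = shutil.disk_usage(NM_LOCAL_DIR)
print("total=%d used=%d free=%d" % (usage.total, usage.used, usage.free))
if usage.free < MIN_FREE_BYTES:
    print("Warning: low disk space under %s" % NM_LOCAL_DIR)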

Thanks,

Peeranat F.

Recommended answer

OK. We faced the same issue on GCP, and the reason for it is resource preemption.

In GCP, resource preemption can be done by one of the following two strategies:

  1. Node preemption - removing nodes in the cluster and replacing them
  2. Container preemption - removing YARN containers.

This setting is made in GCP by your admin/DevOps person to optimize the cost and resource utilization of the cluster, especially if it is being shared.

What your stack trace tells me is that it's node preemption. This error occurs randomly because sometimes the node that gets preempted is your driver node, which causes the app to fail altogether.

You can see which nodes are preemptable in your GCP console.
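If the cluster does use preemptible workers, a generic mitigation is to raise Spark's retry tolerance so that a lost container does not immediately fail the job. The sketch below is only an illustration and goes beyond what the answer itself prescribes: it assumes Spark on YARN, the property names are standard Spark/YARN settings, the values are arbitrary, and none of this helps when the driver node itself is preempted.

# Sketch: generic retry settings that make a job more tolerant of lost
# containers (e.g. preempted YARN nodes). Values are illustrative only.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("preemption-tolerant-job")
    # Retry an individual task more times before failing its stage (default is 4).
    .config("spark.task.maxFailures", "8")
    # Allow YARN to re-attempt the application if the application master is lost.
    .config("spark.yarn.maxAppAttempts", "4")
    .getOrCreate()
)

The same properties can also be passed to spark-submit with --conf instead of being set in the session builder.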
