Apache Spark does not delete temporary directories


Problem description

After a Spark program completes, three temporary directories remain in the temp directory. The directory names are like this: spark-2e389487-40cc-4a82-a5c7-353c0feefbb7

The directories are empty.

And when the Spark program runs on Windows, a snappy DLL file also remains in the temp directory. The file name is like this: snappy-1.0.4.1-6e117df4-97b6-4d69-bf9d-71c4a627940c-snappyjava

They are created every time the Spark program runs. So the number of files and directories keeps growing.

How can they be deleted?

Spark version is 1.3.1 with Hadoop 2.6.

Update

I've traced the spark source code.

The module methods that create the 3 'temp' directories are as follows:

  • DiskBlockManager.createLocalDirs
  • HttpFileServer.initialize
  • SparkEnv.sparkFilesDir

They (eventually) call Utils.getOrCreateLocalRootDirs and then Utils.createDirectory, which intentionally does NOT mark the directory for automatic deletion.

The comment of createDirectory method says: "The directory is guaranteed to be newly created, and is not marked for automatic deletion."
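
For reference, the behaviour described above boils down to something like the following simplified sketch (illustrative only, not the actual Spark 1.3.1 source; the parameter names are assumed): a uniquely named directory is created, but nothing registers it for deletion when the JVM exits.

```scala
import java.io.File
import java.util.UUID

// Simplified sketch of the createDirectory logic described above
// (illustrative only; not the real Spark source).
def createDirectory(root: String, namePrefix: String = "spark"): File = {
  var dir: File = null
  var attempts = 0
  while (dir == null && attempts < 10) {
    attempts += 1
    val candidate = new File(root, s"$namePrefix-${UUID.randomUUID()}")
    if (candidate.mkdirs()) {
      dir = candidate // created, but NOT marked for automatic deletion
    }
  }
  if (dir == null) {
    throw new java.io.IOException(s"Failed to create a temp directory under $root")
  }
  dir
}
```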

I don't know why they are not marked. Is this really intentional?

Answer

Three SPARK_WORKER_OPTS exist to support worker application folder cleanup, copied here for further reference from the Spark documentation:

  • spark.worker.cleanup.enabled, default value is false. Enables periodic cleanup of worker/application directories. Note that this only affects standalone mode, as YARN works differently. Only the directories of stopped applications are cleaned up.

  • spark.worker.cleanup.interval, default is 1800, i.e. 30 minutes. Controls the interval, in seconds, at which the worker cleans up old application work dirs on the local machine.

  • spark.worker.cleanup.appDataTtl, default is 7*24*3600 (7 days). The number of seconds to retain application work directories on each worker. This is a Time To Live and should depend on the amount of available disk space you have. Application logs and jars are downloaded to each application work dir. Over time, the work dirs can quickly fill up disk space, especially if you run jobs very frequently.
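
In standalone mode these properties are passed to the worker daemons, typically via SPARK_WORKER_OPTS in conf/spark-env.sh. A minimal sketch, assuming that mechanism (the interval and TTL values are just the documented defaults; 604800 seconds is 7 days):

```
# conf/spark-env.sh on each standalone worker (sketch; adjust values as needed)
SPARK_WORKER_OPTS="-Dspark.worker.cleanup.enabled=true \
 -Dspark.worker.cleanup.interval=1800 \
 -Dspark.worker.cleanup.appDataTtl=604800"
```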
