为什么Spark作业会因“打开的文件太多"而失败? [英] Why does Spark job fail with "too many open files"?

查看：350 发布时间：2020/9/4 0:41:40 apache-spark

本文介绍了为什么Spark作业会因“打开的文件太多"而失败?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

在我的Spark作业的洗牌阶段，我得到打开的文件太多".为什么我的工作打开这么多文件?我可以采取什么步骤来使我的工作取得成功.

I get "too many open files" during the shuffle phase of my Spark job. Why is my job opening so many files? What steps can I take to try to make my job succeed.

The best way is definitely just to increase the ulimit if possible, this is sort of an assumption we make in Spark that clusters will be able to move it around.

您也许可以通过减少减速器(或每个节点使用的核心)，但这可能会对您的性能产生一些影响工作.

You might be able to hack around this by decreasing the number of reducers [or cores used by each node] but this could have some performance implications for your job.

通常，如果集群中的节点具有C分配的核心，并且您运行使用X减速器的作业，Spark将并行打开C * X文件，然后动笔.洗牌合并将有助于减少总数创建的文件数，但在任何位置打开的文件句柄数时间不会改变，因此不会解决ulimit问题.

In general if a node in your cluster has C assigned cores and you run a job with X reducers then Spark will open C*X files in parallel and start writing. Shuffle consolidation will help decrease the total number of files created but the number of file handles open at any time doesn't change so it won't help the ulimit problem.

-帕特里克·温德尔

这篇关于为什么Spark作业会因“打开的文件太多"而失败?的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

为什么Spark作业会因“打开的文件太多"而失败? [英] Why does Spark job fail with "too many open files"?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

为什么Spark作业会因“打开的文件太多"而失败? [英] Why does Spark job fail with &quot;too many open files&quot;?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

为什么Spark作业会因“打开的文件太多"而失败? [英] Why does Spark job fail with "too many open files"?

登录关闭