What exactly does spark.sql.shuffle.partitions refer to?

Views: 58 Published: 2021/11/14 22:12:11 Tags: apache-spark apache-spark-sql

Problem description

What exactly does spark.sql.shuffle.partitions refer to? Is it the number of partitions that results from a wide transformation, or something that happens in between, such as an intermediate partitioning step performed before the result partitions of the wide transformation are produced?

Because, as I understand it, a wide transformation looks like this:

Parent RDDs -> shuffle files -> child RDDs

What does the spark.sql.shuffle.partitions parameter refer to here: the shuffle files, the child RDDs, or something else that I have overlooked?
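
For illustration only, the parent -> shuffle files -> child pipeline described in the question can be modeled in plain Python. This is a toy sketch, not Spark code: `toy_shuffle` and its parameters are hypothetical names, and it assumes a simple hash partitioner with a sum aggregation. In this model the target partition count (the role spark.sql.shuffle.partitions plays for DataFrame and SQL shuffles) fixes both how many buckets each map-side task writes and how many child partitions exist afterwards.

```python
from collections import defaultdict


def toy_shuffle(parent_partitions, num_shuffle_partitions):
    """Toy model of a hash shuffle followed by a per-key sum.

    Hypothetical helper for illustration; not Spark's implementation.
    """
    # Map side: every parent partition splits its rows into one bucket
    # per target partition (a stand-in for the shuffle files).
    shuffle_files = []
    for rows in parent_partitions:
        buckets = [[] for _ in range(num_shuffle_partitions)]
        for key, value in rows:
            buckets[hash(key) % num_shuffle_partitions].append((key, value))
        shuffle_files.append(buckets)

    # Reduce side: child partition i reads bucket i from every map output,
    # so all rows with the same key end up in the same child partition.
    child_partitions = []
    for i in range(num_shuffle_partitions):
        merged = defaultdict(int)
        for buckets in shuffle_files:
            for key, value in buckets[i]:
                merged[key] += value
        child_partitions.append(dict(merged))

    return shuffle_files, child_partitions


# Three parent partitions, shuffled into four child partitions.
parents = [[(1, 10), (2, 20)], [(1, 5), (3, 7)], [(2, 1), (4, 4)]]
shuffle_files, children = toy_shuffle(parents, num_shuffle_partitions=4)

print(len(shuffle_files))  # one map output per parent partition
print(len(children))       # one child partition per shuffle partition
print(children)
```

Real Spark stores map output differently, and adaptive query execution may coalesce post-shuffle partitions at runtime, so treat this strictly as a mental model of which side of the shuffle the count applies to.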
