Spark数据框的分区数 [英] Number of Partitions of Spark Dataframe

查看：86 发布时间：2020/9/4 5:54:43 apache-spark dataframe apache-spark-sql

本文介绍了Spark数据框的分区数的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

谁能解释一下将为Spark Dataframe创建的分区数量.

Can anyone explain about the number of partitions that will be created for a Spark Dataframe.

我知道对于RDD，在创建它时，我们可以提及如下所示的分区数量.

I know that for a RDD, while creating it we can mention the number of partitions like below.

val RDD1 = sc.textFile("path" , 6)

但是对于Spark数据帧，在创建时看起来像我们没有选择指定RDD分区数的选项.

But for Spark dataframe while creating looks like we do not have option to specify number of partitions like for RDD.

我认为只有这种可能性，在创建数据框之后，我们可以使用重新分区API.

Only possibility i think is, after creating dataframe we can use repartition API.

df.repartition(4)

所以任何人都可以让我知道是否可以在创建数据帧时指定分区数.

So can anyone please let me know if we can specify the number of partitions while creating a dataframe.