spark access first n rows - take vs limit


Problem Description

I want to access the first 100 rows of a Spark DataFrame and write the result back to a CSV file.

Why is take(100) basically instant, whereas

df.limit(100)
      .repartition(1)
      .write
      .mode(SaveMode.Overwrite)
      .option("header", true)
      .option("delimiter", ";")
      .csv("myPath")

takes forever. I do not want to obtain the first 100 records of every partition, just any 100 records.

Recommended Answer

This is because predicate pushdown is currently not supported in Spark, see this very good answer.

Actually, take(n) should also take a long time. However, I just tested it and got the same results as you: take is almost instantaneous regardless of database size, while limit takes a lot of time.
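If the goal is simply to get any 100 rows into a single CSV, one common workaround is to let take(100) collect the rows on the driver and then rebuild a small DataFrame before writing. This is a sketch, not part of the original answer; it assumes an existing SparkSession named spark and the DataFrame df from the question, and it is only reasonable because 100 rows easily fit in driver memory:

```scala
import org.apache.spark.sql.SaveMode

// take(100) launches incremental jobs and stops as soon as it has 100 rows,
// instead of evaluating limit across the whole plan.
val first100 = df.take(100) // Array[Row] on the driver

// Rebuild a tiny DataFrame from the collected rows, reusing the original schema.
val small = spark.createDataFrame(
  spark.sparkContext.parallelize(first100.toSeq),
  df.schema
)

// A single small partition, so the write produces one CSV part file.
small.coalesce(1)
  .write
  .mode(SaveMode.Overwrite)
  .option("header", true)
  .option("delimiter", ";")
  .csv("myPath")
```

This trades a driver-side collect for the slow distributed limit, so it only makes sense for small n.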

