如何配置 spark 以创建“_$folder$"?S3 中的条目? [英] How can I configure spark so that it creates "_$folder$" entries in S3?

查看：22 发布时间：2021/11/14 23:16:29 scala apache-spark-sql amazon-emr

本文介绍了如何配置 spark 以创建“_$folder$"?S3 中的条目?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

当我使用

df.write
  .format("parquet")
  .mode("overwrite")
  .partitionBy("year", "month", "day", "hour", "gen", "client")
  .option("compression", "gzip")
  .save("s3://xxxx/yyyy")

我在 S3 中得到以下内容

I get the following in S3

year=2018
year=2019

但我想要这个:

year=2018
year=2018_$folder$
year=2019
year=2019_$folder$

从该 S3 位置读取的脚本取决于 *_$folder$ 条目，但我还没有找到配置 spark/hadoop 来生成它们的方法.

The scripts that are reading from that S3 location depend on the *_$folder$ entries, but I haven't found a way to configure spark/hadoop to generate them.

知道什么 hadoop 或 spark 配置设置控制 *_$folder$ 文件的生成吗?

Any idea on what hadoop or spark configuration setting control the generation of *_$folder$ files?

如何配置 spark 以创建“_$folder$"?S3 中的条目? [英] How can I configure spark so that it creates "_$folder$" entries in S3?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

如何配置 spark 以创建“_$folder$"?S3 中的条目? [英] How can I configure spark so that it creates &quot;_$folder$&quot; entries in S3?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

如何配置 spark 以创建“_$folder$"?S3 中的条目? [英] How can I configure spark so that it creates "_$folder$" entries in S3?

登录关闭