如何配置spark以使其创建"_ $ folder $"S3中的条目? [英] How can I configure spark so that it creates "_$folder$" entries in S3?

查看：56 发布时间：2021/4/3 19:11:02 scala apache-spark-sql amazon-emr

本文介绍了如何配置spark以使其创建"_ $ folder $"S3中的条目?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

当我使用

df.write
  .format("parquet")
  .mode("overwrite")
  .partitionBy("year", "month", "day", "hour", "gen", "client")
  .option("compression", "gzip")
  .save("s3://xxxx/yyyy")

我在S3中得到以下内容

I get the following in S3

year=2018
year=2019

但我想改成这个:

year=2018
year=2018_$folder$
year=2019
year=2019_$folder$

从该S3位置读取的脚本取决于 * _ $ folder $ 条目，但是我还没有找到配置spark/hadoop生成它们的方法.

The scripts that are reading from that S3 location depend on the *_$folder$ entries, but I haven't found a way to configure spark/hadoop to generate them.

关于哪种hadoop或spark配置设置可以控制 * _ $ folder $ 文件的生成的任何想法?

Any idea on what hadoop or spark configuration setting control the generation of *_$folder$ files?

如何配置spark以使其创建"_ $ folder $"S3中的条目? [英] How can I configure spark so that it creates "_$folder$" entries in S3?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

如何配置spark以使其创建"_ $ folder $"S3中的条目? [英] How can I configure spark so that it creates &quot;_$folder$&quot; entries in S3?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

如何配置spark以使其创建"_ $ folder $"S3中的条目? [英] How can I configure spark so that it creates "_$folder$" entries in S3?

登录关闭