How to set pivotMaxValues in pyspark?
Problem description
I am trying to pivot a column which has more than 10000 distinct values. The default limit in Spark for the maximum number of distinct values is 10000, and I am receiving this error:
The pivot column COLUMN_NUM_2 has more than 10000 distinct values, this could indicate an error. If this was intended, set spark.sql.pivotMaxValues to at least the number of distinct values of the pivot column
How do I set this in PySpark?
Recommended answer
You have to add/set this parameter in the Spark interpreter configuration.
I am working with Zeppelin notebooks on an EMR (AWS) cluster, had the same error message as you, and it worked after I added the parameter in the interpreter settings.
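If you are not on Zeppelin, the same setting can be applied directly in PySpark code. A minimal sketch, assuming a standalone PySpark script (the app name and the limit of 20000 are arbitrary example values; `spark.sql.pivotMaxValues` is a runtime SQL conf, so it can also be changed on an existing session):

```python
from pyspark.sql import SparkSession

# Option 1: set the limit when building the session
spark = (SparkSession.builder
         .appName("pivot-example")                    # hypothetical app name
         .config("spark.sql.pivotMaxValues", "20000")  # raise the 10000 default
         .getOrCreate())

# Option 2: change it on an already-running session
spark.conf.set("spark.sql.pivotMaxValues", 20000)
```

Note that if you can enumerate the pivot values yourself, passing an explicit list to `pivot(col, values)` skips the distinct-value scan and this limit entirely.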
Hope this helps...