将Pyspark Dataframe列从数组转换为新列 [英] Convert Pyspark Dataframe column from array to new columns

查看：757 发布时间：2020/10/16 23:30:31 dataframe pyspark

本文介绍了将Pyspark Dataframe列从数组转换为新列的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个具有以下结构的Pyspark数据框：

I've a Pyspark Dataframe with this structure:

root
 |-- Id: string (nullable = true)
 |-- Q: array (nullable = true)
 |    |-- element: struct (containsNull = true)
 |    |    |-- pr: string (nullable = true)
 |    |    |-- qt: double (nullable = true)

类似于以下内容：

 +----+--------------------- ... --+
 | Id |           Q                |
 +----+---------------------- ... -+
 | 001| [ [pr1,1.9], [pr3,2.0]...] |
 | 002| [ [pr2,1.0], [pr9,3.9]...] |
 | 003| [ [pr2,9.0], ...         ] |
  ...

我想将Q数组转换为列（名称pr值qt ）。
另外我也想通过合并（添加）相同的列来避免重复的列。

I wold like to convert Q array into columns (name pr value qt). Also I would like to avoid duplicated columns by merging (add) same columns.

 +----+-----+-----+------+ ... ----+
 | Id | pr1 | pr2 | pr3  | ... prn |
 +----+-----+-----+------+ ... ----+
 | 001| 1.9 | 0.0 | 2.0  | ...     |
 | 002| 0.0 | 1.0 | 0    | ...     |
 | 003| 0.0 | 9.0 | ...  | ...     |
  ...

如何执行此转换？
Thakyou提前！！
Julián。

How can I perform this transformation?. Thakyou in advance!!. Julián.

推荐答案

您可以结合使用爆炸和轴：


import pyspark.sql.functions as F

# explode to get "long" format
df=df.withColumn('exploded', F.explode('Q'))

# get the name and the name in separate columns
df=df.withColumn('name', F.col('exploded').getItem(0))
df=df.withColumn('value', F.col('exploded').getItem(1))

# now pivot
df.groupby('Id').pivot('name').agg(F.max('value')).na.fill(0)


                        这篇关于将Pyspark Dataframe列从数组转换为新列的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！


                    
                        查看全文

将Pyspark Dataframe列从数组转换为新列 [英] Convert Pyspark Dataframe column from array to new columns

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

将Pyspark Dataframe列从数组转换为新列 [英] Convert Pyspark Dataframe column from array to new columns

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭