Spark Scala中的歧义架构 [英] Ambiguous schema in Spark Scala

查看：127 发布时间：2020/9/4 1:31:38 scala apache-spark

本文介绍了Spark Scala中的歧义架构的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

模式:

|-- c0: string (nullable = true)
|-- c1: struct (nullable = true)
|    |-- c2: array (nullable = true)
|    |    |-- element: struct (containsNull = true)
|    |    |    |-- orangeID: string (nullable = true)
|    |    |    |-- orangeId: string (nullable = true)

我试图在火花中展平上面的模式.

I am trying to flatten the schema above in spark.

代码:

var df = data.select($"c0",$"c1.*").select($"c0",explode($"c2")).select($"c0",$"col.orangeID", $"col.orangeId")

展平代码工作正常.问题出在最后一部分，其中两列之间仅相差1个字母(orangeID和orangeId).因此，我收到此错误:

The flattening code is working fine. The problem is in the last part where the 2 columns differ only by 1 letter (orangeID and orangeId). Hence I am getting this error:

错误:

org.apache.spark.sql.AnalysisException: Ambiguous reference to fields StructField(orangeID,StringType,true), StructField(orangeId,StringType,true);

任何避免这种歧义的建议都是很好的.

Any suggestions to avoid this ambiguity will be great.

Spark Scala中的歧义架构 [英] Ambiguous schema in Spark Scala

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

Spark Scala中的歧义架构 [英] Ambiguous schema in Spark Scala

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭