Cannot have map type columns in DataFrame which calls set operations

Views: 21 | Published: 2021/11/14 23:08:39 | Tags: hive, pyspark, apache-spark-sql, amazon-emr

Problem description

org.apache.spark.sql.AnalysisException: Cannot have map type columns in DataFrame which calls set operations(intersect, except, etc.), but the type of column map_col is map

I have a Hive table with a column of type MAP<Float, Float> (the minimal example below uses MAP<INT,INT>). I get the above error when I try to insert into this table from a Spark context. The insert works fine without the 'distinct'.

create table test_insert2(`test_col` string, `map_col` MAP<INT,INT>) 
location 's3://mybucket/test_insert2';

insert into test_insert2 
select distinct 'a' as test_col, map(0,0) as map_col;

Recommended answer

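Try converting the DataFrame to an RDD with .rdd and then applying .distinct. RDD de-duplication compares the row objects themselves, so it does not go through the analyzer check that rejects map columns in DataFrame set operations.

Example (Scala):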
// Triple quotes are required for a string literal that spans several lines.
spark.sql("""select 'a' as test_col, map(0,0) as map_col
              union all
             select 'a' as test_col, map(0,0) as map_col""").rdd.distinct.collect

Result:
Array[org.apache.spark.sql.Row] = Array([a,Map(0 -> 0)])
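
Since the question is tagged pyspark, it may help to see the same idea in plain Python, where a map column value arrives as a dict and a dict is not hashable. The following is only a minimal sketch that runs without Spark: it de-duplicates rows by turning each map into a sorted tuple of entries before comparing. The helper names (`_freeze`, `dedupe_rows`) and the sample rows are hypothetical, and the sketch assumes the map keys can be ordered. It illustrates the idea of comparing whole row values and is not how Spark implements distinct.

```python
from typing import Any, Hashable, Iterable, List, Tuple

Row = Tuple[Any, ...]


def _freeze(value: Any) -> Hashable:
    """Return a hashable stand-in for value; a dict becomes a sorted tuple of entries."""
    if isinstance(value, dict):
        # Sorting makes the result independent of the dict's insertion order.
        return tuple(sorted((k, _freeze(v)) for k, v in value.items()))
    return value


def dedupe_rows(rows: Iterable[Row]) -> List[Row]:
    """Drop duplicate rows, keeping the first occurrence of each."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(_freeze(v) for v in row)
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


# Hypothetical sample data shaped like the (test_col, map_col) rows in the question.
rows = [("a", {0: 0}), ("a", {0: 0}), ("a", {0: 1})]
print(dedupe_rows(rows))
```

The original rows are returned unchanged; the frozen form is used only as the comparison key.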

