蜂巢实木复合地板活泼压缩不起作用 [英] Hive parquet snappy compression not working
问题描述
我正在使用表属性创建一个表骨架 TBLPROPERTIES('PARQUET.COMPRESSION'='SNAPPY')
(因为文件为镶木地板格式),并在创建表之前将一些参数设置为:
I am creating one table skeleton using the table properties as
TBLPROPERTIES('PARQUET.COMPRESSION'='SNAPPY')
(as the files are in parquet format) and setting few of the parameters before creating the table as :
set hive.exec.dynamic.partition.mode=nonstrict;
set parquet.enable.dictionary=false;
set hive.plan.serialization.format=javaXML;
SET hive.exec.compress.output=true;
SET mapred.output.compression.type=BLOCK;
set avro.output.codec=snappy;
SET mapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec;
add jar /opt/cloudera/parcels/CDH-5.5.1-1.cdh5.5.1.p1168.923/lib/sentry/lib/hive-metastore.jar;
仍然没有压缩表.您能告诉我表格未压缩的原因吗?
Still the table is not getting compressed. Could you please let me know the reason for table not getting compressed.
谢谢您的输入.
推荐答案
解决方案在DDL,而不是"TBLPROPERTIES('PARQUET.COMPRESSION'='SNAPPY')".
The solution is using "TBLPROPERTIES ('parquet.compression'='SNAPPY')"
(and the case matters) in the DDL instead of "TBLPROPERTIES ('PARQUET.COMPRESSION'='SNAPPY')".
您还可以在配置单元中使用以下属性来实现压缩.
You can also achieve the compression using the following property in the hive.
set parquet.compression=SNAPPY
这篇关于蜂巢实木复合地板活泼压缩不起作用的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!