用多字符分隔符创建HIVE表 [英] Create HIVE Table with multi character delimiter

查看:461
本文介绍了用多字符分隔符创建HIVE表的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我想创建一个多字符串字符作为分隔符的HIVE表,如

  CREATE EXTERNAL TABlE tableex(id INT ,name STRING)
行格式分隔的字段以','结尾
行终止'\\\
'作为TEXTFILE位置'/ user / myusername'存储;

我希望将分隔符作为〜*之类的多字符串。

解决方案

FILELDS TERMINATED BY 不支持多字符分隔符。最简单的方法是使用 RegexSerDe

  CREATE EXTERNAL TABlE tableex (id INT,name STRING)
ROW FORMAT'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
WITH SERDEPROPERTIES(
input.regex=^(\ \d +)〜\\ *(。*)$

存为文本文件
LOCATION'/ user / myusername';


I want to create a HIVE Table with multi string character as a delimiter such as

CREATE EXTERNAL TABlE tableex(id INT, name STRING) 
ROW FORMAT delimited fields terminated by ','
LINES TERMINATED BY '\n' STORED AS TEXTFILE LOCATION '/user/myusername';

I want to have delimiter as a multi string like "~*".

解决方案

FILELDS TERMINATED BY does not support multi-character delimiters. The easiest way to do this is to use RegexSerDe:

CREATE EXTERNAL TABlE tableex(id INT, name STRING) 
ROW FORMAT 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
WITH SERDEPROPERTIES (
  "input.regex" = "^(\\d+)~\\*(.*)$"
)
STORED AS TEXTFILE 
LOCATION '/user/myusername';

这篇关于用多字符分隔符创建HIVE表的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆