Hive是否具有字符串拆分功能? [英] Does Hive have a String split function?
问题描述
我正在寻找Hive中的内置字符串分割函数?
例如如果String是
A | B | C | D | E 那么我想要一个像
array split(字符串输入,字符分隔符)
这样我就可以回到[A,B,C,D,E]。
Hive中是否存在这样的内置分割函数?
我只能看到regexp_extract和regexp_replace。我希望看到一个indexOf()和split()
字符串函数。
谢谢
Ajay
确实存在基于正则表达式的分割函数。它未在本教程中列出,但列在 wiki上的语言手册:
split(字符串str,字符串pat)
分割str(pat是一个正则表达式)
在你的情况下,分隔符 |
作为一个正则表达式具有特殊含义,所以它应该被引用作为 \\ |
。
I am looking for a in-built String split function in Hive? E.g. if String is
A|B|C|D|E
then I want to have a function like array split(string input, char delimiter)
so that I get back [A,B,C,D,E].
Does such a in-built split function exist in Hive. I can only see regexp_extract and regexp_replace. I would love to see a indexOf() and split() string functions.
Thanks
Ajay
There does exist a split function based on regular expressions. It's not listed in the tutorial, but it is listed on the language manual on the wiki:
split(string str, string pat)
Split str around pat (pat is a regular expression)
In your case, the delimiter "|
" has a special meaning as a regular expression, so it should be referred to as "\\|
".
这篇关于Hive是否具有字符串拆分功能?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!