使用Hive删除特殊字符 [英] Removing special characters using Hive
问题描述
我的数据存储在Cassandra 1.2中,如下所示。 sValue下有特殊字符 - 突出显示为粗体。我如何使用配置单元函数来删除它?
Date |时间戳| payload_Timestamp | actDate | actHour | actMinute | sDesc | sName | sValue
--------------------------------- + ----------- --------------------------- + ---------------------- ---- + ---------------------- + ---------------------- + ------------------------ + ------------------------ --- + -------------------------------- + ------------- --------
2014-06-25 00:00:00-0400 | 2014-06-25 08:31:23-0400 | 2014-06-25 08:31:23-0400 | 06-25-2014 | 8 | 31 |标签| / t1 / t2 / 100/200/11/99 | 2743326591.03\x00
您可以使用regexp_replace()函数。
$ b
更多详情请参阅
https: //cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF
I have data stored in Cassandra 1.2 as shown below. There is special character under sValue - highlighted as bold. How can I use hive function to remove this ?
Date | Timestam | payload_Timestamp | actDate | actHour | actMinute | sDesc | sName | sValue ---------------------------------+--------------------------------------+--------------------------+----------------------+----------------------+------------------------+---------------------------+--------------------------------+--------------------- 2014-06-25 00:00:00-0400 | 2014-06-25 08:31:23-0400 | 2014-06-25 08:31:23-0400 | 06-25-2014 | 8 | 31 | lable | /t1/t2/100/200/11/99 | 2743326591.03\x00
You can use regexp_replace() function.
More details available on https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF
这篇关于使用Hive删除特殊字符的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!