在配置单元中使用子字符串和LPAD/RPAD替换数字 [英] Replace Digits Using SubString and lpad/rpad In Hive
本文介绍了在配置单元中使用子字符串和LPAD/RPAD替换数字的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!
问题描述
大家好
有人帮我在配置单元中实现了以下逻辑。我在配置单元中有2个表(表1,表2)。我需要在满足某些条件的情况下将Table1的一列中的0替换为9,并需要生成一个输出列,然后生成与Table2列(High列)相同的输出列。我将从Table2列(High列)联接,并将从这两个表生成输出。table1
SCHEME_1 LOW_1
01 12340000
01 12345000
01 12300000
table2
SCHEME HIGH
01 12349999
01 12345999
01 12399999
从表1开始,我需要使用以下条件生成新的列输出。
scenario:
1. I need to check 0's from right to left and as soon as i find 0 before any digit then i need to replace all the trailing 0's by 9.
2. In output at-least 4 digit should be there before 9.
3. In Input if 4 or less digits are available in input then I need to skip some 0's and make sure that at-least 4 digits are there before 9.
4. If more than 4 digits are available before trailing 0's then only need to replace 0,No need to replace digits.
LOW_1 output
12340000 12349999
12345000 12345999
12300000 12309999
然后需要将此"输出"列与表2的高度联接,然后希望生成数据。
expected output
SCHEME LOW_1 output HIGH
01 12340000 12349999 12349999
01 12345000 12345999 12345999
我正在使用下面的查询,并希望编写输出列逻辑,以便它可以与表2联接。
with table1 as
(
select LOW_1,SCHEME_1 from table1 where SCHEME_1='01'
)
select table2.*,
SCHEME_1,
LOW_1
from table2 inner join table1
on
(
table2.high=output
);
下面是我要添加到hobe查询中的输出列的逻辑。
with table1 as (
select LOW_1
) select LOW_1,
lpad(concat(splitted[0], translate(splitted[1],'0','9')),8,0) as output
from (
select LOW_1, split(regexp_replace(LOW_1,'(\d{*?}?)(0+)$','$1|$2'),'\|') splitted from table1 )s;
Can Someone Suggest me to Implement the same.
推荐答案
现在起作用了。我更新了我的查询如下。
with table1 as
(
select LOW_1,SCHEME_1,
lpad(
concat(
split(regexp_replace('(\d{4,}?)(0+)$','$1|$2'),'\|')[0],
translate(split(regexp_replace(LOW_1,'(\d{4,}?)(0+)$','$1|$2'),'\|')[1],'0','9')),8,0 )
as output
from table1 where SCHEME_1='01'
)
select table2.*,
SCHEME_1,
LOW_1
from table2 inner join table1
on
(
table2.high=output
);
这篇关于在配置单元中使用子字符串和LPAD/RPAD替换数字的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!
查看全文