如何遍历talend中的XML类型数据列的子元素 [英] How to loop through child elements of XML type data column in talend
问题描述
我想从旧表迁移数据,该旧表的一列包含XML格式的数据。我试图使用talend破坏XML,但总是一次又一次地获得第一个元素。
这是我的旧数据库中的数据:
I want to migrate data from an old table which has one column which has data in XML form. I am trying to break the XML using talend but always getting the first element again and again. This is the data i have in my OLD DB:
ID DETAIL
1 <ORDER_DETAIL>
<PACK>
<ID>703</ID>
<NAME>iPhone 6-16 GB-Space Grey-KD 57 Pack</NAME>
<STANDALONE_PRICE>0</STANDALONE_PRICE>
<MONTHLY_PRICE>57</MONTHLY_PRICE>
<UPFRONT_PRICE>0</UPFRONT_PRICE>
<ITEMS>
<ITEM>
<ID>177</ID>
<NAME>Bundle_2500_30GB_UNLIMITED</NAME>
<ROLE>Primary</ROLE>
<TYPE>Service</TYPE>
<ITEM_MINUTES>2500|Local Mins.</ITEM_MINUTES>
<ITEM_MESSAGES>Unlimited|SMS and MMS</ITEM_MESSAGES>
<ITEM_INTERNET>30 GB|4G LTE Internet</ITEM_INTERNET>
</ITEM>
<ITEM>
<ID>226</ID>
<NAME>Gold</NAME>
<ROLE>Primary</ROLE>
<TYPE>SIM</TYPE>
<NUMBER_CLASS>G</NUMBER_CLASS>
</ITEM>
<ITEM>
<ID>560</ID>
<NAME>iPhone 6 - 16 GB - Space Grey</NAME>
<ROLE>Primary</ROLE>
<TYPE>Device</TYPE>
<COLOR>Space Gray|#808080</COLOR>
<SIZE>16 GB</SIZE>
</ITEM>
<ITEM>
<ID>561</ID>
<NAME>4G LTE Hotspot - 0 GB - Black</NAME>
<ROLE>Secondary Optional</ROLE>
<TYPE>Device</TYPE>
<COLOR>Black|#000000</COLOR>
<SIZE>16 GB</SIZE>
</ITEM>
</ITEMS>
</PACK>
</ORDER_DETAIL>
我的输出低于输出值:
-------------+--------------------------+--------.
| tLogRow_2 |
|=------------+--------------------------+-------=|
|pivot_key |pivot_value |ORDER_ID|
|=------------+--------------------------+-------=|
|NAME |Bundle_2500_30GB_UNLIMITED|3150711 |
|ID |177 |3150711 |
|ROLE |Primary |3150711 |
|TYPE |Service |3150711 |
|ITEM_MINUTES |2500|Local Mins. |3150711 |
|ITEM_MESSAGES|Unlimited|SMS and MMS |3150711 |
|ITEM_INTERNET|30 GB|4G LTE Internet |3150711 |
|SIZE |16 GB |3150711 |
|COLOR |Space Gray|#808080 |3150711 |
|NUMBER_CLASS |G |3150711 |
|NAME |Bundle_2500_30GB_UNLIMITED|3150711 |
|ID |177 |3150711 |
|ROLE |Primary |3150711 |
|TYPE |Service |3150711 |
|ITEM_MINUTES |2500|Local Mins. |3150711 |
|ITEM_MESSAGES|Unlimited|SMS and MMS |3150711 |
|ITEM_INTERNET|30 GB|4G LTE Internet |3150711 |
|SIZE |16 GB |3150711 |
|COLOR |Space Gray|#808080 |3150711 |
|NUMBER_CLASS |G |3150711 |
|NAME |Bundle_2500_30GB_UNLIMITED|3150711 |
|ID |177 |3150711 |
|ROLE |Primary |3150711 |
|TYPE |Service |3150711 |
|ITEM_MINUTES |2500|Local Mins. |3150711 |
|ITEM_MESSAGES|Unlimited|SMS and MMS |3150711 |
|ITEM_INTERNET|30 GB|4G LTE Internet |3150711 |
|SIZE |16 GB |3150711 |
|COLOR |Space Gray|#808080 |3150711 |
|NUMBER_CLASS |G |3150711 |
|NAME |Bundle_2500_30GB_UNLIMITED|3150711 |
|ID |177 |3150711 |
|ROLE |Primary |3150711 |
|TYPE |Service |3150711 |
|ITEM_MINUTES |2500|Local Mins. |3150711 |
|ITEM_MESSAGES|Unlimited|SMS and MMS |3150711 |
|ITEM_INTERNET|30 GB|4G LTE Internet |3150711 |
|SIZE |16 GB |3150711 |
|COLOR |Space Gray|#808080 |3150711 |
|NUMBER_CLASS |G |3150711 |
'-------------+--------------------------+--------'
一次又一次地获取所有子节点的第一个元素。
这是我的工作:
tOracleInput-> tMap> tExtraxtXMLField-> tUnPivotRow--tLogRow
Getting first element of all child nodes again and again. This is my job: tOracleInput-->tMap>tExtraxtXMLField-->tUnPivotRow--tLogRow
请有人帮我解决这个
Someone please help me on this
推荐答案
您是否在tXmlMap中为标签设置了作为可选循环?
Do you set up "as optional loop" for your tag in tXmlMap?
您可以尝试使用tExtractXmlFiled和 Loop XPATH query分别使用 / ORDER_DETAIL / ITEMS和 tFlowToIterate来分别读取 Items。
You can try to read "Items" separatly, using tExtractXmlFiled with "Loop XPATH query" : "/ORDER_DETAIL/ITEMS" and then "tFlowToIterate".
这篇关于如何遍历talend中的XML类型数据列的子元素的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!