How can I strip namespaces out of an lxml tree?
Question
Following on from "Removing child elements in XML using python" ...
Thanks to @Tichodroma, I have this code:
If you can use lxml, try this:
import lxml.etree

tree = lxml.etree.parse("leg.xml")
for dog in tree.xpath("//Leg1:Dog",
                      namespaces={"Leg1": "http://what.not"}):
    parent = dog.xpath("..")[0]
    parent.remove(dog)
    parent.text = None
tree.write("leg.out.xml")
Now leg.out.xml looks like this:
<?xml version="1.0"?>
<Leg1:MOR xmlns:Leg1="http://what.not" oCount="7">
  <Leg1:Order>
    <Leg1:CTemp id="FO">
      <Leg1:Group bNum="001" cCount="4"/>
      <Leg1:Group bNum="002" cCount="4"/>
    </Leg1:CTemp>
    <Leg1:CTemp id="GO">
      <Leg1:Group bNum="001" cCount="4"/>
      <Leg1:Group bNum="002" cCount="4"/>
    </Leg1:CTemp>
  </Leg1:Order>
</Leg1:MOR>
How do I modify my code to remove the Leg1: namespace prefix from all of the elements' tag names?
Recommended answer
One possible way to remove the namespace prefix from each element:
from lxml import etree

def strip_ns_prefix(tree):
    # iterate through element nodes only (skips comments, text nodes, etc.)
    for element in tree.xpath('descendant-or-self::*'):
        # if the element has a prefix...
        # (prefix is None for elements in a default namespace, so those are left as-is)
        if element.prefix:
            # ...replace the qualified tag name with its local name
            element.tag = etree.QName(element).localname
    return tree
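As a rough usage sketch of the function above: the sample document below is a shortened version of the leg.out.xml from the question, and the function is repeated only so that the snippet runs on its own. The call to etree.cleanup_namespaces() is an addition that is not part of the answer; it is included on the assumption that the now-unused xmlns:Leg1 declaration should also disappear from the output.

```python
from lxml import etree


def strip_ns_prefix(root):
    # Same approach as the answer: rename every prefixed element to its local name.
    for element in root.xpath('descendant-or-self::*'):
        if element.prefix:
            element.tag = etree.QName(element).localname
    return root


# Shortened sample modeled on the question's leg.out.xml (illustrative only).
SAMPLE = (
    b'<Leg1:MOR xmlns:Leg1="http://what.not" oCount="7">'
    b'<Leg1:Order><Leg1:CTemp id="FO">'
    b'<Leg1:Group bNum="001" cCount="4"/>'
    b'</Leg1:CTemp></Leg1:Order></Leg1:MOR>'
)

root = etree.fromstring(SAMPLE)
strip_ns_prefix(root)

# Assumption: we also want to drop the xmlns:Leg1 declaration, which is
# unused once no element refers to the namespace any more.
etree.cleanup_namespaces(root)

result = etree.tostring(root).decode()
print(result)
```

Without the cleanup step the tag names are already unprefixed, but the root element would still carry the namespace declaration when serialized.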
Another version, which does the namespace check in the XPath expression instead of using an if statement:
from lxml import etree

def strip_ns_prefix(tree):
    # XPath query selecting every element node that is in a namespace
    query = "descendant-or-self::*[namespace-uri()!='']"
    # for each element returned by the XPath query above...
    for element in tree.xpath(query):
        # ...replace the qualified tag name with its local name
        element.tag = etree.QName(element).localname
    return tree
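The two versions may behave differently when a document uses a default namespace (xmlns="..." with no prefix), because lxml reports prefix as None for such elements. The sketch below illustrates this with a made-up two-element document. The function names strip_by_prefix and strip_by_namespace_uri are hypothetical labels for the two variants, and the cleanup_namespaces() call is an extra step that is not in the answer.

```python
from lxml import etree


def strip_by_prefix(root):
    # Variant 1: only touches elements that carry an explicit prefix.
    for element in root.xpath('descendant-or-self::*'):
        if element.prefix:
            element.tag = etree.QName(element).localname
    return root


def strip_by_namespace_uri(root):
    # Variant 2: touches every element that is in any namespace.
    for element in root.xpath("descendant-or-self::*[namespace-uri()!='']"):
        element.tag = etree.QName(element).localname
    return root


# Hypothetical document using a default namespace instead of the Leg1: prefix.
DEFAULT_NS_DOC = b'<MOR xmlns="http://what.not"><Order/></MOR>'

root_a = etree.fromstring(DEFAULT_NS_DOC)
strip_by_prefix(root_a)
print(root_a.tag)  # still namespace-qualified, since prefix is None here

root_b = etree.fromstring(DEFAULT_NS_DOC)
strip_by_namespace_uri(root_b)
etree.cleanup_namespaces(root_b)  # drop the now-unused declaration
print(etree.tostring(root_b).decode())
```

If the input might declare a default namespace, the XPath-based variant is probably the safer choice of the two.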