子类化 ElementTree 解析器以保留注释 [英] Subclassing ElementTree parser to retain comments

查看：38 发布时间：2021/10/1 20:14:56 python xml

本文介绍了子类化 ElementTree 解析器以保留注释的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

尝试使用ElementTree解析xml文件；由于默认情况下解析器不保留注释，因此使用了来自 http://bugs.python.org/issue8277<的以下代码/a>:

Trying to use the ElementTree to parse xml files; since by default the parser does not retain comments, used the following code from http://bugs.python.org/issue8277:

import xml.etree.ElementTree as etree

class CommentedTreeBuilder(etree.TreeBuilder):
    """A TreeBuilder subclass that retains comments."""

    def comment(self, data):
        self.start(etree.Comment, {})
        self.data(data)
        self.end(etree.Comment)

parser = etree.XMLParser(target = CommentedTreeBuilder())

以上在documents.py中.测试:

The above is in documents.py. Tested with:

class TestDocument(unittest.TestCase):

    def setUp(self):
        filename = os.path.join(sys.path[0], "data", "facilities.xml")
        self.doc = etree.parse(filename, parser = documents.parser)

    def testClass(self):
        print("Class is {0}.".format(self.doc.__class__.__name__))
        #commented out tests.

if __name__ == '__main__':
    unittest.main()

这会引起:

Traceback (most recent call last):
File "/home/goncalo/documents/games/ja2/modding/mods/xml-overhaul/src/scripts/../tests/test_documents.py", line 24, in setUp
    self.doc = etree.parse(filename, parser = documents.parser)
File "/usr/lib/python3.3/xml/etree/ElementTree.py", line 1242, in parse
    tree.parse(source, parser)
File "/usr/lib/python3.3/xml/etree/ElementTree.py", line 1726, in parse
    parser.feed(data)
IndexError: pop from empty stack

我做错了什么?顺便说一句，文件中的 xml 是有效的(由独立程序检查)并且采用 utf-8 编码.

What am I doing wrong? By the way, the xml in the file is valid (as checked by an independent program) and in utf-8 encoding.

注意事项:

使用 Python 3.3.在 Kubuntu 13.04 中，以防万一.我确保使用python3"(而不仅仅是python")来运行测试脚本.

这里是使用的示例xml文件；它非常小(让我们看看我是否可以正确设置格式):

edit: here is the sample xml file used; it is very small (let's see if I can get the formatting right):

<?xml version="1.0" encoding="utf-8"?>
<!-- changes to facilities.xml by G. Rodrigues: ar overhaul.-->
<SECTORFACILITIES>
    <!-- Drassen -->
    <!-- Small airport -->
    <FACILITY>
        <SectorGrid>B13</SectorGrid>
        <FacilityType>4</FacilityType>
        <ubHidden>0</ubHidden>
    </FACILITY>
</SECTORFACILITIES>

子类化 ElementTree 解析器以保留注释 [英] Subclassing ElementTree parser to retain comments

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

子类化 ElementTree 解析器以保留注释 [英] Subclassing ElementTree parser to retain comments

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭