Java: skip binary data in xml file while parsing

Views: 510 | Published: 2016/8/6 22:59:47 | Tags: java xml parsing binary

Problem description

I want to parse an XML file in Java that contains binary data. Here is an example of the file:

<?xml version="1.0" encoding="utf-8"?>
<documents>
  <document>
    <element name="docid">
      <value><![CDATA[0902307e8004c74c]]></value>
    </element>
    <element name="published">
      <value><![CDATA[2012-01-01T00:00:00]]></value>
    </element>
    <element name="documenttype">
      <value><![CDATA[Circular]]></value>
    </element>
    <element name="data">
      <value><![CDATA[%PDF-1.6
%����
1020 0 obj
<</Filter/FlateDecode/First 20/Length 270/N 3/Type/ObjStm>>stream
�o^���)|�,�Ypoef�
l���o�>����u���b"Cb�|���%&��D�yD��q�q�q�q�q��%_ja�LJob��/��3"=����o���]V11}�    }a�+'6@����C�,^}�d%�۠�`s��q��5�׷^(�N��{S<S�����A��������-������f\ڌ��|U/݌�z���f�I9����g�g���s���0z'��X~
endstream
endobj
startxref
55097
%%EOF
]]></value>
    </element>
    <element name="dataname">
      <value><![CDATA[sdfsfsfsdsdfsd.pdf]]></value>
    </element>
  </document>
</documents>

Normally I would parse such an XML file like this:

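// Imports needed by this fragment
import java.io.IOException;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.xml.sax.SAXException;

// Inside a method; fastXMLFile is the XML input, defined elsewhere.
Document doc = null;
DocumentBuilder documentBuilder = null;
DocumentBuilderFactory documentBuilderFactory = DocumentBuilderFactory.newInstance();
try {
    documentBuilder = documentBuilderFactory.newDocumentBuilder();
} catch (ParserConfigurationException e) {
    e.printStackTrace();
    return; // no builder available, so nothing can be parsed
}
try {
    // Fails here on the binary CDATA content of the "data" element
    doc = documentBuilder.parse(fastXMLFile);
} catch (SAXException e) {
    System.out.println("SAXExept");
    e.printStackTrace();
} catch (IOException e) {
    System.out.println("Test");
    return;
}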

But because of the "data" element, which contains binary data, I get the following error:

[Fatal Error] xmlfile.xml:58:10: An invalid XML character (Unicode: 0x1a) was found in the CDATA section.
SAXExept
org.xml.sax.SAXParseException: An invalid XML character (Unicode: 0x1a) was found in the CDATA section.

I don't need to parse this data field for now, so I could just skip it. I only want to parse the rest of the data. Is this possible?
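
One possible way to skip the field, offered as a sketch and not a definitive solution: empty the CDATA payload of the "data" element before the bytes reach the parser, so the parser never sees the invalid characters. The class and method names below (BinaryCdataSkipper, stripDataPayload, parseSkippingData) are hypothetical. The sketch assumes the whole file fits in memory, that the "data" element is written exactly as in the sample above, and that the binary payload does not itself contain the sequence "]]></value>" (if it does, the match ends early).

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.regex.Pattern;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.xml.sax.SAXException;

// Hypothetical helper, not part of any library.
final class BinaryCdataSkipper {

    // Group 1: opening of the "data" element up to the CDATA start.
    // Group 2: CDATA end plus closing </value>.
    // DOTALL lets '.' span line breaks inside the binary payload.
    private static final Pattern DATA_CDATA = Pattern.compile(
            "(<element name=\"data\">\\s*<value><!\\[CDATA\\[).*?(\\]\\]></value>)",
            Pattern.DOTALL);

    // Returns the XML bytes with the payload of every "data" element emptied.
    static byte[] stripDataPayload(byte[] rawXml) {
        // ISO-8859-1 maps each byte to exactly one char, so the bytes outside
        // the removed payload survive the round trip unchanged.
        String text = new String(rawXml, StandardCharsets.ISO_8859_1);
        String cleaned = DATA_CDATA.matcher(text).replaceAll("$1$2");
        return cleaned.getBytes(StandardCharsets.ISO_8859_1);
    }

    // Parses the cleaned bytes; the parser still honours the encoding
    // declared in the XML prolog because it receives a byte stream.
    static Document parseSkippingData(byte[] rawXml)
            throws ParserConfigurationException, SAXException, IOException {
        DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
        return builder.parse(new ByteArrayInputStream(stripDataPayload(rawXml)));
    }
}
```

If fastXMLFile is a java.io.File (an assumption, since its type is not shown), the call could look like BinaryCdataSkipper.parseSkippingData(java.nio.file.Files.readAllBytes(fastXMLFile.toPath())). The "data" element is still present in the resulting document, only with an empty value, so code that walks the other elements should not need to change.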
