拆分XML文档转换成块 [英] split xml document into chunks

查看:86
本文介绍了拆分XML文档转换成块的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我有一个需要大量XML文档在同一时间内处理100个记录

I have a large xml document that needs to be processed 100 records at a time

这是正在用C#Windows服务中完成。

It is being done within a Windows Service written in c#.

的结构如下:

<docket xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:noNamespaceSchemaLocation="docket.xsd">
    <order>
    	<Date>2008-10-13</Date>
    	<orderNumber>050758023</orderNumber>
    	<ParcelID/>
    	<CustomerName>sddsf</CustomerName>
    	<DeliveryName>dsfd</DeliveryName>
    	<Address1>sdf</Address1>
    	<Address2>sdfsdd</Address2>
    	<Address3>sdfdsfdf</Address3>
    	<Address4>dffddf</Address4>
    	<PostCode/>

    </order>
    <order>
    	<Date>2008-10-13</Date>
    	<orderNumber>050758023</orderNumber>
    	<ParcelID/>
    	<CustomerName>sddsf</CustomerName>
    	<DeliveryName>dsfd</DeliveryName>
    	<Address1>sdf</Address1>
    	<Address2>sdfsdd</Address2>
    	<Address3>sdfdsfdf</Address3>
    	<Address4>dffddf</Address4>
    	<PostCode/>

    </order>

    .....

    .....

</docket>

有可能是上千份订单在案卷。

There could be thousands of orders in a docket.

我想切这个为100元块

I need to chop this into 100 element chunks

不过每个100份订单仍需要与父母案卷节点被包裹,并且具有相同的命名空间等。

However each of the 100 orders still need to be wrapped with the parent "docket" node and have the same namespace etc

这可能吗?

推荐答案

另一种天真的解决方案;这次为.NET 2.0。它应该给你如何去了解你想要什么的想法。使用XPath表达式,而不是LINQ到XML。 。块100为了案卷到10日程在我devbox第二下

Another naive solution; this time for .NET 2.0. It should give you an idea of how to go about what you want. Uses Xpath expressions instead of Linq to XML. Chunks a 100 order docket into 10 dockets in under a second on my devbox.

 public List<XmlDocument> ChunkDocket(XmlDocument docket, int chunkSize)
    {
        List<XmlDocument> newDockets = new List<XmlDocument>();
        //            
        int orderCount = docket.SelectNodes("//docket/order").Count;
        int chunkStart = 0;
        XmlDocument newDocket = null;
        XmlElement root = null;
        XmlNodeList chunk = null;

        while (chunkStart < orderCount)
        {
            newDocket = new XmlDocument();
            root = newDocket.CreateElement("docket");
            newDocket.AppendChild(root);

            chunk = docket.SelectNodes(String.Format("//docket/order[position() > {0} and position() <= {1}]", chunkStart, chunkStart + chunkSize));

            chunkStart += chunkSize;

            XmlNode targetNode = null;
            foreach (XmlNode c in chunk)
            {
                targetNode = newDocket.ImportNode(c, true);
                root.AppendChild(targetNode);
            }

            newDockets.Add(newDocket);
        } 

        return newDockets;
    }

这篇关于拆分XML文档转换成块的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆