Best compression algorithm for XML?

Question

I barely know a thing about compression, so bear with me (this is probably a stupid and painfully obvious question).

So let's say I have an XML file with a few tags.

<verylongtagnumberone>
  <verylongtagnumbertwo>
    text
  </verylongtagnumbertwo>
</verylongtagnumberone>

Now let's say I have a bunch of these very long tags, with many attributes, spread across multiple XML files. I need to compress them to the smallest size possible. The best way would be to use an XML-specific algorithm that assigns each tag a short alias such as vlt1 or vlt2. However, that wouldn't be as 'open' an approach as I'm aiming for, so I want to use a common algorithm like DEFLATE or LZ. It also helps if the archive is a .zip file.
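To get a feel for how much the alias idea would actually save once DEFLATE is applied, a small experiment can help. The following is only a sketch using Python's standard `zlib` module; the helper `make_xml`, the `<root>` wrapper, and the repeat count are made up for illustration, and real documents will behave differently.

```python
import zlib

def make_xml(outer: str, inner: str, n: int = 500) -> bytes:
    """Build a synthetic document that repeats the same two tags n times."""
    rows = [f"<{outer}><{inner}>text {i}</{inner}></{outer}>" for i in range(n)]
    return ("<root>" + "".join(rows) + "</root>").encode("utf-8")

# Same structure, once with the long tag names and once with short aliases.
long_xml = make_xml("verylongtagnumberone", "verylongtagnumbertwo")
short_xml = make_xml("vlt1", "vlt2")

# zlib implements DEFLATE; level 9 asks for the strongest compression.
long_z = zlib.compress(long_xml, 9)
short_z = zlib.compress(short_xml, 9)

print("raw size,  long tags :", len(long_xml))
print("raw size,  short tags:", len(short_xml))
print("deflated,  long tags :", len(long_z))
print("deflated,  short tags:", len(short_z))
```

Because DEFLATE replaces repeated strings with back-references, the gap between the two compressed sizes is typically far smaller than the gap between the raw sizes, which is worth measuring on real files before introducing custom tag aliases.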

Since I'm dealing with plain text (no binary files like images), I'd like an algorithm that suits plain text. Which one produces the smallest file size (lossless algorithms are preferred)?
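Which general-purpose algorithm gives the smallest output depends on the data, so measuring on representative files is the safest way to answer this. Below is a minimal sketch comparing three lossless compressors from the Python standard library (`zlib` for DEFLATE, `bz2`, and `lzma`); the function name `compressed_sizes` and the sample document are hypothetical.

```python
import bz2
import lzma
import zlib

def compressed_sizes(data: bytes) -> dict:
    """Return the byte size of data under several lossless compressors."""
    return {
        "raw": len(data),
        "deflate (zlib)": len(zlib.compress(data, 9)),
        "bzip2": len(bz2.compress(data, 9)),
        "lzma": len(lzma.compress(data, preset=9)),
    }

# Synthetic stand-in for a real document; replace with your own file contents.
sample = "".join(
    f"<verylongtagnumberone><verylongtagnumbertwo>text {i}"
    f"</verylongtagnumbertwo></verylongtagnumberone>\n"
    for i in range(300)
).encode("utf-8")

for name, size in compressed_sizes(sample).items():
    print(f"{name:>15}: {size} bytes")
```

Note that a .zip container traditionally uses DEFLATE; other methods exist in the format, but not every unzip tool supports them, which matters if openness is a goal.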

By the way, the scenario is this: I am creating a standard for documents, like ODF or MS Office XML, that contain XML files, packaged in a .zip.
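The packaging step described here can be sketched with Python's standard `zipfile` module, which writes DEFLATE-compressed .zip archives. The member names (`content.xml`, `meta.xml`), their contents, and the helper `build_package` are placeholders invented for this example, not part of any existing document standard.

```python
import io
import zipfile

# Hypothetical package members; names and contents are placeholders only.
parts = {
    "content.xml": "<verylongtagnumberone><verylongtagnumbertwo>text"
                   "</verylongtagnumbertwo></verylongtagnumberone>" * 50,
    "meta.xml": "<meta><title>Example</title></meta>",
}

def build_package(members: dict) -> bytes:
    """Write each XML part into an in-memory .zip using DEFLATE."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", compression=zipfile.ZIP_DEFLATED,
                         compresslevel=9) as zf:
        for name, xml_text in members.items():
            zf.writestr(name, xml_text.encode("utf-8"))
    return buf.getvalue()

package = build_package(parts)

# Reopen the archive and report the per-member sizes.
with zipfile.ZipFile(io.BytesIO(package)) as zf:
    for info in zf.infolist():
        print(info.filename, info.file_size, "->", info.compress_size)
```

Writing `package` to disk with a suitable file extension would give an archive that any ordinary zip tool can open, which keeps the format inspectable without special software.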

Edit: The 'encryption' thing was a typo; it should have been 'compression'.

Recommended answer

There is a W3C standard (not yet released at the time of writing) named EXI (Efficient XML Interchange).

It should become THE data format for compressing XML data in the future (it is claimed to be the last necessary binary format). Being optimized for XML, it compresses XML far more efficiently than any conventional compression algorithm.

With EXI, you can operate on compressed XML data on the fly (without the need to uncompress or re-compress it).

EXI = (XML + XMLSchema) as binary.

And here is an open-source implementation (I don't know whether it is stable yet):
Exificient
