为什么从文件末尾查找BZip2文件而不是Gzip文件？ [英] Why is seeking from the end of a file allowed for BZip2 files and not Gzip files?

查看：120 发布时间：2020/6/7 18:57:58 python gzip bzip2

本文介绍了为什么从文件末尾查找BZip2文件而不是Gzip文件？的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在使用Python 2.7.6解析大型压缩文件，并且想在开始之前了解未压缩文件的大小。我正在尝试使用此SO答案中介绍的第二种技术。它适用于bzip2格式的文件，但不适用于gzip格式的文件。导致这种情况的两种压缩算法有何不同？

I am parsing large compressed files in Python 2.7.6 and would like to know the uncompressed file size before starting. I am trying to use the second technique presented in this SO answer. It works for bzip2 formatted files but not gzip formatted files. What is different about the two compression algorithms that causes this?

该代码片段演示了行为，假设您当前的工作目录中包含 test.bz2和 test.gz：

This code snipped demonstrates the behavior, assuming you have "test.bz2" and "test.gz" present in your current working directory:

import os
import bz2
import gzip

bz = bz2.BZ2File('test.bz2', mode='r')
bz.seek(0, os.SEEK_END)
bz.close()

gz = gzip.GzipFile('test.gz', mode='r')
gz.seek(0, os.SEEK_END)
gz.close()

显示以下回溯：

Traceback（最近一次通话）：

  文件 zip_test.py，第10行，在
$ b中$ b    gz.seek（0，os.SEEK_END）

  文件 /usr/lib64/python2.6/gzip.py，第420行， in seek

     raise ValueError（'不支持从头开始搜索'）

ValueError：不支持从头开始搜索

Traceback (most recent call last):
  File "zip_test.py", line 10, in
    gz.seek(0, os.SEEK_END)
  File "/usr/lib64/python2.6/gzip.py", line 420, in seek
    raise ValueError('Seek from end not supported')
ValueError: Seek from end not supported

为什么对* .bz2文件有效，但对* .gz文件无效？

Why does this work for *.bz2 files but not *.gz files?

为什么从文件末尾查找BZip2文件而不是Gzip文件？ [英] Why is seeking from the end of a file allowed for BZip2 files and not Gzip files?

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

为什么从文件末尾查找BZip2文件而不是Gzip文件？ [英] Why is seeking from the end of a file allowed for BZip2 files and not Gzip files?

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭