python中的可重复gzip文件 [英] Repeatably gzip files in python

查看：98 发布时间：2020/11/21 23:52:01 python gzip

本文介绍了python中的可重复gzip文件的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在用python编写脚本，以将静态站点部署到AWS(S3，Cloudfront，Route53).因为我不想在每次部署中都上载每个文件，所以我通过比较md5哈希值和e-tag(将s3设置为对象的md5哈希值)来检查修改了哪些文件.这对所有文件都适用，除了我的构建脚本在上传之前使用gzip压缩的文件以外.看一下文件内部，看来gzip并不是真正的纯函数.每次运行gzip时，即使源文件没有更改，输出文件也会有很小的差异.

I'm writing a script in python for deploying static sites to aws (s3, cloudfront, route53). Because I don't want to upload every file on every deploy, I check which files were modified by comparing their md5 hash with their e-tag (which s3 sets to be the object's md5 hash). This works well for all files except for those that my build script gzips before uploading. Taking a look inside the files, it seems like gzip isn't really a pure function; there are very slight differences in the output file every time gzip is run, even if the source file hasn't changed.

我的问题是:有没有办法让gzip在给定完全相同的输入的情况下可靠且可重复地输出完全相同的文件?还是我最好只是检查文件是否已压缩，解压缩并计算md5哈希值/而是手动为其设置电子标签值?

My question is this: is there any way to get gzip to reliably and repeatably output the exact same file given the exact same input? Or am I better off just checking if the file is gzipped, unzipping it and computing the md5 hash/manually setting the e-tag value for it instead?

python中的可重复gzip文件 [英] Repeatably gzip files in python

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

python中的可重复gzip文件 [英] Repeatably gzip files in python

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭