How to turn pip / pypi installed python packages into zip files to be used in AWS Glue
Question
I am working with AWS Glue and PySpark ETL scripts, and want to use auxiliary libraries such as google_cloud_bigquery as a part of my PySpark scripts.
The documentation states this should be possible. This previous Stack Overflow discussion, especially one comment in one of the answers, seems to provide additional proof. However, how to do it is unclear to me.
So the goal is to turn the pip installed packages into one or more zip files, in order to be able to just host the packages on S3 and point to them like so:

s3://bucket/prefix/lib_A.zip,s3://bucket_B/prefix/lib_X.zip
How that should be done is not clearly stated anywhere I've looked.
i.e. how do I pip install a package and then turn it into a zip file that I can upload to S3, so PySpark can use it with such an S3 URL?
By using the command pip download I have been able to fetch the libs, but they are not .zip files by default; instead they are either .whl or .tar.gz files, so I'm not sure how to turn them into zip files that AWS Glue can digest. Maybe with .tar.gz I could just tar -xf them and then zip them back up, but how about whl files?
Answer
So, after going through the materials I sourced in the comments over the past 48 hours, here's how I solved the issue.
Note: I use Python 2.7 because that's what AWS Glue seems to ship with.
By following the instructions in E. Kampf's blog post "Best Practices Writing Production-Grade PySpark Jobs" and this Stack Overflow answer, plus some tweaking due to random errors along the way, I did the following:
- Create a new project folder called ziplib and cd into it:
mkdir ziplib && cd ziplib
- Create a requirements.txt file with the names of packages, one per row.
- Create a folder in it called deps:
mkdir deps
- Create a new virtualenv environment with Python 2.7 in the current folder:
virtualenv -p python2.7 .
- Install the requirements into the deps folder, using an absolute path (it won't work otherwise):
bin/pip2.7 install -r requirements.txt --install-option --install-lib="/absolute/path/to/.../ziplib/deps"
- cd into the deps folder, zip its contents into the archive deps.zip in the parent folder, then cd back out of deps:
cd deps && zip -r ../deps.zip . && cd ..
...and so now I have a zip file which, if I put it onto AWS S3 and point to it from PySpark on AWS Glue, seems to work.
HOWEVER... what I haven't been able to solve is that since some packages, such as the Google Cloud Python client libs, use what is known as Implicit Namespace Packages (PEP 420), they don't have the __init__.py files usually present in modules, and thus the import statements don't work. I'm at a loss here.