How do I install pyspark for use in standalone scripts?

Question

I'm trying to use Spark with Python. I installed the Spark 1.0.2 for Hadoop 2 binary distribution from the downloads page. I can run through the quickstart examples in Python interactive mode, but now I'd like to write a standalone Python script that uses Spark. The quick start documentation says to just import pyspark, but this doesn't work because it's not on my PYTHONPATH.

I can run bin/pyspark and see that the module is installed beneath SPARK_DIR/python/pyspark. I can manually add this to my PYTHONPATH environment variable, but I'd like to know the preferred automated method.

What is the best way to add pyspark support for standalone scripts? I don't see a setup.py anywhere under the Spark install directory. How would I create a pip package for a Python script that depended on Spark?

Answer

You can set the PYTHONPATH manually as you suggest, and this may be useful to you when testing stand-alone non-interactive scripts on a local installation.
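For local testing, here is a minimal sketch of that manual approach (the install path is a placeholder, and the Py4J zip bundled under python/lib is located with a glob because its exact name varies by Spark version):

```python
import glob
import os
import sys

# Hypothetical install location; adjust to wherever the Spark 1.0.2
# binary distribution was unpacked.
SPARK_DIR = os.environ.get("SPARK_HOME", "/opt/spark-1.0.2-bin-hadoop2")

# Make the bundled pyspark package importable.
sys.path.insert(0, os.path.join(SPARK_DIR, "python"))

# pyspark needs Py4J, which ships as a zip inside the distribution.
for py4j_zip in glob.glob(os.path.join(SPARK_DIR, "python", "lib", "py4j-*-src.zip")):
    sys.path.insert(0, py4j_zip)

from pyspark import SparkContext  # import only after the path is set up

sc = SparkContext("local", "standalone-test")
print(sc.parallelize(range(100)).filter(lambda x: x % 7 == 0).count())
sc.stop()
```

Equivalently, the same two paths can be exported on PYTHONPATH in the shell before launching the script, which keeps the script itself free of path manipulation.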

However, (py)spark is all about distributing your jobs to nodes on clusters. Each cluster has a configuration defining a manager and many parameters; the details of setting this up are covered in the Spark documentation, and include a simple local cluster (which may be useful for testing functionality).
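As an illustration of choosing the master in code (a sketch only; the cluster URL shown is a placeholder), SparkConf lets the same script run against a local master for testing and be pointed at a real manager later:

```python
from pyspark import SparkConf, SparkContext

# "local[*]" runs Spark in-process using all local cores -- handy for testing.
# For a real cluster, point this at the manager instead, e.g.
# "spark://master-host:7077" (placeholder URL).
conf = (SparkConf()
        .setMaster("local[*]")
        .setAppName("config-example"))

sc = SparkContext(conf=conf)
print(sc.parallelize([1, 2, 3, 4]).map(lambda x: x * x).collect())
sc.stop()
```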

In production, you will be submitting jobs to Spark via spark-submit, which distributes your code to the cluster nodes and establishes the context for it to run in on those nodes. You do, however, need to make sure that the Python installations on the nodes have all the required dependencies (the recommended way), or that the dependencies are passed along with your code (I don't know how that works).
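One way pure-Python dependencies can travel with the code is spark-submit's --py-files option, which accepts .py, .zip, or .egg files and places them on the workers' PYTHONPATH. A sketch, with hypothetical script, cluster, and file names:

```python
# my_job.py -- submitted with something like (names are placeholders):
#
#   spark-submit --master spark://master-host:7077 \
#                --py-files deps.zip \
#                my_job.py
#
# --py-files ships the listed files to the executors; compiled or native
# dependencies still have to be installed on each node.
from pyspark import SparkContext

# spark-submit supplies the master, so only the app name is set here.
sc = SparkContext(appName="submit-example")

counts = (sc.textFile("hdfs:///data/input.txt")
            .flatMap(lambda line: line.split())
            .map(lambda word: (word, 1))
            .reduceByKey(lambda a, b: a + b))
print(counts.take(10))
sc.stop()
```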
