pydoop vs hadoopy - hadoop python client


Question

While looking for a Python client for Hadoop, I found two modules, pydoop and hadoopy. Both seem good enough to work with, but I am not sure which one has the advantage over the other and is the better one to install.

Recommended answer

I think the most comprehensive write-up on this is http://blog.cloudera.com/blog/2013/01/a-guide-to-python-frameworks-for-hadoop/

More recently, I think mrjob has come out ahead as the clear frontrunner. It has a very active mailing list, seems relatively stable and up to date, and integrates nicely with Amazon EMR.
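For a rough sense of what writing a job looks like, here is a minimal word-count sketch in plain Python. It is not mrjob, pydoop, or hadoopy code: mrjob expresses a job as mapper and reducer methods on a class derived from `MRJob`, and this sketch only mimics that mapper/reducer shape with the standard library. The names `mapper`, `reducer`, and `run_local` are made up for illustration, and the in-memory grouping stands in for the shuffle that Hadoop would perform.

```python
from collections import defaultdict


def mapper(line):
    """Emit a (word, 1) pair for every whitespace-separated word in a line."""
    for word in line.split():
        yield word.lower(), 1


def reducer(word, counts):
    """Sum all the counts collected for a single word."""
    return word, sum(counts)


def run_local(lines):
    """Hypothetical local runner: group mapper output by key, then reduce.

    The grouping here is an in-memory stand-in for Hadoop's shuffle step.
    """
    grouped = defaultdict(list)
    for line in lines:
        for key, value in mapper(line):
            grouped[key].append(value)
    return dict(reducer(key, values) for key, values in sorted(grouped.items()))


if __name__ == "__main__":
    print(run_local(["Hadoop with Python", "python on hadoop"]))
```

With any of the frameworks mentioned here, the mapper and reducer logic would stay roughly the same; what differs is how the job is declared and submitted, so check the chosen library's documentation for the actual API.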
