Difference between Pig and Hive? Why have both?


Problem description


    My background: I have been in the Hadoop world for 4 weeks. I have dabbled a bit in Hive, Pig and Hadoop using Cloudera's Hadoop VM, and I have read Google's papers on Map-Reduce and GFS (PDF link).

    I understand that:

    • Pig's language, Pig Latin, is a shift away from the SQL-like declarative style of programming (it suits the way programmers think), whereas Hive's query language closely resembles SQL.

    • Pig sits on top of Hadoop and in principle can also sit on top of Dryad. I might be wrong, but Hive is closely coupled to Hadoop.

    • Both Pig Latin and Hive commands compile to Map and Reduce jobs.

    My question: what is the goal of having both when one (say Pig) could serve the purpose? Is it just because Pig is evangelized by Yahoo! and Hive by Facebook?

    Solution

    Check out this post from Alan Gates, Pig architect at Yahoo!, which compares when you would use a SQL-like language such as Hive rather than Pig. He makes a very convincing case for the usefulness of a procedural language like Pig (vs. declarative SQL) and its utility to dataflow designers.
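
To make the procedural-versus-declarative contrast concrete, here is a rough sketch in plain Python. It is not Pig Latin or HiveQL and it is not taken from the linked post. The sample data, the function names and the use of the standard-library `sqlite3` module as a stand-in for a SQL engine are all assumptions made for illustration. The first function states the desired result in one SQL query; the second spells out the same computation as named dataflow steps, which is roughly the style a Pig script encourages.

```python
import sqlite3
from collections import defaultdict

# Hypothetical sample data: (user, url, seconds_spent)
VISITS = [
    ("alice", "/home", 12),
    ("bob", "/home", 7),
    ("alice", "/docs", 30),
    ("carol", "/docs", 3),
    ("bob", "/docs", 21),
]


def declarative_totals(rows, min_seconds):
    """SQL style: describe the result and let the engine choose the steps."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE visits (user TEXT, url TEXT, seconds INTEGER)")
    conn.executemany("INSERT INTO visits VALUES (?, ?, ?)", rows)
    result = conn.execute(
        "SELECT url, SUM(seconds) FROM visits "
        "WHERE seconds >= ? GROUP BY url ORDER BY url",
        (min_seconds,),
    ).fetchall()
    conn.close()
    return result


def dataflow_totals(rows, min_seconds):
    """Dataflow style: every intermediate relation is an explicit, named step."""
    # Step 1: keep only visits that lasted long enough.
    filtered = [row for row in rows if row[2] >= min_seconds]

    # Step 2: group the remaining visits by URL.
    grouped = defaultdict(list)
    for _user, url, seconds in filtered:
        grouped[url].append(seconds)

    # Step 3: aggregate each group into a total.
    totals = [(url, sum(secs)) for url, secs in grouped.items()]

    # Step 4: order the output by URL.
    return sorted(totals)


if __name__ == "__main__":
    print(declarative_totals(VISITS, 5))
    print(dataflow_totals(VISITS, 5))
```

Both functions return the same totals. The difference lies in who decides the order of operations: the query engine in the first case, the author of the pipeline in the second, where each intermediate result can also be inspected or reused.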
