将 Hive 与 Pig 一起使用 [英] Using Hive with Pig
问题描述
我的 hive 查询有多个外连接,执行时间很长.我想知道将其分解为多个较小的查询并使用 pig 进行转换是否有意义.
My hive query has multiple outer joins and takes very long to execute. I was wondering if it would make sense to break it into multiple smaller queries and use pig to work the transformations.
有没有一种方法可以在猪脚本中查询配置单元表或读取配置单元表数据?
Is there a way I could query hive tables or read hive table data within a pig script?
谢谢
推荐答案
Howl 项目的目标 是允许 Pig 和 Hive 共享单个元数据存储库.一旦 Howl 成熟,您将能够在相同的表.现在,您可以尝试处理存储在 HDFS 中的数据.
The goal of the Howl project is to allow Pig and Hive to share a single metadata repository. Once Howl is mature, you'll be able to run PigLatin and HiveQL queries over the same tables. For now, you can try to work with the data as it is stored in HDFS.
请注意,Howl 已重命名为 HCatalog.
Note that Howl has been renamed to HCatalog.
这篇关于将 Hive 与 Pig 一起使用的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!