Jar containing org.apache.hadoop.hive.dynamodb

Question

I am trying to load a DynamoDB table into HDFS programmatically (via Java, not Hive). I couldn't find any examples online of how to do it, so I thought I'd download the jar containing org.apache.hadoop.hive.dynamodb and reverse engineer the process.

Unfortunately, I couldn't find that jar either :(.

Could someone help with the following (listed in order of priority)?

  1. A Java example of loading a DynamoDB table into HDFS (one where the table can be passed to a mapper as an input format).
  2. The jar containing org.apache.hadoop.hive.dynamodb.

Thanks!

Recommended answer

It's in hive-bigbird-handler.jar. Unfortunately, AWS doesn't provide the source for it, or even Javadoc. But you can find the jar on any node of an EMR cluster:

/home/hadoop/.versions/hive-0.8.1/auxlib/hive-bigbird-handler-0.8.1.jar
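
Once you have copied the jar off the node, one way to confirm that it really contains the package, and to see which classes are worth decompiling, is to list its entries. The following is a minimal sketch using only the standard java.util.jar API. The class name JarPackageLister and the method listClasses are made up for this illustration and are not part of any AWS library.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.List;
import java.util.jar.JarEntry;
import java.util.jar.JarFile;
import java.util.stream.Collectors;

// Hypothetical helper (not part of any AWS library): lists the classes
// a jar contains under a given package, including its subpackages.
public class JarPackageLister {

    public static List<String> listClasses(Path jarPath, String packageName) throws IOException {
        // Jar entries use '/' separators, e.g. org/apache/hadoop/hive/dynamodb/Foo.class
        String prefix = packageName.replace('.', '/') + "/";
        try (JarFile jar = new JarFile(jarPath.toFile())) {
            return jar.stream()
                    .map(JarEntry::getName)
                    .filter(name -> name.startsWith(prefix) && name.endsWith(".class"))
                    // Turn the entry path back into a fully qualified class name.
                    .map(name -> name.substring(0, name.length() - ".class".length()).replace('/', '.'))
                    .sorted()
                    .collect(Collectors.toList());
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 1) {
            System.err.println("usage: java JarPackageLister <path-to-jar> [package]");
            return;
        }
        Path jar = Paths.get(args[0]);
        // Default to the package from the question when none is given.
        String pkg = args.length > 1 ? args[1] : "org.apache.hadoop.hive.dynamodb";
        listClasses(jar, pkg).forEach(System.out::println);
    }
}
```

Run it with the path of the copied jar as the first argument. If nothing is printed, the package is not in that jar. What the listed classes are called and how they are meant to be used is not documented, so that part still has to come from decompiling them.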

You might want to check out this article:

Unfortunately, Amazon haven’t released the sources for hive-bigbird-handler.jar, which is a shame considering its usefulness. Of particular note, it seems it also includes built-in support for Hadoop’s Input and Output formats, so one can write straight on MapReduce Jobs, writing directly into DynamoDB.

Tip: search for hive-bigbird-handler.jar to get to the interesting parts... ;-)
