在docker容器上运行hadoop集群 [英] Run a hadoop cluster on docker containers

查看:243
本文介绍了在docker容器上运行hadoop集群的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我想运行一个多节点的hadoop集群,每个节点位于不同主机上的docker容器中。此图片 - https://github.com/sequenceiq/hadoop-docker 很好地开始hadoop在伪分布式模式下,最简单的修改方法是将每个节点放在单独的ec2主机上的不同容器中。

I want to run a multi-node hadoop cluster, with each node inside a docker container on a different host. This image - https://github.com/sequenceiq/hadoop-docker works well to start hadoop in a pseudo distributed mode, what is the easiest way to modify this to have each node in a different container on a separate ec2 host?

推荐答案

我用两个容器在两个不同的ubuntu主机上运行主节点和从属节点。我使用编织在容器之间进行了联网。我已经在docker hub帐户div4上添加了容器的映像。我以相同的方式安装了hadoop,因为它安装在不同的主机上。我已经添加了两个图像,在这里运行haddop:

I did this with two containers running master and slave nodes on two different ubuntu hosts. I did the networking between containers using weave. I have added the images of the containers on docker hub account div4. I installed hadoop in the same way, as its installed on different hosts. I have added the two images with coomands to run haddop on them here:

https://registry.hub.docker.com/u/div4/hadoop_master/
https://registry.hub.docker.com/u/div4/hadoop_slave/

这篇关于在docker容器上运行hadoop集群的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆