单节点和单节点之间有什么区别? Hadoop中的伪分布式模式? [英] What is the difference between single node & pseudo-distributed mode in Hadoop?

查看:1872
本文介绍了单节点和单节点之间有什么区别? Hadoop中的伪分布式模式?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我想知道从配置的角度和理论上的观点有什么不同?

这两种模式使用不同的端口号吗?或任何其他差异?

解决方案

我的2分钱。

节点设置(独立设置)



默认情况下,Hadoop被配置为以非分布式或独立模式运行,即单个Java进程。没有守护进程正在运行,并且所有内容都在单个JVM实例中运行。不使用HDFS。



除了 JAVA_HOME 。只需下载tarball,将其解压缩,然后就可以开始使用了。

伪分布式模式



Hadoop守护进程在本地机器上运行,从而以小规模模拟群集。不同的Hadoop守护进程在不同的JVM实例中运行,但在单台机器上运行。使用HDFS代替本地FS。

就伪分布式设置而言,您需要至少设置以下两个属性以及 JAVA_HOME


  1. fs.default.name in core-site.xml


  2. mapred.job。 tracker mapred-site.xml


您可以拥有多个datanodes和tasktrackers,但在单台机器上没有多大意义。



HTH


I'd like to know what is difference from the configuration point of view as well as theoretical point of view?

Do these two modes use different port numbers? or any other difference?

解决方案

My 2 cents.

Single node setup (standalone setup)

By default, Hadoop is configured to run in a non-distributed or standalone mode, as a single Java process. There are no daemons running and everything runs in a single JVM instance. HDFS is not used.

You don't have to do anything as far as configuration is concerned, except the JAVA_HOME. Just download the tarball, unzip it, and you are good to go.

Pseudo-distributed mode

The Hadoop daemons run on a local machine, thus simulating a cluster on a small scale. Different Hadoop daemons run in different JVM instances, but on a single machine. HDFS is used instead of local FS.

As far as pseudo-distributed setup is concerned, you need to set at least following 2 properties along with JAVA_HOME:

  1. fs.default.name in core-site.xml.

  2. mapred.job.tracker in mapred-site.xml.

You could have multiple datanodes and tasktrackers, but that doesn't make much sense on a single machine.

HTH

这篇关于单节点和单节点之间有什么区别? Hadoop中的伪分布式模式?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆