How does the job client in Hadoop compute InputSplits?


Question

I am trying to gain insight into the MapReduce architecture. I am consulting this article: http://answers.oreilly.com/topic/2141-how-mapreduce-works-with-hadoop/. I have some questions regarding the JobClient component of the MapReduce framework. My questions are:

How does the JobClient compute the input splits on the data?

According to the material I am consulting, the JobClient computes input splits on the data located in the input path on HDFS specified when the job is run. The article says the JobClient then copies the resources (jars and computed input splits) to HDFS. Here is my question: when the input data is already in HDFS, why does the JobClient copy the computed input splits into HDFS?

Let's assume the JobClient copies the input splits to HDFS. When the job is submitted to the JobTracker and the JobTracker initializes the job, why does it retrieve the input splits from HDFS?

Apologies if my question is not clear. I am a beginner. :)

Answer

No, the JobClient does not copy the input data itself to HDFS; what it writes there is only split metadata. You have quoted the answer yourself:

Job Client computes input splits on the data located in the input path on the HDFS specified while running the job. The article says then Job Client copies the resources (jars and computed input splits) to the HDFS.

The input itself stays on the cluster. The client only computes on the meta information it gets from the namenode (block size, data length, block locations). These computed input splits carry that meta information to the tasks, e.g. the block offset and the length to compute on.
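To make the "only metadata" point concrete, here is a minimal, simplified sketch of how splits can be derived purely from namenode-style information (file length and block size). The class and method names below are made up for illustration; Hadoop's real logic lives in FileInputFormat.getSplits() and additionally honors min/max split sizes and block locations.

    import java.util.ArrayList;
    import java.util.List;

    public class SplitSketch {
        // Hypothetical value class mirroring what a FileSplit records.
        static class SplitInfo {
            final String path;
            final long start;
            final long length;
            SplitInfo(String path, long start, long length) {
                this.path = path; this.start = start; this.length = length;
            }
            @Override public String toString() {
                return path + " [offset=" + start + ", length=" + length + "]";
            }
        }

        // Chop a file into splits of at most blockSize bytes.
        // Note: no file data is read here, only lengths and offsets.
        static List<SplitInfo> computeSplits(String path, long fileLength, long blockSize) {
            List<SplitInfo> splits = new ArrayList<>();
            long offset = 0;
            while (offset < fileLength) {
                long length = Math.min(blockSize, fileLength - offset);
                splits.add(new SplitInfo(path, offset, length));
                offset += length;
            }
            return splits;
        }

        public static void main(String[] args) {
            // A 300 MB file with a 128 MB block size yields 3 splits.
            for (SplitInfo s : computeSplits("/data/input.txt", 300L << 20, 128L << 20)) {
                System.out.println(s);
            }
        }
    }

These tiny descriptors are what gets written to HDFS alongside the jars, which is why copying them is cheap even for very large input files.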

Have a look at org.apache.hadoop.mapreduce.lib.input.FileSplit: it contains the file path, the start offset, and the length of the chunk a single task will operate on as its input. The serializable class you may also want to look at is org.apache.hadoop.mapreduce.split.JobSplit.SplitMetaInfo.
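To see what one of these descriptors actually holds, here is a short example using the real FileSplit class; the file path and host names are hypothetical.

    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.mapreduce.lib.input.FileSplit;

    public class FileSplitDemo {
        public static void main(String[] args) throws Exception {
            // A split describing the first 128 MB of a (hypothetical) file,
            // preferably scheduled on the nodes that hold that block.
            FileSplit split = new FileSplit(
                    new Path("hdfs:///data/input.txt"),          // file path
                    0L,                                          // start offset in bytes
                    128L << 20,                                  // length (128 MB)
                    new String[] { "datanode1", "datanode2" });  // block locations

            System.out.println("path   = " + split.getPath());
            System.out.println("start  = " + split.getStart());
            System.out.println("length = " + split.getLength());
            // Only coordinates into HDFS are stored here, never the data itself.
        }
    }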

This meta information is computed for each task that will be run, and is copied together with the jars to the node that will actually execute the task.

