How to run Nutch 1.9 in Eclipse on Windows?


Question

I want to run Nutch 1.9 in Eclipse on Windows. I followed the tutorial from http://wiki.apache.org/nutch/RunNutchInEclipse and opened the project in Eclipse.

But when I run Nutch, I get the following error:

2014-09-19 17:45:48,039 INFO  crawl.Injector (Injector.java:inject(283)) - Injector: starting at 2014-09-19 17:45:48
2014-09-19 17:45:48,043 INFO  crawl.Injector (Injector.java:inject(284)) - Injector: crawlDb: K:/kumar/Nutch/apache-nutch-1.9/crawlresult
2014-09-19 17:45:48,043 INFO  crawl.Injector (Injector.java:inject(285)) - Injector: urlDir: K:/kumar/Nutch/apache-nutch-1.9/urls
2014-09-19 17:45:48,043 INFO  crawl.Injector (Injector.java:inject(294)) - Injector: Converting injected urls to crawl db entries.
2014-09-19 17:45:48,207 INFO  jvm.JvmMetrics (JvmMetrics.java:init(71)) - Initializing JVM Metrics with processName=JobTracker, sessionId=
2014-09-19 17:45:48,252 WARN  mapred.JobClient (JobClient.java:configureCommandLineOptions(661)) - No job jar file set.  User classes may not be found. See JobConf(Class) or JobConf#setJar(String).
2014-09-19 17:45:48,268 INFO  mapred.FileInputFormat (FileInputFormat.java:listStatus(192)) - Total input paths to process : 1
2014-09-19 17:45:48,485 INFO  mapred.JobClient (JobClient.java:monitorAndPrintJob(1275)) - Running job: job_local_0001
2014-09-19 17:45:48,487 INFO  mapred.FileInputFormat (FileInputFormat.java:listStatus(192)) - Total input paths to process : 1
2014-09-19 17:45:48,526 INFO  mapred.MapTask (MapTask.java:runOldMapper(347)) - numReduceTasks: 0
2014-09-19 17:45:48,565 INFO  plugin.PluginRepository (PluginManifestParser.java:parsePluginFolder(87)) - Plugins: looking in: K:\Nutch\apache-nutch-1.9\plugins
2014-09-19 17:45:48,566 WARN  plugin.PluginRepository (PluginManifestParser.java:parsePluginFolder(101)) - java.io.FileNotFoundException: K:\Nutch\apache-nutch-1.9\plugins\creativecommons\plugin.xml (The system cannot find the file specified)

It seems that Hadoop is causing the error, and I don't know how to solve it. I know Nutch requires a Unix environment, but I want to run Nutch in Eclipse on Windows.

Can anybody help me to solve this?

Answer

Download Cygwin, then add it to your PATH environment variable. I think your problem is caused by the fact that Windows can't invoke native Unix commands. That is what I did; however, as soon as I got past that problem, I ran into other problems.
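
To check whether the PATH change actually took effect for the JVM that Eclipse launches, something like the following might help. It is only a rough sketch: the class name `UnixToolCheck`, the helper `findOnPath`, and the list of commands are made up for illustration, and which Unix commands Hadoop really needs in your setup is an assumption, so adjust the list as needed.

```java
import java.io.File;
import java.nio.file.Files;
import java.nio.file.InvalidPathException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.regex.Pattern;

public class UnixToolCheck {

    // Hypothetical list of Unix commands to look for; not taken from Nutch or Hadoop docs.
    static final String[] COMMANDS = {"bash", "chmod", "df", "whoami"};

    /**
     * Returns the first regular file named "command" or "command.exe"
     * found in any directory listed in pathValue, or null if none exists.
     */
    static Path findOnPath(String command, String pathValue) {
        if (pathValue == null) {
            return null;
        }
        for (String dir : pathValue.split(Pattern.quote(File.pathSeparator))) {
            if (dir.isEmpty()) {
                continue;
            }
            for (String name : new String[] {command, command + ".exe"}) {
                try {
                    Path candidate = Paths.get(dir, name);
                    if (Files.isRegularFile(candidate)) {
                        return candidate;
                    }
                } catch (InvalidPathException e) {
                    // Skip PATH entries that are not valid paths (e.g. stray quotes).
                }
            }
        }
        return null;
    }

    public static void main(String[] args) {
        // PATH as seen by this JVM; run it from Eclipse to see what Nutch would see.
        String path = System.getenv("PATH");
        for (String cmd : COMMANDS) {
            Path hit = findOnPath(cmd, path);
            System.out.println(cmd + " -> " + (hit == null ? "NOT FOUND on PATH" : hit));
        }
    }
}
```

Run it as a Java application from the same Eclipse workspace. If the commands show up as NOT FOUND, Eclipse probably needs a restart to pick up the new environment, or the Cygwin directory containing the executables is not the one that was added to PATH.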
