How to run Nutch 1.9 in Eclipse on Windows?

I want to run Nutch 1.9 in Eclipse on Windows. I followed the tutorial at http://wiki.apache.org/nutch/RunNutchInEclipse and opened the project in Eclipse.

But when I run Nutch, I get the following error:

2014-09-19 17:45:48,039 INFO  crawl.Injector (Injector.java:inject(283)) - Injector: starting at 2014-09-19 17:45:48
2014-09-19 17:45:48,043 INFO  crawl.Injector (Injector.java:inject(284)) - Injector: crawlDb: K:/kumar/Nutch/apache-nutch-1.9/crawlresult
2014-09-19 17:45:48,043 INFO  crawl.Injector (Injector.java:inject(285)) - Injector: urlDir: K:/kumar/Nutch/apache-nutch-1.9/urls
2014-09-19 17:45:48,043 INFO  crawl.Injector (Injector.java:inject(294)) - Injector: Converting injected urls to crawl db entries.
2014-09-19 17:45:48,207 INFO  jvm.JvmMetrics (JvmMetrics.java:init(71)) - Initializing JVM Metrics with processName=JobTracker, sessionId=
2014-09-19 17:45:48,252 WARN  mapred.JobClient (JobClient.java:configureCommandLineOptions(661)) - No job jar file set.  User classes may not be found. See JobConf(Class) or JobConf#setJar(String).
2014-09-19 17:45:48,268 INFO  mapred.FileInputFormat (FileInputFormat.java:listStatus(192)) - Total input paths to process : 1
2014-09-19 17:45:48,485 INFO  mapred.JobClient (JobClient.java:monitorAndPrintJob(1275)) - Running job: job_local_0001
2014-09-19 17:45:48,487 INFO  mapred.FileInputFormat (FileInputFormat.java:listStatus(192)) - Total input paths to process : 1
2014-09-19 17:45:48,526 INFO  mapred.MapTask (MapTask.java:runOldMapper(347)) - numReduceTasks: 0
2014-09-19 17:45:48,565 INFO  plugin.PluginRepository (PluginManifestParser.java:parsePluginFolder(87)) - Plugins: looking in: K:\Nutch\apache-nutch-1.9\plugins
2014-09-19 17:45:48,566 WARN  plugin.PluginRepository (PluginManifestParser.java:parsePluginFolder(101)) - java.io.FileNotFoundException: K:\Nutch\apache-nutch-1.9\plugins\creativecommons\plugin.xml (The system cannot find the file specified)

It seems that Hadoop is causing the error, and I don't know how to solve it. I know Nutch requires a Unix environment, but I want to run it in Eclipse on Windows.

Can anybody help me solve this?

Download Cygwin, then add it to your PATH environment variable. I think your problem is caused by the fact that Windows can't invoke a native Unix command. That is what I did; however, as soon as I got past that problem, I ran into other problems.
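
To check whether the PATH change actually took effect for the JVM that Eclipse launches, a small sketch like the one below may help. It only looks for executables by name in the directories listed in PATH. The class name `PathCommandCheck` and the command names `chmod`, `ls` and `whoami` are illustrative assumptions, not something taken from the log above; substitute whichever command your own stack trace says cannot be run.

```java
import java.io.File;
import java.nio.file.Files;
import java.nio.file.InvalidPathException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Optional;

public class PathCommandCheck {

    // Suffixes tried for each command: bare name (Unix style) and .exe (Windows/Cygwin).
    private static final String[] EXTENSIONS = {"", ".exe"};

    /** Returns the first regular file matching the command in a PATH-style string. */
    public static Optional<Path> findOnPath(String command, String pathValue) {
        if (pathValue == null || pathValue.isEmpty()) {
            return Optional.empty();
        }
        // File.pathSeparator is ";" on Windows and ":" on Unix-like systems.
        for (String dir : pathValue.split(File.pathSeparator)) {
            if (dir.isEmpty()) {
                continue;
            }
            for (String ext : EXTENSIONS) {
                Path candidate;
                try {
                    candidate = Paths.get(dir, command + ext);
                } catch (InvalidPathException e) {
                    break; // malformed PATH entry (for example, stray quotes): skip it
                }
                if (Files.isRegularFile(candidate)) {
                    return Optional.of(candidate);
                }
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        String path = System.getenv("PATH");
        // Hypothetical list of Unix commands to look for; adjust to your error message.
        for (String cmd : new String[] {"chmod", "ls", "whoami"}) {
            Optional<Path> hit = findOnPath(cmd, path);
            System.out.println(cmd + " -> " + hit.map(Path::toString).orElse("not found on PATH"));
        }
    }
}
```

Run it from the same Eclipse workspace as Nutch, so that it sees the same environment. If the commands are reported as not found, Eclipse probably needs a restart to pick up the new PATH, or the Cygwin directory that holds the executables was not the one added.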
