简体   繁体   English

Eclipse上的Nutch错误tika.TikaParser

[英]Nutch ERROR tika.TikaParser on Eclipse

I am running Nutch 2.2.1 on Eclipse Juno SR1 and JRE 1.7.0_25 我在Eclipse Juno SR1和JRE 1.7.0_25上运行Nutch 2.2.1

The PARSE step is failing with this error: PARSE步骤失败,并显示以下错误:

2013-08-15 19:35:26,555 ERROR tika.TikaParser - Can't retrieve Tika parser for mime-type application/pdf
2013-08-15 19:35:26,557 WARN  parse.ParseUtil - Unable to successfully parse content

It seems like this error is coming from TikaConfig.java because I get an empty iterator: 似乎此错误来自TikaConfig.java,因为我得到了一个空的迭代器:

Iterator<Parser> iterator = ServiceRegistry.lookupProviders(
    Parser.class, this.getClass().getClassLoader());

The same PARSE call from CYGWIN is successful, I get all PARSERS from Service registry. 从CYGWIN进行的同一个PARSE调用是成功的,我从服务注册表中获得了所有的PARSERS。 So maybe the solution is configuration for Eclipse related to Service registry. 因此,也许解决方案是与Service Registry相关的Eclipse配置

Nutch-Eclipse is well configured about nutch-site.xml, parse-plugins.xml Nutch-Eclipse关于nutch-site.xml,parse-plugins.xml的配置很好

I appreciate any idea to solve this. 我感谢解决此问题的任何想法。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM