Nutch ERROR tika.TikaParser on Eclipse
I am running Nutch 2.2.1 on Eclipse Juno SR1 and JRE 1.7.0_25.
The PARSE step is failing with this error:
2013-08-15 19:35:26,555 ERROR tika.TikaParser - Can't retrieve Tika parser for mime-type application/pdf
2013-08-15 19:35:26,557 WARN parse.ParseUtil - Unable to successfully parse content
It seems like this error is coming from TikaConfig.java, because I get an empty iterator:
Iterator<Parser> iterator = ServiceRegistry.lookupProviders(
Parser.class, this.getClass().getClassLoader());
The same PARSE call from Cygwin succeeds, and there I get all the parsers from the service registry. So maybe the solution is some Eclipse configuration related to the service registry.
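One way to narrow this down is to check whether the provider file that the lookup reads is visible on the Eclipse run classpath at all. The snippet below is only a rough diagnostic sketch, not part of Nutch or Tika: the class name `ServiceFileCheck` and its methods are made up for illustration, and it assumes the Tika parsers are registered through the standard `META-INF/services/org.apache.tika.parser.Parser` provider file. It uses only the JDK and lists every copy of that file the class loader can see.

```java
import java.io.IOException;
import java.net.URL;
import java.util.Collections;
import java.util.List;

/** Hypothetical diagnostic helper; not part of Nutch or Tika. */
final class ServiceFileCheck {

    /** Builds the classpath resource name that a service lookup reads for the given interface. */
    static String resourceName(String serviceClassName) {
        return "META-INF/services/" + serviceClassName;
    }

    /** Returns every copy of the provider file that the given class loader can see. */
    static List<URL> findServiceFiles(String serviceClassName, ClassLoader loader)
            throws IOException {
        return Collections.list(loader.getResources(resourceName(serviceClassName)));
    }

    public static void main(String[] args) throws IOException {
        // Assumption: Tika parsers are registered under this interface name.
        String service = args.length > 0 ? args[0] : "org.apache.tika.parser.Parser";
        List<URL> files = findServiceFiles(service, ServiceFileCheck.class.getClassLoader());
        if (files.isEmpty()) {
            System.out.println("No provider file visible for " + service);
        } else {
            for (URL url : files) {
                System.out.println(url);
            }
        }
    }
}
```

If this is run from Eclipse with the same launch configuration classpath as the failing PARSE job and prints no URL, then the jar that carries the provider file is probably not on that classpath, which would be consistent with the empty iterator. If it does print a URL, the problem is more likely in which class loader is used for the lookup.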
The Nutch setup in Eclipse is configured correctly as far as nutch-site.xml and parse-plugins.xml are concerned.
I would appreciate any ideas on how to solve this.
Check out this thread; it seems the poster had a similar problem and resolved it:
http://lucene.472066.n3.nabble.com/Nutch-2-x-Eclipse-Can-t-retrieve-Tika-parser-for-mime-type-application-pdf-td4015896.html