简体繁体 English

索引PDF-使用Apache Solr和Apache Tika进行分面搜索

[英]Indexing PDF - Faceted Search with Apache Solr and Apache Tika

原文 2016-10-25 14:44:37 9 1 regex/ apache/ solr/ apache-tika/ handles

Two weeks ago I'm having trouble finding the Internet a way for my solution. 两个星期前，我很难找到一种解决方案。 I need to integrate a web application with Apache Solr and Apache tika, to be made faceted search PDF's that are in the database of the system. 我需要将Web应用程序与Apache Solr和Apache tika集成在一起，以便进行多面搜索系统数据库中的PDF。 The configuration of solr and tika on my server everything is ok, but as I am new with these two tools, I'm not sure how to integrate one another and also with the application. 在我的服务器上配置solr和tika一切正常，但是由于我是这两个工具的新手，所以我不确定如何相互集成以及如何与应用程序集成。

1 个解决方案

Solr 6.2 ships with files example in the example/files that is configured specifically to index and browse rich-content files (like PDF). Solr 6.2附带了example / files中的文件示例，该文件示例专门配置为索引和浏览内容丰富的文件（例如PDF）。

Start by using that and try to understand how it is put together. 首先使用它，并尝试了解它是如何组合在一起的。

Apache Solr 和用于搜索的自定义字段过滤器 - Apache Solr and customized field filter for search

如何使用Java Apache Lucene检索PDF文档中的正则表达式搜索字母数字文本？ - How to retrieve regex search alphanumeric text in a PDF document using java Apache Lucene?

Apache URL重写搜索/替换 - Apache URL Rewrite search/replace

索引和查询时多个令牌过滤器的Apache Solr性能问题 - Apache Solr performance issue for multiple token filters at index and query time

正则表达式搜索并替换Apache标头 - Regex search and replace on Apache header edit

PHP和Apache Mod重写：搜索？ - PHP & Apache Mod rewrite: search?keyword

Apache RewriteEngine - Apache RewriteEngine

apache 位置匹配 - apache LocationMatch

Apache Ant：我可以搜索特定正则表达式的所有文件，然后将匹配项打印到文件中吗？ - Apache Ant: Can I search all files for a specific regex and then print the matches to a file?

Solr Webapp正则表达式搜索 - Solr Webapp Regular Expression Search

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 Apache Solr 和用于搜索的自定义字段过滤器 - Apache Solr and customized field filter for search 如何使用Java Apache Lucene检索PDF文档中的正则表达式搜索字母数字文本？ - How to retrieve regex search alphanumeric text in a PDF document using java Apache Lucene? Apache URL重写搜索/替换 - Apache URL Rewrite search/replace 索引和查询时多个令牌过滤器的Apache Solr性能问题 - Apache Solr performance issue for multiple token filters at index and query time 正则表达式搜索并替换Apache标头 - Regex search and replace on Apache header edit PHP和Apache Mod重写：搜索？ - PHP & Apache Mod rewrite: search?keyword Apache RewriteEngine - Apache RewriteEngine apache 位置匹配 - apache LocationMatch Apache Ant：我可以搜索特定正则表达式的所有文件，然后将匹配项打印到文件中吗？ - Apache Ant: Can I search all files for a specific regex and then print the matches to a file? Solr Webapp正则表达式搜索 - Solr Webapp Regular Expression Search

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM