简体繁体 English

Solr索引镶木地板文件

[英]Solr indexing parquet file

原文 2015-04-21 19:18:01 4 1 indexing/ solr/ parquet

I have a solr instance up and running and it should read parquet files to index. 我有一个Solr实例正在运行，它应该读取镶木地板文件以建立索引。 Right now, I am converting the parquet to flat text file and then having solr index them. 现在，我将实木复合地板转换为平面文本文件，然后使用solr对其进行索引。 I'd like to know if it is possible to read the parquet file directly for Solr to consume? 我想知道是否可以直接读取实木复合地板文件以供Solr使用吗？

Thanks 谢谢

1 个解决方案

Directly: no, not possible. 直接：不，不可能。

If you want something more integrated than what you are actually doing (converting to text and indexing might be good enough already), you can follow two ways: 如果您想要比实际所做的事情更集成的东西（转换为文本和建立索引可能已经足够好了），可以采用以下两种方法：

Create an specialized code around DIH, you probably can write a specialized DataSource , so you could use DIH to do the indexing. 围绕DIH创建专用代码，您可能可以编写专用DataSource ，因此可以使用DIH进行索引。
Just write some java code using SolrJ that reads your file and indexes to Solr 只需使用SolrJ编写一些Java代码即可读取文件并索引Solr

使用solr索引json文件 - indexing json file using solr

在Apache Solr中索引XML文件 - Indexing an XML file in Apache Solr

在solr中索引xml文件时找不到404 - 404 not found while indexing xml file in solr

Solr 4.3.1使用DataImportHandler索引xml文件 - solr 4.3.1 indexing an xml file with DataImportHandler

Solr索引编制问题，Solr索引编制链不完整 - Issue with Solr Indexing, Solr Indexing Chain is not complete

使用TikaEntityProcessor获取图像文件元数据并索引到Solr - Obtaining Image File Metadata and Indexing to Solr using TikaEntityProcessor

Solr索引文件删除了html标签和垃圾内容表单索引 - Solr Index file removing html tags and garbage content form indexing

在 S3 中索引和分区 Parquet - Indexing and partitioning Parquet in S3

索引到Solr中的特定核心 - Indexing to a specific core in Solr

在 Solr 中索引和查询 URL - Indexing and Querying URLS in Solr

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 使用solr索引json文件 - indexing json file using solr 在Apache Solr中索引XML文件 - Indexing an XML file in Apache Solr 在solr中索引xml文件时找不到404 - 404 not found while indexing xml file in solr Solr 4.3.1使用DataImportHandler索引xml文件 - solr 4.3.1 indexing an xml file with DataImportHandler Solr索引编制问题，Solr索引编制链不完整 - Issue with Solr Indexing, Solr Indexing Chain is not complete 使用TikaEntityProcessor获取图像文件元数据并索引到Solr - Obtaining Image File Metadata and Indexing to Solr using TikaEntityProcessor Solr索引文件删除了html标签和垃圾内容表单索引 - Solr Index file removing html tags and garbage content form indexing 在 S3 中索引和分区 Parquet - Indexing and partitioning Parquet in S3 索引到Solr中的特定核心 - Indexing to a specific core in Solr 在 Solr 中索引和查询 URL - Indexing and Querying URLS in Solr

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM