标签[indexwriter] - 堆栈内存溢出

自定义nutch索引编写器将值映射到弹性docin multielvel（如JSON） - customizing the nutch indexwriter to map values into elastic docin multielvel(like JSON)

我将为apache nutch开发一个插件来自定义索引编写器，我的问题是，当你有权访问NutchDocument时，你只需将数据放在第一级而不是第二级。例如，对于“a”，“location”和“url”，您可以轻松地通过doc.add("url", "www.csad.com");放置数据d ...

Python Lucene函数将字段内容添加到文档中不起作用 - Python lucene function add field contents to document not working

我正在用python Lucene索引URL页面。我在尝试向文档中添加字段时遇到了一些错误。我不知道为什么。错误提示： JavaError：，> Java stacktrace：java.lang.IllegalArgumentException：既没有索引也没有存储 ...

Apache Lucene的实现 - Implementation of Apache Lucene

我正在阅读过去几周的Apache Lucene的源代码，并试图找出主要负责在磁盘上编写发布列表/索引的方法或类。我已经阅读了很多有关索引的文章，并试图找到在将索引写到磁盘上的过程中正在调用某种方法的地方，但是失败了。我知道某些内部缓冲区已满时会定期写入索引或发布列表。如果有人已经阅读 ...

Lucene IndexWriter.Close（）与indexWriter.Commit（） - Lucene IndexWriter.Close() vs indexWriter.Commit()

当我只有indexWriter的一个实例时， IndexWriter.Close()和IndexWriter.Commit()什么indexWriter ？注意：我要创建索引的数据很大，所以我无法关闭IndexWriter运行时。注意：我想同时索引数据时搜索文档。 ...

如何修改由nutch索引编写器用于elasticsearch的索引模板？ - How to modify the index template used by the nutch index writer for elasticsearch?

现成的用于Elasticsearch的nutch索引编写器会使用属性元素中的nutch-site.xml（或nutch-default.xml）中提供的名称在elasticsearch中生成一个索引：对于这样一个自动生成的索引，elasticsearch中的映射部分始终具有以下结构 ...

Lucene-Appengine的SegmentIndexInput readByte方法中的NullPointerException - NullPointerException at Lucene-Appengine's SegmentIndexInput readByte method

我从运行Lucene得到以下错误跟踪。我包括提示错误的代码段。具体的项目/连接器是Lucene for Appengine 码错误这是我的appengine-web.xml文件，我也按照建议使用修改后的RamUsageEstimator ...

如何在不删除文档的情况下保持Lucene索引 - How to keep Lucene index without deleted documents

这是我关于堆栈溢出的第一个问题，请祝我好运。我正在使用Java对Lucene索引进行分类，因此我需要更新名为category的文档字段。为此，我一直在使用带有索引编写器updateDocument（）函数的Lucene 4.2，除了删除部分外，它运行得很好。即使我在更新后使用fo ...

提交后调用将文档添加到Lucene Index Writer - Adding documents to Lucene Index Writer after commit is called

我正在使用Lucene 4.6。我创建了一个Lucene IndexWriter（在CREATE MODE中）并添加了文档并提交了它（没有关闭它）。然后运行搜索查询并存储结果。再次，我向索引编写器添加了文档，并提交并关闭了它。并对其进行搜索查询。它给出了新数据和旧数据的结果。 ...

IndexWriter.commit（）不足以更改索引 - IndexWriter.commit() not sufficient to change index

我刚刚将我们的Lucene实现从3.1移植到4.9，但是如果没有forceMerge（1）（曾经是优化的），我似乎仍然无法更新索引中的文档。我了解，每当更新文档时，旧文档就会标记为已删除，新文档会存储在新的段/文件中。即使升级后，我仍然看到的结果是该文档被标记为已删除，但是搜索者似乎从未 ...