Solr spellcheck for multi-word phrases

Question

I have a problem with Solr spellcheck suggestions for multi-word phrases. With the query for "red chillies"

q=red+chillies&wt=xml&indent=true&spellcheck=true&spellcheck.extendedResults=true&spellcheck.collate=true

I get

<lst name="suggestions">
  <lst name="chillies">
    <int name="numFound">2</int>
    <int name="startOffset">4</int>
    <int name="endOffset">12</int>
    <int name="origFreq">0</int>
    <arr name="suggestion">
      <lst><str name="word">chiller</str><int name="freq">4</int></lst>
      <lst><str name="word">challis</str><int name="freq">2</int></lst>
    </arr>
  </lst>
  <bool name="correctlySpelled">false</bool>
  <str name="collation">red chiller</str>
</lst>

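The problem is that even though "chiller" has 4 results in the index, "red chiller" has none. So we end up suggesting a phrase that returns 0 results.

How can I make spellcheck work on the whole phrase only? I tried using KeywordTokenizerFactory in the query analyzer: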

<fieldType name="text_spell" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.StandardTokenizerFactory" />
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true" />
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/> 
    <filter class="solr.LowerCaseFilterFactory" />
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.KeywordTokenizerFactory" />
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true" />
    <filter class="solr.LowerCaseFilterFactory" />
  </analyzer>
</fieldType>

I also tried adding

<str name="sp.query.extendedResults">false</str>

within

<lst name="spellchecker">

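in solrconfig.xml.

However, neither seems to make any difference.

What is the best way to make spellcheck return only collations that have results for the whole phrase? Thanks!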

Answer 1

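The real issue is that you need to specify spellcheck.collateParam.q.op=AND and (optionally) spellcheck.collateParam.mm=100%. These parameters force the collation query to be executed so that every term must match.

You can read more about this in the Solr documentation.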
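
As a rough sketch of how these parameters might be set as handler defaults in solrconfig.xml (the handler name /select and the values 10 and 1 are assumptions for illustration, not taken from the question):

<requestHandler name="/select" class="solr.SearchHandler">
  <lst name="defaults">
    <str name="spellcheck">true</str>
    <str name="spellcheck.collate">true</str>
    <!-- Assumed value: test up to 10 candidate collations against the index.
         With the default of 0, collations are not verified at all. -->
    <str name="spellcheck.maxCollationTries">10</str>
    <!-- Assumed value: return at most one verified collation. -->
    <str name="spellcheck.maxCollations">1</str>
    <!-- Overrides applied only while candidate collations are test-executed. -->
    <str name="spellcheck.collateParam.q.op">AND</str>
    <str name="spellcheck.collateParam.mm">100%</str>
  </lst>
  <arr name="last-components">
    <str>spellcheck</str>
  </arr>
</requestHandler>

The same settings can be passed on the request instead, with the percent sign URL-encoded:

q=red+chillies&spellcheck=true&spellcheck.collate=true&spellcheck.maxCollationTries=10&spellcheck.collateParam.q.op=AND&spellcheck.collateParam.mm=100%25

Note that the collateParam overrides only matter when spellcheck.maxCollationTries is greater than 0, since that is what makes Solr run each candidate collation as a query before returning it. Which of q.op and mm takes effect depends on the query parser in use; mm is a (e)dismax parameter.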
