Lucene Solr not highlighting special characters like dot and slash in search results

Question

As the title says: I get results back from Solr, but the special characters inside the matched text are not highlighted. For example:

<em>00</em>:<em>00.000Z</em>

Solr highlighting parameters:

&hl.simple.pre=<em>&hl.simple.post=</em>

Example query: all:* returns the value Hello/World highlighted as:

<em>Hello</em> / <em>World</em>

field analyzer:

<fieldType name="text_en" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <!-- in this example, we will only use synonyms at query time
    <filter class="solr.SynonymFilterFactory" synonyms="index_synonyms.txt" ignoreCase="true" expand="false"/>
    -->
    <!-- Case insensitive stop word removal.
      add enablePositionIncrements=true in both the index and query
      analyzers to leave a 'gap' for more accurate phrase queries.
    -->
    <filter class="solr.StopFilterFactory"
            ignoreCase="true"
            words="lang/stopwords_en.txt"
            enablePositionIncrements="true"
            />
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.EnglishPossessiveFilterFactory"/>
    <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
    <!-- Optionally you may want to use this less aggressive stemmer instead of PorterStemFilterFactory:
    <filter class="solr.EnglishMinimalStemFilterFactory"/>
    -->
    <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
    <filter class="solr.StopFilterFactory"
            ignoreCase="true"
            words="lang/stopwords_en.txt"
            enablePositionIncrements="true"
            />
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.EnglishPossessiveFilterFactory"/>
    <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
    <!-- Optionally you may want to use this less aggressive stemmer instead of PorterStemFilterFactory:
    <filter class="solr.EnglishMinimalStemFilterFactory"/>
    -->
    <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
</fieldType>

Answer 1

StandardTokenizer discards characters like / when it splits the text into tokens. The highlighter wraps each matched token separately, so the discarded characters end up outside the <em> tags. What you are looking for here is perhaps WhitespaceTokenizer instead. Other than that, the colon (:) has special significance for the lucene query parser and for edismax, so you may want to try the simpler but more robust dismax query parser.
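
As a rough illustration of the tokenizer swap suggested above, here is a minimal sketch of a field type built on WhitespaceTokenizerFactory. The name text_en_ws is hypothetical, and the filter chain is deliberately cut down to lowercasing. Adapt it to your own schema rather than treating it as a drop-in replacement for text_en.

```xml
<!-- Sketch only: "text_en_ws" is a made-up name, not part of the original schema. -->
<fieldType name="text_en_ws" class="solr.TextField" positionIncrementGap="100">
  <!-- One analyzer without a type attribute is used for both indexing and querying. -->
  <analyzer>
    <!-- Splits on whitespace only, so Hello/World stays a single token
         and the slash remains inside the highlighted span. -->
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

Keep the trade-off in mind: because Hello/World is now indexed as one token, a search for hello alone will no longer match it. Changing the index-time analyzer also means the affected documents have to be reindexed before the new highlighting shows up.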

solution1
0 2014-06-27 13:02:54
