how to make lucene be case-insensitive

Question

By default word "Word" and "word" are not the same. How can I make Lucene be case-insensitive?

Answer 1

The easiest approach is lowercasing all searchable content, as well as the queries. See the LowerCaseFilter documentation. You could also use Wildcard queries for case insensitive search since it bypasses the Analyzer .

You can store content in different fields to capture different case configurations if preferred.

Answer 2

The StandardAnalyzer applies a LowerCaseFilter that would make "Word" and "word" the same. You could simply pass that to your uses of IndexWriter and QueryParser . Eg a few line snippets:

Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_30);
IndexWriter writer = new IndexWriter(dir, analyzer, true, mlf);
QueryParser parser = new QueryParser(Version.LUCENE_30, field, analyzer);

Answer 3

除了使用StandardAnalyzer （包括LowerCaseFilter和常见英语单词（例如“ the”）的过滤器）之外，还应确保使用TextField而非用于精确搜索的StringField来构建文档。

Answer 4

Add LowerCaseFilterFactory to your fieldType for that field in Schema.xml. Example,

<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
            <analyzer type="index">
                <tokenizer class="solr.WhitespaceTokenizerFactory"/>
                <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0" splitOnCaseChange="1" preserveOriginal="1" />
                <filter class="solr.LowerCaseFilterFactory"/>
            </analyzer>

            <analyzer type="query">
                <tokenizer class="solr.WhitespaceTokenizerFactory"/>
                <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0" splitOnCaseChange="1"/>
                <filter class="solr.LowerCaseFilterFactory"/>
            </analyzer>
        </fieldType>

how to make lucene be case-insensitive

Question

4 answers

solution1
13 ACCPTED 2011-04-01 11:38:40

solution2
5 2011-04-01 12:01:26

solution3
1 2018-09-29 00:11:58

solution4
1 2012-02-15 23:40:11

how to make lucene be case-insensitive

Question

4 answers

solution1 13 ACCPTED 2011-04-01 11:38:40

solution2 5 2011-04-01 12:01:26

solution3 1 2018-09-29 00:11:58

solution4 1 2012-02-15 23:40:11

solution1
13 ACCPTED 2011-04-01 11:38:40

solution2
5 2011-04-01 12:01:26

solution3
1 2018-09-29 00:11:58

solution4
1 2012-02-15 23:40:11