簡體   English   中英

Elasticsearch 沒有使用相同的令牌返回結果?

[英]Elasticsearch not returning result with same token?

ElasticSearch 中插入的數據是韓文,所以我不能給出確切的大小寫,但假設我有一個詞ABBCC被標記為["A","BBCC"] AZZXXX ["A","BBCC"]和另一個詞AZZXXX標記為["A","ZZXXX"] .

如果我搜索 ABBCC,那么 AZZXXX 不應該出現,因為它們具有相同的標記嗎? 或者這不是elasticsearch的工作方式嗎?

這是我檢查分析詞的方式:

GET recpost_test/_analyze
{
  "analyzer": "my_analyzer",
  "text":"my query String!" 
}

這就是我創建索引的方式:

PUT recpost
{
  "settings": {
    "index": {
      "analysis": {
        "tokenizer": {
          "nori_user_dict": {
            "type": "nori_tokenizer",
            "decompound_mode": "mixed",
            "user_dictionary": "userdict_ko.txt"
          }
        },
        "analyzer": {
          "my_analyzer": {
            "type": "custom",
            "tokenizer": "nori_user_dict"
          }
        },
        "filter": {
        "substring": {
          "type": "edgeNGram",
          "min_gram": 1,
          "max_gram": 10
        }
      }
      }
    }
  }
}

這就是我搜索的方式:

GET recpost/_search
{
  "_source": [""],
  "from": 0,
  "size": 2,
  "query":{
    "multi_match": {
      "query" : "my query String!",
      "type": "best_fields", 
      "fields" : [
        "brandkor",
        "content",
        "itemname",
        "name",
        "review",
        "shortreview^2",
        "title^3"]
    }
  }
}

編輯:我嘗試添加“分析器”字段進行搜索,但仍然無效

GET recpost/_search
{
  "_source": [""],
  "from": 0,
  "size": 2,
  "query":{
    "multi_match": {
      "query" : "깡스",
      "analyzer": "my_analyzer", 
      "type": "best_fields", 
      "fields" : [
        "brandkor",
        "content",
        "itemname",
        "name",
        "review",
        "shortreview^2",
        "title^3"]
    }
  }
}

EDIT2:這是我的映射:

{
  "recpost_test" : {
    "mappings" : {
      "properties" : {
        "@timestamp" : {
          "type" : "date"
        },
        "brandkor" : {
          "type" : "text",
          "fields" : {
            "keyword" : {
              "type" : "keyword",
              "ignore_above" : 256
            }
          }
        },
        "content" : {
          "type" : "text",
          "fields" : {
            "keyword" : {
              "type" : "keyword",
              "ignore_above" : 256
            }
          }
        },
        "field_statistics" : {
          "type" : "boolean"
        },
        "fields" : {
          "type" : "text",
          "fields" : {
            "keyword" : {
              "type" : "keyword",
              "ignore_above" : 256
            }
          }
        },
        "itemname" : {
          "type" : "text",
          "fields" : {
            "keyword" : {
              "type" : "keyword",
              "ignore_above" : 256
            }
          }
        },
        "name" : {
          "type" : "text",
          "fields" : {
            "keyword" : {
              "type" : "keyword",
              "ignore_above" : 256
            }
          }
        },
        "offsets" : {
          "type" : "boolean"
        },
        "payloads" : {
          "type" : "boolean"
        },
        "positions" : {
          "type" : "boolean"
        },
        "review" : {
          "type" : "text",
          "fields" : {
            "keyword" : {
              "type" : "keyword",
              "ignore_above" : 256
            }
          }
        },
        "shortreview" : {
          "type" : "text",
          "fields" : {
            "keyword" : {
              "type" : "keyword",
              "ignore_above" : 256
            }
          }
        },
        "term_statistics" : {
          "type" : "boolean"
        },
        "title" : {
          "type" : "text",
          "fields" : {
            "keyword" : {
              "type" : "keyword",
              "ignore_above" : 256
            }
          }
        },
        "type" : {
          "type" : "text",
          "fields" : {
            "keyword" : {
              "type" : "keyword",
              "ignore_above" : 256
            }
          }
        }
      }
    }
  }
}

我沒有看到您將字段安裝到索引(映射)。 所以就我所知,您正在將所有字段(brandkor、內容等)索引為text .. 並且基本上您正在匹配精確值。

除非您將每個字段與其分析器相關聯。

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM