簡體   English   中英

Django elasticsearch dsl 術語和短語搜索不起作用

[英]Django elasticsearch dsl term and phrase search is not working

我使用了兩個包(即django-elasticsearch-dsl==7.1.4django-elasticsearch-dsl-drf==0.20.8 )將搜索引擎添加到我的 Django 項目中。 我用彈性索引的模型是:

class Article(models.Model):
    created_time = models.DateTimeField(_('created time'), auto_now_add=True)
    updated_time = models.DateTimeField(_('updated time'), auto_now=True)
    profile = models.ForeignKey('accounts.UserProfile', verbose_name=_('profile'), on_delete=models.PROTECT)
    approved_user = models.ForeignKey(settings.AUTH_USER_MODEL, verbose_name=_('approved user'), blank=True, null=True, editable=False, on_delete=models.CASCADE, related_name='article_approved_users')
    approved_time = models.DateTimeField(_('approved time'), blank=True, null=True, db_index=True, editable=False)
    title = models.CharField(_('title'), max_length=50)
    image = models.ImageField(_('image'), blank=True, upload_to=article_directory_path)
    slug = models.SlugField(_('slug'), max_length=50, unique=True)
    content = models.TextField(_('content'))
    summary = models.TextField(_('summary'))
    views_count = models.PositiveIntegerField(verbose_name=_('views count'), default=int, editable=False)
    is_free = models.BooleanField(_('is free'), default=True)
    is_enable = models.BooleanField(_('is enable'), default=True)

    tags = TaggableManager(verbose_name=_('tags'), related_name='articles')
    categories = models.ManyToManyField('Category', verbose_name=_('categories'), related_name='articles')
    namads = models.ManyToManyField('namads.Namad', verbose_name=_('namads'), related_name='articles', blank=True)

我使用以下文檔來索引我的Article模型:

html_strip = analyzer(
    'html_strip',
    tokenizer="whitespace",
    filter=["lowercase", "stop", "snowball"],
    char_filter=["html_strip"]
)


@registry.register_document
class ArticleDocument(Document):
    title = fields.TextField(
        analyzer=html_strip,
        fields={
            'raw': fields.TextField(analyzer='keyword'),
            'suggest': fields.CompletionField(),
        }
    )

    tags = fields.ObjectField(
        properties={
            "name": fields.TextField(
                analyzer=html_strip,
                fields={
                    'raw': fields.TextField(analyzer='keyword'),
                    'suggest': fields.CompletionField(),
                }
            )
        }
    )
    categories = fields.ObjectField(
        properties={
            'id': fields.IntegerField(),
            'title': fields.TextField(
                analyzer=html_strip,
                fields={
                    'raw': fields.TextField(analyzer='keyword'),
                    'suggest': fields.CompletionField(),
                }
            )
        }
    )
    namads = fields.ObjectField(
        properties={
            "id": fields.IntegerField(),
            "name": fields.TextField(
                analyzer=html_strip,
                fields={
                    'raw': fields.TextField(analyzer='keyword'),
                    'suggest': fields.CompletionField(),
                }
            ),
            "group_name": fields.TextField(
                analyzer=html_strip,
                fields={
                    'raw': fields.TextField(analyzer='keyword'),
                    'suggest': fields.CompletionField(),
                }
            )
        }
    )

    class Index:
        name = settings.ARTICLE_INDEX_NAME
        settings = {
            "number_of_shards": 1,
            "number_of_replicas": 0
        }

    def get_queryset(self):
        return super(ArticleDocument, self).get_queryset().filter(
            approved_user__isnull=False,
            is_enable=True
        ).prefetch_related(
            'tags',
            'categories',
            'namads'
        )

    class Django:
        model = Article
        fields = ['id', 'summary']

最后使用以下視圖集對其結果進行搜索(基於此文檔。)

class ArticleSearchViewSet(DocumentViewSet):
    """

        list:
            Search on all articles, ordered by most recently added.

            query parameters
            -  Search fields: 'title', 'summary', 'tags.name', 'categories.title', 'namads.name',
                'namads.group_name' . Ex: ?search=some random name.
        retrieve:
            Return a specific article details.

    """
    serializer_class = ArticleDocumentSerializer
    document = ArticleDocument
    pagination_class = PageNumberPagination
    lookup_field = 'id'
    filter_backends = [
        FilteringFilterBackend,
        IdsFilterBackend,
        OrderingFilterBackend,
        DefaultOrderingFilterBackend,
        CompoundSearchFilterBackend,
        SuggesterFilterBackend,
    ]
    search_fields = (
        'title',
        'summary',
        'tags.name',
        'categories.title',
        'namads.name',
        'namads.group_name'
    )
    filter_fields = {
        'id': {
            'field': 'id',
            # Note, that we limit the lookups of id field in this example,
            # to `range`, `in`, `gt`, `gte`, `lt` and `lte` filters.
            'lookups': [
                LOOKUP_FILTER_RANGE,
                LOOKUP_QUERY_IN,
                LOOKUP_QUERY_GT,
                LOOKUP_QUERY_GTE,
                LOOKUP_QUERY_LT,
                LOOKUP_QUERY_LTE,
            ],
        },
        'namads': {
            'field': 'namads',
            # Note, that we limit the lookups of `pages` field in this
            # example, to `range`, `gt`, `gte`, `lt` and `lte` filters.
            'lookups': [
                LOOKUP_FILTER_RANGE,
                LOOKUP_QUERY_GT,
                LOOKUP_QUERY_GTE,
                LOOKUP_QUERY_LT,
                LOOKUP_QUERY_LTE,
            ],
        },
        'title': "title.raw",
        'summary': 'summary',
        'categories': {
            'field': 'categories',
            # Note, that we limit the lookups of `pages` field in this
            # example, to `range`, `gt`, `gte`, `lt` and `lte` filters.
            'lookups': [
                LOOKUP_FILTER_RANGE,
                LOOKUP_QUERY_GT,
                LOOKUP_QUERY_GTE,
                LOOKUP_QUERY_LT,
                LOOKUP_QUERY_LTE,
            ],
        },

        'tags': {
            'field': 'tags',
            # Note, that we limit the lookups of `tags` field in
            # this example, to `terms, `prefix`, `wildcard`, `in` and
            # `exclude` filters.
            'lookups': [
                LOOKUP_FILTER_TERMS,
                LOOKUP_FILTER_PREFIX,
                LOOKUP_FILTER_WILDCARD,
                LOOKUP_QUERY_IN,
                LOOKUP_QUERY_EXCLUDE,
            ],
        },
    }
    # Suggester fields
    suggester_fields = {
        'title_suggest': {
            'field': 'title.suggest',
            'suggesters': [
                SUGGESTER_TERM,
                SUGGESTER_COMPLETION,
                SUGGESTER_PHRASE,

            ],
            'default_suggester': SUGGESTER_COMPLETION,
            'options': {
                'size': 10,  # Number of suggestions to retrieve.
                'skip_duplicates': True,  # Whether duplicate suggestions should be filtered out.
            },
        },
        'tags_suggest': {
            'field': 'tags.name.suggest',
            'suggesters': [
                SUGGESTER_COMPLETION,
            ],
            # 'options': {
            #     'size': 20,  # Override default number of suggestions
            # },
        },
        'categories_suggest': {
            'field': 'categories.title.suggest',
            'suggesters': [
                SUGGESTER_TERM,
                SUGGESTER_COMPLETION,
                SUGGESTER_PHRASE,
            ],
        },
        'namads_name_suggest': {
            'field': 'namads.name.suggest',
            'suggesters': [
                SUGGESTER_COMPLETION,
            ],
        },
        'namad_group_name_suggest': {
            'field': 'namads.group_name.suggest',
            'suggesters': [
                SUGGESTER_COMPLETION,
            ],
        },

    }
    ordering_fields = {
        'id': 'id',
        'title': 'title',
        'summary': 'summary',
    }
    # Specify default ordering
    ordering = ('-id', )

我的文檔序列化程序是:

class ArticleDocumentSerializer(DocumentSerializer):
    class Meta:
        document = ArticleDocument
        fields = ['id', 'title', 'summary', 'namads', 'categories', 'tags']

一切正常,除非我使用帶有以下查詢參數的termphrase搜索:

?title_suggest__phrase=fi

?title_suggest__term=fi

在 url localhost/api/v1/blog/articles-search/suggest/但在這兩種情況下,結果都相同,如下所示:

{
    "title_suggest__term": [
        {
            "text": "fi",
            "offset": 0,
            "length": 2,
            "options": []
        }
    ]
}.

而且我很確定我的索引中有First article ,當我使用完成建議器時,一切正常(即?title_suggest__completion=fi )並返回結果。 我錯過了什么嗎? 我想在我的項目中添加術語和短語搜索。 如何解決此問題(術語和短語結果為 0)?

問題出在我的建議者配置中。 首先, termphrase建議我們不需要完成字段(即'suggest': fields.CompletionField() ),我們只需要在我們的Index聲明我們的字段,例如:

title = fields.TextField(
        fields={
            'raw': fields.TextField(analyzer=html_strip)
        }
    ) # which goes in documents.py

只需將以下內容添加到任何字段即可啟用termphrase建議:

suggester_fields = {
        'title_suggest': {
            'field': 'title',
            'suggesters': [
                SUGGESTER_TERM,
                SUGGESTER_PHRASE,

            ],
        },
    } # Which goes in views.py and search view suggester_fields

並查看相關的建議結果,我們應該發送查詢參數,如?title_suggest__term=something?title_suggest__phrase=something 最后,如果我們也需要向建議字段添加完成,我們應該使用其他一些鍵添加它,例如:

suggester_fields = {
        'title_suggest': {
            'field': 'title',
            'suggesters': [
                SUGGESTER_TERM,
                SUGGESTER_PHRASE,

            ],
        },
        'title': {
            'field': 'title',
            'suggesters': [
                SUGGESTER_COMPLETION,
            ],
        },
    }

現在有了這個配置,我們有關於標題字段的三種類型的建議(即termphrasecompletion )。 因此,如果我們想要基於這三個建議器的整個結果,我們應該使用三個查詢參數,例如:

localhost/api/v1/blog/articles-search/suggest/?title_suggest__term=something&title_suggest__phrase=something&title__completion=something

並且不要忘記更改Index的字段配置(以防我們需要完成建議):

title = fields.TextField(
        fields={
            'raw': fields.TextField(analyzer=html_strip),
            'suggest': fields.CompletionField(),
        }
    ) # Which goes in documents.py

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM