Django Haystack search in HTML

Question

I was wondering (a quick Google search turned up nothing) whether it is possible, and if so how, to search directly in an HTML file and choose whether or not to ignore the tags.

To explain a bit further: we wrote a crawler, and it returns the HTML of each page. If I want to search the crawled content, do I need two separate fields, one with the HTML and one without, or can I keep a single field with the HTML and choose at search time whether to ignore the tags?

Thanks in advance.

Answer 1

If I understand you correctly, all you need is to build the search index without the HTML tags?

We solved that problem this way:

from haystack import indexes

class PostIndex(indexes.SearchIndex, indexes.Indexable):
    text = indexes.CharField(model_attr='text', use_template=True, document=True)

and in the template ( search/indexes/blogs/post_test.html ) we just used the striptags filter:

{{ object.content|striptags }}

After that you need to run build_schema and rebuild_index. Now it searches correctly, without the tags.
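To make the idea concrete, here is a minimal standard-library sketch of what "indexing without tags" means: the raw HTML stays in one field, and only its text content is passed to the index. It is not Django's striptags implementation, and the names TagStripper and strip_html are hypothetical helpers invented for this illustration.

```python
from html.parser import HTMLParser


class TagStripper(HTMLParser):
    """Collects only the text nodes of an HTML document (hypothetical helper)."""

    def __init__(self):
        # convert_charrefs=True turns entities such as &amp; into plain characters.
        super().__init__(convert_charrefs=True)
        self.chunks = []

    def handle_data(self, data):
        # Called for the text between tags; the tags themselves are dropped.
        self.chunks.append(data)


def strip_html(html):
    """Return the text content of `html` with all tags removed."""
    parser = TagStripper()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.chunks)


# Example: the crawler output is stored as-is, the stripped text is what gets indexed.
crawled = "<html><body><h1>Title</h1><p>Some <b>bold</b> text</p></body></html>"
print(strip_html(crawled))
```

With this approach a second stored field is not strictly needed: the model keeps the HTML, and the tag-free version is derived at index time, which is the role the striptags template filter plays in the answer above.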
