[英]The whoosh can not search the key word successfully
1.我正在写一个非常简单的 whoosh 项目。 首先,我读取一个 txt 文件并使用 read() 方法获取 txt 文件中的所有内容。 然后为这个内容建立一个索引。
2.实现代码如下:
对于txt文件内容:
#import functions from whoosh
import whoosh
from whoosh.index import create_in
from whoosh.fields import *
from whoosh.qparser import QueryParser
schema = Schema(title=TEXT(stored=True), path=ID(stored=True), content=TEXT)
ix = create_in(".", schema)
writer = ix.writer()
i = 0
f = open("read.txt", "r")
print(f.read())
writer.add_document(title=u"document "+str(i), path=u".",content=f.read()) #python iterator i starting from 0
writer.commit(optimize=True)
searcher = ix.searcher()
parser = QueryParser("content", ix.schema)
stringquery = parser.parse("Hello")
results = searcher.search(stringquery)
print ("search 1 result:")
print (results)
for r in results:
print (r)
对于txt文件内容:
Hello this is the test
I hope you are doing well
I think you can do it without problem
This is so cool without funciton
'Hello' 假设存储在索引中,但是当我尝试搜索 hello 时,它什么也不返回
search 1 result:
<Top 0 Results for Term('content', 'hello') runtime=7.878600001731684e-05>
您的第一次调用f.read()
打印文件中的文本,下一次调用f.read()
没有可读取的内容并且不返回任何内容。 存储文本。
file_content = f.read()
print(file_content)
writer.add_document(title=u"document "+str(i), path=u".", content=file_content)
为了进一步证明,
$ cat test.txt
Hello this is the test
I hope you are doing well
I think you can do it without problem
This is so cool without funciton
$ ipython
Python 3.9.0 (default, Dec 2 2020, 10:34:08)
Type 'copyright', 'credits' or 'license' for more information
IPython 7.19.0 -- An enhanced Interactive Python. Type '?' for help.
In [1]: f = open("test.txt", "r")
In [2]: print(f.read())
Hello this is the test
I hope you are doing well
I think you can do it without problem
This is so cool without funciton
In [3]: print(f.read())
In [4]: quit
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.