
Google search results limits

I searched Google for the word "sunday".

The reported total number of results is 1,390,000,000. However, I can only see the first 420 results, up to page 42.

Is there any way to get all the results of a Google search?

You can use the google module to control the number of results from your query.

Install it using pip:

pip install google

Usage:

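# Older releases of the package expose the function via `google`;
# newer releases may need `from googlesearch import search` instead.
from google import search

n = 10  # number of results
query = 'sunday'
results = search(query, stop=n)  # returns a generator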

for result in results:
    print(result)

Before using this method, keep this in mind about the pause parameter:

pause (float) - Lapse to wait between HTTP requests. A lapse too long will make the search slow, but a lapse too short may cause Google to block your IP. Your mileage may vary!

Docs
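
To make the pause trade-off concrete, here is a minimal sketch that is not taken from the library's docs. `collect_results` and `fake_search` are hypothetical names. The helper assumes the search function accepts `stop` and `pause` keyword arguments, as the `search` function above does, and takes that function as a parameter so it can be tried with a stand-in before sending any real requests.

```python
from itertools import islice


def collect_results(search_fn, query, limit, pause=2.0):
    """Return up to `limit` results from a generator-style search function.

    `search_fn` is assumed to accept `stop` and `pause` keyword arguments.
    `pause` is the number of seconds to wait between HTTP requests.
    """
    results = search_fn(query, stop=limit, pause=pause)
    # islice guards against a search function that yields more than `limit`.
    return list(islice(results, limit))


def fake_search(query, stop=None, pause=2.0):
    """Stand-in generator for trying the helper without network access."""
    for i in range(1000):
        yield f"https://example.com/{query}/{i}"


urls = collect_results(fake_search, "sunday", limit=5, pause=0.0)
print(len(urls))
```

With the real module, the call would look like `collect_results(search, 'sunday', limit=50, pause=5.0)`. A larger pause makes the run slower but is less likely to get your IP blocked.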

This 'feature' of Google, and of other search engines, applies to the web interface too. The number of matches claimed on the first page is not reflected in the number of results actually returned.

For example, if you search Google for "Systematic Literature Review" it will claim a few million results on the first page, but if you go to page 3 (at 100 results/page) it will 'revise' the estimate to 200-300 results.

The initial estimate seems far too high, though that may be a side effect of filtering large datasets and is possibly forgivable. The number of results actually returned, however, is definitely too small for this topic. Bing and DuckDuckGo behave similarly. Google Scholar claims 11k or so results but returns a server error if you try to go beyond 1000.

Speculating: this might be intended to encourage people to refine their search terms, rather than returning a huge number of results to be filtered programmatically on the user's side (which is what I would like to do!).

I'm afraid this doesn't answer how to get more results, but it does explain that the problem lies with the search engine(s), not the library.
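
As an illustration of the local filtering mentioned above, here is a small sketch. `filter_by_domain` and the sample URLs are hypothetical; it only assumes the results arrive as a list of URL strings, and it uses the standard library's `urllib.parse`.

```python
from urllib.parse import urlparse


def filter_by_domain(urls, allowed_suffixes):
    """Keep only URLs whose host equals or ends with an allowed suffix."""
    kept = []
    for url in urls:
        # hostname is None for strings that are not absolute URLs.
        host = urlparse(url).hostname or ""
        if any(host == s or host.endswith("." + s) for s in allowed_suffixes):
            kept.append(url)
    return kept


sample = [
    "https://www.example.edu/review",
    "https://example.com/blog",
    "not a url",
]
academic = filter_by_domain(sample, ["edu"])
print(academic)
```

This kind of filter only helps once the results are in hand; it does nothing about the cap on how many results the search engine will serve.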
