繁体 English 中英

从Google搜索中抓取网址

[英]scrape urls from google search

原文 2013-07-23 16:41:33 9 1 python/ url/ screen-scraping

我正在尝试编写一个代码，以获取Google在某个单词中搜索的HTTP页面的1000个第一个URL。 我在Python中使用此代码获取了1000个第一个URL

import GoogleScraper
import urllib

urls = GoogleScraper.scrape('english teachers', number_pages=2)
for url in urls:
    print(urllib.parse.unquote(url.geturl()))

print('[!] Received %d results by asking %d pages with %d results per page' %
        (len(urls), 2, 100))`

但是此代码返回0个收到的结果。 还有另一种方法可以方便地从Google搜索中获取大量URL？ 我也尝试了xgoogle和pygoogle模块，但是它们只处理少量的页面请求即可。

1 个解决方案

Google有一个自定义搜索API ，可让您每天免费进行100个查询。 假设每页每页有10个结果，那么一天之内您几乎不能容纳1000个结果。 xgoogle和pygoogle只是此API的包装，因此我认为您无法通过使用它们获得更多结果。

如果您确实需要更多，请考虑使用另一个API密钥创建另一个Google帐户，这将使您的限额实际上翻倍。 如果您对结果稍差一点没问题，可以尝试使用Bing的Search API （它们每月提供5000个请求）。

从 Python 和 BeautifulSoup 中的搜索结果中抓取网址

[英]Scrape urls from search results in Python and BeautifulSoup

使用 Python 抓取谷歌搜索结果标题和网址

[英]Scrape google search results titles and urls using Python

从谷歌搜索页面抓取代码段文本

[英]Scrape the snippet text from google search page

从多个页面抓取网址

[英]Scrape urls from multiple pages

使用Python，如何从Google搜索中抓取链接的描述性文字？

[英]With Python, how to scrape the descriptive text of a link from a Google search?

将 div 中的跨度定位到从谷歌搜索结果中抓取

[英]targeting a span within a div to Scrape from google search results

从搜索字符串或 URL 获取 Google 搜索结果 URL

[英]Getting Google Search Result URLs from Search String or URL

如何从谷歌搜索中抓取“人们也问”框？

[英]How to scrape 'People also ask' box from Google search?

如何使用 Xpath 抓取 Google URL（包含和不包含）

[英]How to scrape Google URLs with Xpath (Contains and not contains)

抓取谷歌搜索片段结果

[英]Scrape google search snippet results

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 从 Python 和 BeautifulSoup 中的搜索结果中抓取网址使用 Python 抓取谷歌搜索结果标题和网址从谷歌搜索页面抓取代码段文本从多个页面抓取网址使用Python，如何从Google搜索中抓取链接的描述性文字？将 div 中的跨度定位到从谷歌搜索结果中抓取从搜索字符串或 URL 获取 Google 搜索结果 URL 如何从谷歌搜索中抓取“人们也问”框？如何使用 Xpath 抓取 Google URL（包含和不包含）抓取谷歌搜索片段结果

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM