簡體 English 中英

在使用請求和 beautifulsoup 抓取頁面時接受 cookies

[英]Accepting cookies while scraping page with requests and beautifulsoup

原文 2020-12-06 19:15:35 2 2 python/ web-scraping/ beautifulsoup/ python-requests

我做了一個腳本，在許多不同的頁面上跟蹤產品的價格。 問題是某些網站使用 cookies，您必須單擊接受 cookies 才能看到價格。

這可能無濟於事，但這是瑞典語的網站，所以你們中的許多人都不會理解。

如何在 web 刮擦時接受 cookies？

2 個解決方案

沒有 cookies 參與請求。 我覺得你不應該在執行 get 或 post 請求時遇到任何問題。

編輯：試試這段代碼：

r = requests.get('https://www.google.com/')

with open('test.html', 'w') as f:
    f.write(r.text)
    f.close()

在 web 瀏覽器中運行test.html文件並嘗試查看差異。 test.html是您的代碼所看到的，這與普通人在具有完整 GUI 的 web 瀏覽器中看到的不同。

當你抓取一個網站時，你不必接受那些 cookies。 但是，如果您想接受，則只需單擊網站上的“接受按鈕”即可。 您可以使用以下方法執行此操作：

點擊 Selenium

右鍵單擊網站獲取 X-Path 並檢查 cookie 按鈕。

頁面分頁/用請求抓取/ BeautifulSoup

[英]Page Pagination/Scraping with Requests/BeautifulSoup

在 python 中接受 cookies 后抓取 web 頁面

[英]Scraping web page after accepting cookies in python

使用Python / Requests / BeautifulSoup進行高效的網頁抓取

[英]Efficient web page scraping with Python/Requests/BeautifulSoup

使用Beautifulsoup和Requests刮取“ N”頁（如何獲取真實的頁碼）

[英]Scraping 'N' pages with Beautifulsoup and Requests (How to obtain the true page number)

BeautifulSoup - 抓論壇頁面

[英]BeautifulSoup - scraping a forum page

網頁抓取 Python (BeautifulSoup,Requests)

[英]Web Scraping Python (BeautifulSoup,Requests)

python 網頁抓取請求和beautifulsoup

[英]python web scraping with requests and beautifulsoup

抓取時激活按鈕進入下一頁（Python，BeautifulSoup）

[英]Activate button to get to next page while scraping (Python, BeautifulSoup)

使用BeautifulSoup抓取網站時閱讀頁碼

[英]Read the page number while scraping a website using BeautifulSoup

使用BeautifulSoup和Requests解析html頁面源時出現內存泄漏

[英]Memory Leak while parsing html page source with BeautifulSoup & Requests

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 頁面分頁/用請求抓取/ BeautifulSoup 在 python 中接受 cookies 后抓取 web 頁面使用Python / Requests / BeautifulSoup進行高效的網頁抓取使用Beautifulsoup和Requests刮取“ N”頁（如何獲取真實的頁碼） BeautifulSoup - 抓論壇頁面網頁抓取 Python (BeautifulSoup,Requests) python 網頁抓取請求和beautifulsoup 抓取時激活按鈕進入下一頁（Python，BeautifulSoup）使用BeautifulSoup抓取網站時閱讀頁碼使用BeautifulSoup和Requests解析html頁面源時出現內存泄漏

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM