How to loop through each page of a website for web scraping with BeautifulSoup

Original post: 2017-09-20 23:04:25 · Tags: python, html, web-scraping, beautifulsoup

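I am using BeautifulSoup to scrape job posting data from a website. I have working code that does what I need, but it only scrapes the first page of job postings. I'm having trouble figuring out how to iteratively update the URL so that every page gets scraped. I am new to Python and have looked at several solutions to similar questions, but I have not figured out how to apply them to my particular URL. I think I need to either update the URL on each iteration or somehow click the "Next" button, and then run my existing code on each page. I appreciate any solutions.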

URL: https://jobs.utcaerospacesystems.com/search-jobs

1 Answer

First of all, BeautifulSoup has nothing to do with fetching web pages: you download the page yourself and then hand the HTML to bs4 for parsing.
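To make that split concrete, here is a minimal sketch that keeps the download step and the parsing step in separate functions. It assumes the standard-library `urllib` for fetching (the `requests` library would work just as well); the function names `fetch_html` and `extract_link_texts`, and the choice of extracting link text, are illustrative and not taken from the question's code.

```python
from typing import List
from urllib.request import Request, urlopen

from bs4 import BeautifulSoup


def fetch_html(url: str, timeout: float = 10.0) -> str:
    """Step 1: download the raw HTML ourselves (bs4 plays no part here)."""
    request = Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urlopen(request, timeout=timeout) as response:
        # Fall back to UTF-8 when the server does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def extract_link_texts(html: str) -> List[str]:
    """Step 2: hand the already-downloaded markup to bs4 for parsing."""
    soup = BeautifulSoup(html, "html.parser")
    return [a.get_text(strip=True) for a in soup.find_all("a")]
```

Typical use would be `extract_link_texts(fetch_html(some_url))`. Because the parser only ever sees a string, it can be tried out on saved or hand-written HTML without touching the network.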

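The problem with the page you linked is that its content is generated by JavaScript, so it only renders correctly in a browser (or any other JavaScript VM). A plain HTTP fetch returns the page before the scripts have filled in the job listings.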

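@Fabricator is on the right track: you need to watch the developer console and see which AJAX request the page's JavaScript sends to the server. In this case, also look at the query string parameters. They include one called CurrentPage, which is probably the one you want to focus on.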
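The idea of stepping through pages via the CurrentPage parameter can be sketched as below. This is only an illustration under several assumptions: `AJAX_URL` is a placeholder to be replaced with the request URL actually seen in the developer console, the endpoint is assumed to return an HTML fragment, the real request may need further query parameters beyond CurrentPage, and the `h2` selector in `parse_job_titles` is hypothetical. The page-fetching function is passed in as `fetch`, so the loop itself makes no network assumptions.

```python
from typing import Callable, List
from urllib.parse import urlencode

from bs4 import BeautifulSoup

# Hypothetical endpoint: replace with the request URL seen in the developer console.
AJAX_URL = "https://example.com/search-jobs/results"


def build_page_url(page: int) -> str:
    """Build the request URL for one results page via the CurrentPage parameter."""
    return f"{AJAX_URL}?{urlencode({'CurrentPage': page})}"


def parse_job_titles(html: str) -> List[str]:
    """Per-page scraping logic goes here; the h2 selector is an assumption."""
    soup = BeautifulSoup(html, "html.parser")
    return [h2.get_text(strip=True) for h2 in soup.find_all("h2")]


def scrape_all_pages(fetch: Callable[[str], str], max_pages: int = 100) -> List[str]:
    """Request pages 1, 2, 3, ... and stop at the first page with no postings."""
    all_titles: List[str] = []
    for page in range(1, max_pages + 1):
        titles = parse_job_titles(fetch(build_page_url(page)))
        if not titles:
            break  # an empty page is taken to mean we ran past the last one
        all_titles.extend(titles)
    return all_titles
```

Stopping at the first empty page avoids needing the total page count up front, and `max_pages` is a safety cap in case the server keeps returning results. If the endpoint turns out to return JSON that wraps the HTML, the `fetch` function would need to decode it before handing the markup to the parser.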
