繁体 English 中英

python 中的 requests.get(url) 在循环中使用时表现不同

[英]requests.get(url) in python behaving differently when used in loop

原文 2020-04-15 05:52:16 9 1 python/ web-scraping/ xml-parsing/ python-requests/ html-parsing

我是 python 编程的新手，并试图抓取我的 Urls.txt 文件中可用的每个链接。 我写的代码是：

import requests
from bs4 import BeautifulSoup
from fake_useragent import UserAgent
user_agent = UserAgent()
fp = open("Urls.txt", "r")
values = fp.readlines()
fin = open("soup.html", "a")
for link in values:
    print( link )
    page = requests.get(link, headers={"user-agent": user_agent.chrome})
    html = page.content
    soup = BeautifulSoup(html, "html.parser")
    fin.write(str(soup))

当链接直接作为字符串而不是变量提供时，代码工作得非常好，但是当它按原样使用时，output 不同。

1 个解决方案

也许您从文件中读取的字符串有换行符。 要删除它，请使用link.strip("\n")

Python：requests.get，循环遍历URL

[英]Python: requests.get, iterating url in a loop

requests.get（）行为异常？

[英]requests.get() behaving erratically?

带有多个“.”的 Python requests.get() url

[英]Python requests.get() url with multiple "."

python requests.get 无效 URL

[英]python requests.get with invalid URL

Python requests.get() 循环不返回任何内容

[英]Python requests.get() loop returns nothing

Python request.get（）

[英]Python requests.get()

Python请求：requests.get（url）.json（）错误

[英]Python Requests: requests.get(url).json() Error

Python请求：requests.get在有效网址上返回404

[英]Python requests: requests.get returns 404 on valid url

Python requests.get(URL) 在使用带点的 URL 时返回 404 错误

[英]Python requests.get(URL) returns 404 error when using URL with dot

python循环request.get（）仅返回第一个循环

[英]python loop requests.get() only returns first loop

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 Python：requests.get，循环遍历URL requests.get（）行为异常？带有多个“.”的 Python requests.get() url python requests.get 无效 URL Python requests.get() 循环不返回任何内容 Python request.get（） Python请求：requests.get（url）.json（）错误 Python请求：requests.get在有效网址上返回404 Python requests.get(URL) 在使用带点的 URL 时返回 404 错误 python循环request.get（）仅返回第一个循环

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM