Beautifulsoup4 Python extracting data

Original 2017-03-17 10:48:59 3 1 python/ string/ web-scraping/ beautifulsoup

I am trying to extract an address from this site, and the HTML looks like this:

<div class="col-xs-12 col-sm-6 col-address">
<div>ul. Małachowskiego 45<br />42-500 Będzin<br />woj. śląskie</div>
</div>

So far I am using:

from bs4 import BeautifulSoup

# firma is assumed to hold the page's HTML source
soup = BeautifulSoup(firma, "lxml")
address = soup.find("div", class_="col-address")
if address:
    address_firmy = address.text

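I get: "ul. Małachowskiego 4542-500 Będzinwoj. śląskie"

So now I have two questions:

How do I put a space where the original br tags were?
How do I split the string into separate fields (in a CSV): street, postal code, town, region?

This is probably simple, but I am completely new to programming and Python.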

1 answer

In [56]: soup.div.get_text(separator=',', strip=True)
Out[56]: 'ul. Małachowskiego 45,42-500 Będzin,woj. śląskie'

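You can use separator to specify the string used to join the bits of text together.
You can use strip=True to tell Beautiful Soup to strip whitespace from the beginning and end of each bit of text.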
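
The answer above covers the first question only. A minimal sketch of both steps follows; it is not the answerer's code. It assumes the address always has exactly three parts separated by br tags and that the postal code has the Polish NN-NNN form. The names HTML, parse_address and write_csv are made up for illustration, and the built-in "html.parser" is used instead of "lxml" so that nothing beyond bs4 needs to be installed.

```python
import csv
import io
import re

from bs4 import BeautifulSoup

# Sample markup copied from the question.
HTML = """
<div class="col-xs-12 col-sm-6 col-address">
<div>ul. Małachowskiego 45<br />42-500 Będzin<br />woj. śląskie</div>
</div>
"""

# Assumption: postal codes look like NN-NNN and the rest of the line is the town.
POSTCODE_TOWN = re.compile(r"^(\d{2}-\d{3})\s+(.+)$")


def parse_address(html):
    """Return (street, postal_code, town, region), or None if the layout differs."""
    soup = BeautifulSoup(html, "html.parser")
    outer = soup.find("div", class_="col-address")
    if outer is None:
        return None
    # stripped_strings yields every non-empty text fragment, already stripped,
    # so each <br /> acts as a boundary between fields.
    parts = list(outer.stripped_strings)
    if len(parts) != 3:
        return None
    street, postcode_town, region = parts
    match = POSTCODE_TOWN.match(postcode_town)
    if match is None:
        return None
    return street, match.group(1), match.group(2), region


def write_csv(rows, stream):
    """Write a header line followed by one CSV line per address tuple."""
    writer = csv.writer(stream)
    writer.writerow(["street", "postal_code", "town", "region"])
    writer.writerows(rows)


# Question 1: a space where the br tags were.
soup = BeautifulSoup(HTML, "html.parser")
one_line = soup.find("div", class_="col-address").get_text(separator=" ", strip=True)
print(one_line)

# Question 2: separate fields, written as CSV (to an in-memory buffer here).
buffer = io.StringIO()
write_csv([parse_address(HTML)], buffer)
print(buffer.getvalue())
```

To write a real file instead of the in-memory buffer, pass a file opened with open(path, "w", newline="", encoding="utf-8") to write_csv. Returning None for pages that do not match the expected layout keeps a single odd page from stopping a longer scraping run, although the caller then has to filter those out before writing.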

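Notice: the technical posts on this site are licensed under CC BY-SA 4.0. If you repost them, please cite this site's URL or the original address. For any questions, contact: yoyou2525@163.com.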