使用BeautifulSoup从html文件中提取特定文本

Question

I have a code like given below. 我有下面给出的代码。 I am using BeautifulSoup to scrape text in class = 'product'. 我正在使用BeautifulSoup刮擦class ='product'中的文本。 But I wanted only 2nd and 4th value(ie. 'Product 2' and 'Product 4') in my extracted csv file. 但是我提取的csv文件中只需要第二和第四值（即“产品2”和“产品4”）。 As of now I only know to extract all the values(ie. 'Product 1' 'Product 2' 'Product 3' 'Product 4'). 到目前为止，我只知道提取所有值（即“产品1”，“产品2”，“产品3”，“产品4”）。

 <body> <div class="product">Product 1</div> <div class="product">Product 2</div> <div class="product">Product 3</div> <div class="product">Product 4</div> </body>

Answer 1

find_all returns a list, so use indexes to get the desired elements find_all返回一个列表，因此使用索引来获取所需的元素

result = data_soup.find_all(attrs={"class": "product"})
print(result[1], result[3])

使用BeautifulSoup从html文件中提取特定文本

问题描述

1 个解决方案

解决方案1
2 2018-09-15 10:22:55

使用BeautifulSoup从html文件中提取特定文本

问题描述

1 个解决方案

解决方案1 2 2018-09-15 10:22:55

解决方案1
2 2018-09-15 10:22:55