如何使用scrapy刮掉没有类的html标签并更改id？

Question

I want to scrape the sold price of property fromhttp://house.speakingsame.com/p.php?q=Brisbane+City&sta=qld我想从http://house.speakingsame.com/p.php?q=Brisbane+City&sta=qld获取房产的售价

It has no formatting, no classes and the content is represented in tables.它没有格式，没有类，内容以表格形式表示。

what should I do in this case?在这种情况下我该怎么办？ Each table represents each property.每个表代表每个属性。 I need the sold price for each property and hence each table.我需要每处房产的售价，因此需要每张桌子的售价。

response.css('tbody').getall() returns nothing at all. response.css('tbody').getall()什么都不返回。

Answer 1

using xpath you could use:使用 xpath 你可以使用：

for element in response.xpath("//table//table//table"):
    sold = element.xpath(".//b")[0].xpath("./text()").get()
    print(sold)
    date = element.xpath(".//td")[0].xpath("./text()").get()

output:输出：

Sold $640,000
Sold $640,000
Sold $320,000
Sold $320,000
Sold $145,000
Sold $145,000
Sold $145,000
Sold $145,000
Sold $239,000
Sold $239,000
Sold $695,000
Sold $695,000
Sold $740,000
Sold $740,000
Sold $375,000
Sold $375,000
Sold $390,000
Sold $390,000
Sold $695,000
Sold $695,000

如何使用scrapy刮掉没有类的html标签并更改id？

问题描述

1 个解决方案

解决方案1
1 已采纳 2022-07-10 20:40:16

如何使用scrapy刮掉没有类的html标签并更改id？

问题描述

1 个解决方案

解决方案1 1 已采纳 2022-07-10 20:40:16

解决方案1
1 已采纳 2022-07-10 20:40:16