Scraping text after ::before (xpath, css)

Question

I have built a Scrapy spider and want to get the email text from the following markup:

::before "E-Mail" "E-Mail I would like to scrape"

I tried: 'email': response.css('#content > div.segment.morecontact.clearfix > div > div.secondary > ul > li:nth-child(1) > a > i::text').extract(), but I only get "E-Mail" rather than the actual address.

Answer 1

The address is not inside the i element; it is the text node that follows it. You need a simple XPath using the following-sibling axis:

email = response.xpath('//i[contains(@class, "icon_email")]/following-sibling::text()[1]').get()

Alternatively, you can take the email from the href attribute: email = response.xpath('//a[i[contains(@class, "icon_email")]]/@href').re_first(r'mailto:(.+)')

Solution 1
1 2019-11-15 11:30:46
