如何使用 scrapy 从 html 标签中提取数据

Question

I need to extract address information from this HTML code.我需要从这个 HTML 代码中提取地址信息。

     <span>
        <span class="icon"> <i class="fas fa-building"></i> </span> 8  Phạm Hùng
         Cau Giay
         Ha Noi
     </span>

How can I get that information.我怎样才能得到这些信息。 If I do something like如果我做类似的事情

response.css('div.company-info__location').get()

I got back我回来了

<div class="company-info__location">      <span>\n        <span class="icon"> <i class="fas fa-building"></i> </span> 8  Phạm Hùng\nCau Giay\nHa Noi\n 
     </span>\n    </div>

Or或者

response.css('div.company-info__location::text').get()

It only return space.它只返回空间。 Not exactly what I want不完全是我想要的

Answer 1

You can try string() XPath expression:你可以试试string() XPath 表达式：

response.xpath('string(//div[@class="info__location"])').get()

如何使用 scrapy 从 html 标签中提取数据

问题描述

1 个解决方案

解决方案1
0 已采纳 2020-05-12 14:34:42

如何使用 scrapy 从 html 标签中提取数据

问题描述

1 个解决方案

解决方案1 0 已采纳 2020-05-12 14:34:42

解决方案1
0 已采纳 2020-05-12 14:34:42