Using Scrapy Python not able to extract data from response html with xpath due to namespace

Question

I am using scrapy with xpath to extract data from a webpage. My html response looks like this,

I want to extract the href link present in the highlighted "a" tag .

Usually I use response.xpath('//a[@id="jr-alt-sw"]/@href') to get the data, but here I think due to the namespace problem the result is empty. How can I get the data if namespace is present.

Any help is appreciated!!

Answer 1

Is that true about namespace? Another reason to use css instead:

response.css('a#jr-alt-sw::attr(href)')

Answer 2

此处选择的a标签没有可用的href属性，请查看下a包含href属性的a标签。

response.xpath('//a[@id="jr-pdf-sw"]/@href')

Using Scrapy Python not able to extract data from response html with xpath due to namespace

Question

2 answers

solution1
0 2020-03-18 23:41:29

solution2
0 2020-03-19 02:33:17

Using Scrapy Python not able to extract data from response html with xpath due to namespace

Question

2 answers

solution1 0 2020-03-18 23:41:29

solution2 0 2020-03-19 02:33:17

solution1
0 2020-03-18 23:41:29

solution2
0 2020-03-19 02:33:17