How to extract the part of a URL between two different characters using Scrapy

Question

I have a URL with the following structure:

https://example.com/string?rest

I am trying to extract only the 'string' part, and so far I can only think of using

response.url.split('/')[3]

to extract everything after the third '/'. Is it possible to extract the part after the third '/' and before the '?' sign?

Answer 1

You might be able to use something in urllib.parse:

from urllib.parse import urlparse
print(urlparse(response.url).path.split('/')[-1])
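
Note that split('/')[-1] returns the last path segment, which matches the example URL only because its path has a single segment. As a minimal sketch of the "after the third '/' and before the '?'" behaviour the question asks for, the first path segment can be taken instead. The helper name first_path_segment is hypothetical, and a plain string stands in for response.url:

```python
from urllib.parse import urlparse


def first_path_segment(url):
    """Return the first path segment of url, ignoring the query string.

    Hypothetical helper for illustration; not part of Scrapy or urllib.
    """
    # urlparse separates the path from the query, so '?rest' never
    # reaches the split below.
    path = urlparse(url).path
    # Drop empty pieces produced by the leading (or a trailing) slash.
    segments = [segment for segment in path.split('/') if segment]
    return segments[0] if segments else ''


print(first_path_segment('https://example.com/string?rest'))  # string
```

Inside a Scrapy callback, the same call would presumably be first_path_segment(response.url), since response.url is a plain string.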

Answer 1 (2), 2020-07-21 19:58:10
