如何使用lxml xpath和python中的請求在文本中提取href

Question

首先，我是python的新手。 我需要從網頁中的文本中提取一個鏈接，我將lxml與Python 3.5結合使用，但我無法弄清楚。 這是我到目前為止的內容：

url = someUrl
page = requests.get(url)
webpage = html.fromstring(page.content)
fulllinks = webpage.xpath('//a/@href')
fulltext = webpage.xpath('//a/text()')


for line in fulltext:
    if line.startswith("SomethingHere"):
    'get the link from SomethingHere and do other stuff'

其中"somethingHere"是文本，我想要該文本的鏈接（例如www.someweb.com.br/trends ）。

我有點迷路了。 提前致謝。

Answer 1

得到了我想要的東西。 答案是：

webpage.xpath("//a[starts-with(text(),'SomethingHere')]/@href")

不管怎么說，還是要謝謝你。

如何使用lxml xpath和python中的請求在文本中提取href

問題描述

1 個解決方案

解決方案1
0 2017-03-16 14:07:18

如何使用lxml xpath和python中的請求在文本中提取href

問題描述

1 個解決方案

解決方案1 0 2017-03-16 14:07:18

解決方案1
0 2017-03-16 14:07:18