python lxml xpath獲取具有特定字符串模式的節點屬性

Question

我正在學習xpath並嘗試使用python lxml / html獲取具有特定節點屬性的節點的值，例如（google playstore）。 從下面的代碼中，我想從節點“ a”獲得以“ mailto：”開頭的屬性“ href”的開發人員電子郵件值。 我的python代碼段返回了應用名稱，但開發人員電子郵件為空。 謝謝

<html>
<div class="id-app-title" tabindex="0">Candy Crush Saga</div>
<div class="meta-info meta-info-wide"> 
<div class="title"> Developer </div> 
<a class="dev-link" href="https://www.google.com/url?q=http://candycrush.com" rel="nofollow" target="_blank"> Visit website </a>
<a class="dev-link" href="mailto:candycrush@kingping.com"
rel="nofollow" target="_blank">candycrush@kingping.com </a> ##Interesting part here
</div>
</html>

Python代碼（2.7）

 def get_app_from_link(self,link):
    start_page=requests.get(link)
    #print start_page.text
    tree = html.fromstring(start_page.text)
    name = tree.xpath('//div[@class="id-app-title"]/text()')[0]
    #developer=tree.xpath('//div[@class="dev-link"]//*/div/@href')
    developer=tree.xpath('//div[contains(@href,"mailto") and @class="dev-link"]/text()')
    print name,developer
    return

Answer 1

現在，您使用的標簽div ，而不是a ：

'//a[contains(@href,"mailto") and @class="dev-link"]/text()'

另外，您的函數不會返回任何項目。 使用return這樣的：

def get_app_from_link(self,link)::
    # your code
    return name, developer

python lxml xpath獲取具有特定字符串模式的節點屬性

問題描述

Python代碼（2.7）

1 個解決方案

解決方案1
1 已采納 2016-04-22 03:40:48

python lxml xpath獲取具有特定字符串模式的節點屬性

問題描述

Python代碼（2.7）

1 個解決方案

解決方案1 1 已采納 2016-04-22 03:40:48

解決方案1
1 已采納 2016-04-22 03:40:48