繁体   English   中英

lxml xpath没有找到锚文本

[英]lxml xpath not finding anchor text

我有两个xpath,只有一个正确从下面的URL中提取作业标题。 知道为什么xpath1(我发现使用Chrome的'inspect element / copy XPath'功能)不起作用,而xpath2呢?

import requests
from lxml import html

url = 'http://www.mynextmove.org/find/browse?c=54'

xpath1 = '//*[@id="content"]/table[1]/tbody/tr/td[1]/a/text()'
xpath2 = '//a[contains(@href, "profile")]/text()'

page = requests.get(url)
tree = html.fromstring(page.text)

jobs = tree.xpath(xpath2)
print 'jobs:', jobs

xpath1返回空列表[]。

xpath2返回['人类学家','考古学家',...]

没有tbody它看起来像它更改为:

`xpath1 = '//*[@id="content"]/table[1]/tr/td[1]/a/text()'`

并尝试一下。

这就是我这样做的结果:

In [31]: tree.xpath(xpath1)
Out[31]:
['Anthropologists',
 'Archeologists',
 'Architects',
 'Architectural Drafters',
 'Biochemists & Biophysicists',
 'Civil Drafters',
 'Civil Engineers',
 'Environmental Engineering Technicians',
 'Environmental Engineers',
 'Geodetic Surveyors',
 'Lawyers',
 'Legal Secretaries',
 'Mapping Technicians',
 'Marine Architects',
 'Marine Engineers',
 'Paralegals & Legal Assistants',
 'Survey Researchers',
 'Surveying Technicians',
 'Surveyors',
 'Tax Preparers',
 'Transportation Engineers',
 'Veterinarians',
 'Veterinary Assistants & Laboratory Animal Caretakers',
 'Veterinary Technologists & Technicians',
 'Water/Wastewater Engineers']

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM