xpath query on a html table always returns empty string in python

Question

here's the python code of what i've tried:

from lxml import html
import requests


page = requests.get('http://www.rsssf.com/tablese/eng2017det.html')
tree = html.fromstring(page.content)
print(tree.xpath('/html/body/table/tbody/tr[2]//text()'))

I'm always getting my output as [] I have also checked the html page, the URL isn't broken

Answer 1

Do not use tbody tag in your XPath. Note that developer might skip this tag, so it will be added automatically by browser while page rendering.

Simply try

print(tree.xpath('/html/body/table//tr[2]//text()'))

or

print([i for i in tree.xpath('/html/body/table//tr[2]//text()') if i.strip()])

to avoid printing new line characters

xpath query on a html table always returns empty string in python

Question

1 answers

solution1
1 ACCPTED 2018-10-02 12:35:21

xpath query on a html table always returns empty string in python

Question

1 answers

solution1 1 ACCPTED 2018-10-02 12:35:21

solution1
1 ACCPTED 2018-10-02 12:35:21