有什么办法可以进入 <option>使用lxml.html解析表单时输入文本？

Question

I'm trying parse a html form that looks like that: 我正在尝试解析如下所示的html表单：

<select name="country">
<option value="1">Afghanistan</option>
<option value="2">Albania</option>
<option value="3">Algeria</option>
<option value="4">Andorra</option>
....
</select>

After I parse the document using lxml.html.parse, I can access the list of values using: 使用lxml.html.parse解析文档之后，可以使用以下方法访问值列表：

doc.forms[0].elements["country"].value_options

However, this returns a list of raw values (['1', '2', '3', '4' ...]) without the corresponding country names. 但是，这将返回原始值的列表（['1'，'2'，'3'，'4'...]），而没有相应的国家/地区名称。 Is there an easy way to get the contents of the option tag including both text and values? 是否有一种简单的方法来获取选项标签的内容，包括文本和值？

Answer 1

I use xpath to get go through html... try: 我使用xpath通过html ...尝试：

options = doc.xpath("//select[@name='country']/option")
option_text = [option.text for option in options]

有什么办法可以进入 <option>使用lxml.html解析表单时输入文本？

问题描述

1 个解决方案

解决方案1
1 已采纳 2012-08-20 10:50:27

有什么办法可以进入 <option>使用lxml.html解析表单时输入文本？

问题描述

1 个解决方案

解决方案1 1 已采纳 2012-08-20 10:50:27

解决方案1
1 已采纳 2012-08-20 10:50:27