用美丽的汤提取href

Question

I use this code to get acces to my link : 我使用此代码来访问我的链接：

links = soup.find("span", { "class" : "hsmall" })
links.findNextSiblings('a')
for link in links:
  print link['href']
  print link.string

Link have no ID or class or whatever, it's just a classic link with a href attribute. 链接没有ID或类或其他什么，它只是一个带有href属性的经典链接。

The response of my script is : 我的脚本的响应是：

print link['href']
TypeError: string indices must be integers

Can you help me to get href value ? 你能帮助我获得href价值吗？ Thx ! 谢谢！

Answer 1

Links is still referring to your soup.find. 链接仍指你的汤。发现。 So you could do something like: 所以你可以这样做：

links = soup.find("span", { "class" : "hsmall" }).findNextSiblings('a')
for link in links:
    print link['href']
    print link.string

Answer 2

Okay, it works now with following code : 好的，它现在可以使用以下代码：

linkSpan = soup.find("span", { "class" : "hsmall" })
link = [tag.attrMap['href'] for tag in linkSpan.findAll('a', {'href': True})]
for lien in link:
  print "LINK = " + lien`

用美丽的汤提取href

问题描述

2 个解决方案

解决方案1
9 2011-08-25 00:49:46

解决方案2
4 已采纳 2011-08-25 12:44:29

用美丽的汤提取href

问题描述

2 个解决方案

解决方案1 9 2011-08-25 00:49:46

解决方案2 4 已采纳 2011-08-25 12:44:29

解决方案1
9 2011-08-25 00:49:46

解决方案2
4 已采纳 2011-08-25 12:44:29