解析html漂亮汤

Question

I have a html page 我有一个html页面

<a email="corporate@max.ru" href="http://www.max.ru/agent?message&to=corporate@max.ru" title="Click herе" class="mf_spIco spr-mrim-9"></a><a class="mf_t11" type="booster" href="http://max.ru/mail/corporate/">

I neeed a parse email string 我需要解析电子邮件字符串

    soup = BeautifulSoup(data
    string = soup.find("a",{"email": ""})
    print string

But it not working. 但它不起作用。 Where mistake? 哪里有错？

Answer 1

Your mistake was in using the attrs dict to look for elements with an email attribute that is empty. 您的错误在于使用attrs字典查找电子邮件属性为空的元素。 Try this instead. 试试这个吧。

#!/usr/bin/env python

from BeautifulSoup import BeautifulSoup
import urllib2

req = urllib2.urlopen('http://worldnuclearwar.ru')

soup = BeautifulSoup(req)
print soup.find("a", email=True)["email"]

To print the email attribute of the first a element which has an email attribute. 要打印email的第一个属性a它有一个元素email属性。 If you want all emails, try 如果您需要所有电子邮件，请尝试

for link in soup.findAll("a", email=True):
    print link["email"]

解析html漂亮汤

问题描述

1 个解决方案

解决方案1
4 2010-10-02 18:38:52

解析html漂亮汤

问题描述

1 个解决方案

解决方案1 4 2010-10-02 18:38:52

解决方案1
4 2010-10-02 18:38:52