繁体 English 中英

如何将Beautiful汤和lxml Parser结合使用以在网站中找到关键字？

[英]How can I use Beautiful soup in combination with lxml Parser to find a keyword in a website?

原文 2014-04-08 21:17:04 4 1 python/ html-parsing/ beautifulsoup/ lxml

def main():


  openurl = urllib2.urlopen("http://www.pythonforbeginners.com")
    content = openurl.read()
    code = openurl.code

    soup = BeautifulSoup(content) #I think I need to change something here!!
    print soup
    if soup.body.find(text=re.compile('python', re.IGNORECASE)):
      print "i think it's working"
    openurl.close()

如何修改此代码，以允许我将lxml解析器与Beautiful Soup结合使用以在网站正文中找到关键字？ 请注意，上面的代码有效，但是它没有使用我想要的解析器。

1 个解决方案

要将lxml用作解析器，请提供“ lxml”作为第二个参数。

soup = BeautifulSoup(content, 'lxml')

美丽的汤解析器找不到链接

[英]beautiful soup parser can't find links

没有像Beautiful Soup这样的解析器，如何在Selenium中找到特定但晦涩的元素？

[英]How can I find specific but obscure elements in Selenium, without a parser like Beautiful Soup?

我将如何使用美丽的汤从这个网站上抓取数据？

[英]How would I use beautiful soup to webscrape data from this website?

如何让 Beautiful Soup 显示更多网站内容？

[英]How can I make Beautiful Soup show more of a website?

如何使用 Python 的 Beautiful Soup 来查找自定义属性的值？

[英]How can I use Python's Beautiful Soup to find the value of a custom attribute?

我怎样才能用美丽的汤从这个网页上刮掉这个符号？

[英]How can I use beautiful soup to scrape the symbol from this webpage?

如何使用“美丽汤”解析面糊的名称？

[英]How can I use Beautiful Soup to parse the batter's names?

如何使用Beautiful Soup 4来查找属性

[英]How to use Beautiful Soup 4 to find attribute

如何使用 Beautiful Soup 从 html 页面中找到每个链接作为字符串？（ findAll function 没有找到适合这个网站的地方）

[英]how can I find each link as a string from html page with Beautiful Soup ? ( findAll function is not finding well for this website)

如何使用Python，请求和漂亮的汤查找与关键字关联的链接

[英]How to Find Link Associated with Keyword using Python, Requests, and Beautiful soup

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 美丽的汤解析器找不到链接没有像Beautiful Soup这样的解析器，如何在Selenium中找到特定但晦涩的元素？我将如何使用美丽的汤从这个网站上抓取数据？如何让 Beautiful Soup 显示更多网站内容？如何使用 Python 的 Beautiful Soup 来查找自定义属性的值？我怎样才能用美丽的汤从这个网页上刮掉这个符号？如何使用“美丽汤”解析面糊的名称？如何使用Beautiful Soup 4来查找属性如何使用 Beautiful Soup 从 html 页面中找到每个链接作为字符串？（ findAll function 没有找到适合这个网站的地方）如何使用Python，请求和漂亮的汤查找与关键字关联的链接

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM