简体   繁体   English

如何使用html文件中的lxml在python中提取段落文本?

[英]How to extract paragraph text in python using lxml from html file?

I am trying to extract the paragraph but getting [<Element p at 0x7f8c81a26548>] instead of the paragraph. 我正在尝试提取该段落,但得到[<Element p at 0x7f8c81a26548>]而不是该段落。 How can I extract the paragraph? 如何提取该段落?

 Selector_1 = "div.bloco-imovel-texto p" tree.cssselect(Selector_1) 
 <div class="bloco-imovel-texto"> <h3 class="lbl_description"> Description </h3> <p>At vero eos et accusamus et iusto odio dignissimos ducimus qui blanditiis praesentium voluptatum deleniti atque corrupti quos dolores et quas molestias excepturi sint occaecati cupiditate non provident, similique sunt in culpa qui officia deserunt mollitia animi, id est laborum et dolorum fugaEt harum quidem rerum facilis est et expedita distinctio.Nam libero tempore, cum soluta nobis est eligendi optio cumque nihil impedit quo minus id quod maxime placeat facere possimus, omnis voluptas assumenda est, omnis dolor repellendus.</p> </div> 

一定是

tree.cssselect(Selector_1)[0].text

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM