
Is there a way of deleting the tables inside a PDF using Python?

So far I've been using pdfplumber to parse the text within the PDF, and I'd like to know whether there is a way to ignore or delete the text inside the tables.

I would recommend the Python package Camelot. It is very efficient at extracting tabular data from PDFs. See the Camelot documentation for details.
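If the goal is to skip the table text rather than extract it, one possible approach is to stay with pdfplumber: detect each table's bounding box and filter out every object whose centre falls inside one. The following is only a minimal sketch under that assumption. The helper names `center_in_bbox` and `text_outside_tables` are made up for illustration, and table detection relies on pdfplumber's default settings, which may miss tables without ruling lines.

```python
def center_in_bbox(obj, bbox):
    """Return True if the centre of a pdfplumber object lies inside bbox.

    bbox is (x0, top, x1, bottom), the layout pdfplumber uses for Table.bbox.
    """
    x0, top, x1, bottom = bbox
    h_mid = (obj["x0"] + obj["x1"]) / 2
    v_mid = (obj["top"] + obj["bottom"]) / 2
    return x0 <= h_mid < x1 and top <= v_mid < bottom


def text_outside_tables(pdf_path):
    """Return one string per page, with text inside detected tables left out."""
    # Imported here so the helper above can be used without pdfplumber installed.
    import pdfplumber  # third-party: pip install pdfplumber

    pages_text = []
    with pdfplumber.open(pdf_path) as pdf:
        for page in pdf.pages:
            # Bounding boxes of the tables found with the default settings.
            table_bboxes = [table.bbox for table in page.find_tables()]
            # Keep only the objects whose centre is outside every table.
            filtered = page.filter(
                lambda obj: not any(center_in_bbox(obj, b) for b in table_bboxes)
            )
            pages_text.append(filtered.extract_text() or "")
    return pages_text
```

Calling `text_outside_tables("report.pdf")` (the file name is a placeholder) would give the remaining text page by page. Testing the centre of each object, rather than requiring it to sit fully inside the box, is a design choice that tolerates characters slightly overlapping a table border.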
