简体繁体中英

PDF data into XML format using Python

原文 2020-03-15 17:35:25 3 1 python/ python-3.x/ deep-learning/ neural-network

Extracting text from PDF files in python can be done using python different packages , but i am looking deep learning solution? How deep learning can be used to extract the text in format xml ? Heard a lot of times deep learning can be used ? Can any one have any used case and explain the process?

1 answers

pdf解析的问题不是输出而是页面分析的过程。所以如果你分析页面，你可以以你想要的任何格式输出结果（这应该是最简单的部分）。我建议阅读源代码pdfminer 女巫我认为是最复杂的，所以你可以开始学习如何开始，这样你就可以解析 pdf。至于深度学习，我认为这会很复杂，但是它的应用程序是 pdf 文件中最难的问题是管理文本方向、行间距、垂直或横向、字边距等。如果您开始一个项目并始终记住 PDF 是邪恶的，那么祝您好运。

Convert pdf data to JSON format using Python?

Import XML data into a PDF form using Python

is there any possible way to export xml data from a pdf using python

Format this data as XML or JSON with Python

Python - Parsing PDF Data into a Table Format

Data format using python?

How to convert the extracted text from PDF to JSON or XML format in Python?

Convert HTML to PDF using Python with format PDF/X-1a

Pdf to XML/json using Python module

How to convert a pdf to XML using python code?

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Convert pdf data to JSON format using Python? Import XML data into a PDF form using Python is there any possible way to export xml data from a pdf using python Format this data as XML or JSON with Python Python - Parsing PDF Data into a Table Format Data format using python? How to convert the extracted text from PDF to JSON or XML format in Python? Convert HTML to PDF using Python with format PDF/X-1a Pdf to XML/json using Python module How to convert a pdf to XML using python code?

Related Tags

PDF data into XML format using Python

Question

1 answers

solution1 0 2020-03-15 17:54:49

solution1
0 2020-03-15 17:54:49