
Convert PDF into Word doc file

How can I convert a PDF into a Word doc file?

The PDF file was generated by JasperReports and contains one table, in which one column holds text with an HTML body fragment such as <p><b>test</b></p>

So I just want to convert this PDF file to a doc with proper formatting, for example with that text displayed in bold.

Programmatically, you can do it with Apache POI. You can first read the PDF and then write it to a Word doc using the API. Note that POI itself does not read PDF files, so the reading step needs a separate PDF library such as Apache PDFBox.
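For the bold formatting mentioned in the question, here is a minimal sketch of one piece of the write step: splitting a simple HTML fragment such as <p><b>test</b></p> into text runs that are either bold or plain. The class name `BoldRunSplitter` and its `Run` type are hypothetical, made up for this illustration, and the code uses only the Java standard library. It assumes the cell text has already been extracted from the PDF, treats only <b> and <strong> as bold, ignores every other tag, and does not decode HTML entities.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper: splits a simple HTML fragment into text runs with a bold flag. */
final class BoldRunSplitter {

    /** One piece of text plus whether it sat inside <b> or <strong>. */
    static final class Run {
        final String text;
        final boolean bold;

        Run(String text, boolean bold) {
            this.text = text;
            this.bold = bold;
        }

        @Override
        public String toString() {
            return (bold ? "[bold] " : "[plain] ") + text;
        }
    }

    // Matches opening and closing tags; group 1 is "/" for a closing tag, group 2 is the tag name.
    private static final Pattern TAG = Pattern.compile("<(/?)([A-Za-z][A-Za-z0-9]*)[^>]*>");

    /** Returns the non-empty text pieces of the fragment in order, each flagged bold or not. */
    static List<Run> split(String html) {
        List<Run> runs = new ArrayList<>();
        Matcher m = TAG.matcher(html);
        int boldDepth = 0; // how many <b>/<strong> elements are currently open
        int pos = 0;       // end of the previous tag
        while (m.find()) {
            addRun(runs, html.substring(pos, m.start()), boldDepth > 0);
            String name = m.group(2).toLowerCase(Locale.ROOT);
            if (name.equals("b") || name.equals("strong")) {
                boolean closing = !m.group(1).isEmpty();
                boldDepth = closing ? Math.max(0, boldDepth - 1) : boldDepth + 1;
            }
            pos = m.end();
        }
        // Text after the last tag, if any.
        addRun(runs, html.substring(pos), boldDepth > 0);
        return runs;
    }

    private static void addRun(List<Run> runs, String text, boolean bold) {
        if (!text.isEmpty()) {
            runs.add(new Run(text, bold));
        }
    }

    public static void main(String[] args) {
        for (Run run : split("<p><b>test</b> value</p>")) {
            System.out.println(run);
        }
    }
}
```

For the input "<p><b>test</b> value</p>" this gives two runs: "test" flagged bold and " value" flagged plain. With Apache POI, each run could then be written by calling createRun() on an XWPFParagraph and then setText(run.text) and setBold(run.bold) on the resulting XWPFRun. That wiring is left out here so the sketch stays free of external dependencies.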

Much of the formatting information is removed when a file is converted to PDF, so you cannot simply convert it back unless the PDF was created with marked content, that is, with additional meta tags in it.

I wrote a blog article explaining PDF text at http://www.jpedal.org/PDFblog/2009/04/pdf-text/
