简体繁体中英

Best API for reading a huge .pdf file from java

原文 2011-02-09 04:59:04 2 2 java/ sql/ api/ pdf

I have a huge pdf file (20 mb/800 pages) which contains some information.

It has got index with hyperlinks. Also most of the remaining information is in Tabular format (in pdf). I need to retrieve this information using Java and store it in SQL Server.

Which is the best API available to read this kind of file from Java?

2 answers

It is unlikely to be in tabular format inside the PDF as PDF does not contain structure information unless explicitly added at creation time. I wrote an article explaining some of the issues with text extraction from at PDF at http://www.jpedal.org/PDFblog/2009/04/pdf-text/

Have you tried iText :

Reading huge file in Java

Reading bytes from compressed PDF file in Java

Java - OutofMemoryError while reading a huge csv file

Reading a huge csv file and converting to JSON with Java 8

Which API in Java to use for file reading to have best performance?

Best way to read huge file in MB in java

Reading huge line of string from text file

Reading PDF in java as a file and making “PDF” editable

Java: Reading a pdf file from URL into Byte array/ByteBuffer in an applet

What is the best way of reading configuration parameters from configuration file in Java?

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Reading huge file in Java Reading bytes from compressed PDF file in Java Java - OutofMemoryError while reading a huge csv file Reading a huge csv file and converting to JSON with Java 8 Which API in Java to use for file reading to have best performance? Best way to read huge file in MB in java Reading huge line of string from text file Reading PDF in java as a file and making “PDF” editable Java: Reading a pdf file from URL into Byte array/ByteBuffer in an applet What is the best way of reading configuration parameters from configuration file in Java?

Related Tags

Best API for reading a huge .pdf file from java

Question

2 answers

solution1
2 ACCPTED 2011-02-09 08:28:07

solution2
1 2011-02-09 05:12:13

Best API for reading a huge .pdf file from java

Question

2 answers

solution1 2 ACCPTED 2011-02-09 08:28:07

solution2 1 2011-02-09 05:12:13

solution1
2 ACCPTED 2011-02-09 08:28:07

solution2
1 2011-02-09 05:12:13